R0024/2026-03-25/Q004/H2

Research R0024 — Sycophancy and Addiction
Run 2026-03-25
Query Q004
Hypothesis H2

Statement

No, no meaningful commitments or published sycophancy reduction metrics exist from AI companies.

Status

Current: Eliminated

Anthropic has published specific before/after metrics (a 70-85% reduction in sycophancy) and open-sourced an evaluation tool. Google has claimed measurable reductions in Gemini 3. The commitments are inconsistent and lack standardization, but they are not absent.

Supporting Evidence

No evidence supports the claim that metrics are entirely absent.

Contradicting Evidence

Evidence | Summary
SRC01-E01 | Anthropic published 70-85% reduction metrics and open-sourced Petri
SRC02-E01 | OpenAI published incident analysis and promised improvement process

Reasoning

H2 is eliminated because metrics do exist, even if they are limited and lack standardization.

Relationship to Other Hypotheses

H2 is the null hypothesis and is eliminated. The remaining question is whether the existing metrics represent strong commitments (H1) or limited, inconsistent efforts (H3).