R0024/2026-03-25/Q004/H2
Statement
No. AI companies have made no meaningful commitments and have published no sycophancy reduction metrics.
Status
Current: Eliminated
Anthropic has published specific before/after metrics (70-85% reduction) and open-sourced an evaluation tool. Google has claimed measurable reductions in Gemini 3. While the commitments are inconsistent and lack standardization, they are not absent.
Supporting Evidence
No evidence supports the claim that metrics are entirely absent.
Contradicting Evidence
| Evidence | Summary |
|---|---|
| SRC01-E01 | Anthropic published 70-85% reduction metrics and open-sourced Petri |
| SRC02-E01 | OpenAI published incident analysis and promised improvement process |
Reasoning
H2 is eliminated because published metrics and commitments do exist (SRC01-E01, SRC02-E01), even if they are limited and lack standardization.
Relationship to Other Hypotheses
H2 is the null hypothesis and is eliminated. The remaining question is whether the existing metrics represent strong commitments (H1) or limited, inconsistent efforts (H3).