R0042/2026-04-01/Q003/SRC01
Anthropic blog post on user wellbeing protection and anti-sycophancy design
Source
Summary
| Dimension |
Rating |
| Reliability |
High |
| Relevance |
High |
| Bias: Missing data |
Low risk |
| Bias: Measurement |
Some concerns |
| Bias: Selective reporting |
Some concerns |
| Bias: Randomization |
N/A — not an RCT |
| Bias: Protocol deviation |
N/A — not an RCT |
| Bias: COI/Funding |
Some concerns |
Rationale
| Dimension |
Rationale |
| Reliability |
Primary source from the organization doing the work; Anthropic has published extensively on this topic with specific metrics |
| Relevance |
The most documented example of anti-sycophancy as an explicit design goal — but as a model developer, not an enterprise deployer |
| Bias flags |
Self-reported metrics; Anthropic has commercial interest in positioning Claude as less sycophantic than competitors. Some concerns about selective metric reporting (Georgetown Law raises this point). However, the Petri tool is open-source, enabling independent verification. |
| Evidence ID |
Summary |
| SRC01-E01 |
Anthropic anti-sycophancy program: evaluation methodology, metrics, design trade-offs |