R0041/2026-03-28/Q001/S05
WebSearch — Anthropic Claude sycophancy reduction Petri evaluation
Summary
| Field | Value |
|---|---|
| Source/Database | WebSearch |
| Query terms | Anthropic Claude sycophancy reduction research Petri evaluation 2025 2026 |
| Filters | None |
| Results returned | 10 |
| Results selected | 2 |
| Results rejected | 8 |
Selected Results
| Result | Title | URL | Rationale |
|---|---|---|---|
| S05-R01 | Petri: An open-source auditing tool to accelerate AI safety research | https://alignment.anthropic.com/2025/petri/ | Primary source on Petri tool and sycophancy evaluation |
| S05-R02 | Towards Understanding Sycophancy in Language Models | https://arxiv.org/pdf/2310.13548 | Foundational Anthropic research paper on sycophancy |
Rejected Results
| Result | Title | URL | Rationale |
|---|---|---|---|
| S05-R03 | Anthropic details safeguards | https://yourstory.com/ai-story/anthropic-protecting-well-being-of-users | Derivative of SRC01, no additional sycophancy content |
| S05-R04 | How Anthropic Built Safety Into Claude AI (2025 Update) | https://www.adwaitx.com/anthropic-claude-ai-user-wellbeing-safety-features-2025/ | Derivative reporting, no new sycophancy data |
| S05-R05 | Alignment Science Blog | https://alignment.anthropic.com/ | Index page, no specific sycophancy content |
| S05-R06 | Bloom: an open source tool for automated behavioral evaluations | https://alignment.anthropic.com/2025/bloom-auto-evals/ | Related evaluation tool, not sycophancy-specific |
| S05-R07 | OpenAI and Anthropic publish joint AI safety evaluation | https://www.edtechinnovationhub.com/news/openai-and-anthropic-cross-test-ai-models-in-rare-joint-safety-evaluation | Joint evaluation, limited sycophancy detail |
| S05-R08 | Anthropic announces Bloom | https://siliconangle.com/2025/12/22/anthropic-announces-bloom-open-source-tool-researchers-evaluating-ai-behavior/ | Bloom tool, not sycophancy-specific |
| S05-R09 | GitHub - safety-research/petri | https://github.com/safety-research/petri | Repository page, code-level detail |
| S05-R10 | Anthropic — Protecting Well-Being of Users | https://www.anthropic.com/news/protecting-well-being-of-users | Duplicate of S01-R01 |
Notes
This search specifically targeted Anthropic's Petri evaluation tool and its sycophancy measurement capabilities. The primary Petri source provided detailed methodology and cross-model comparison data.