R0041/2026-03-28/Q001/S05

Research R0041 — Enterprise Sycophancy
Run 2026-03-28
Query Q001
Search S05

WebSearch — Anthropic Claude sycophancy reduction Petri evaluation

Summary

| Field | Value |
| --- | --- |
| Source/Database | WebSearch |
| Query terms | Anthropic Claude sycophancy reduction research Petri evaluation 2025 2026 |
| Filters | None |
| Results returned | 10 |
| Results selected | 2 |
| Results rejected | 8 |

Selected Results

| Result | Title | URL | Rationale |
| --- | --- | --- | --- |
| S05-R01 | Petri: An open-source auditing tool to accelerate AI safety research | https://alignment.anthropic.com/2025/petri/ | Primary source on Petri tool and sycophancy evaluation |
| S05-R02 | Towards Understanding Sycophancy in Language Models | https://arxiv.org/pdf/2310.13548 | Foundational Anthropic research paper on sycophancy |

Rejected Results

| Result | Title | URL | Rationale |
| --- | --- | --- | --- |
| S05-R03 | Anthropic details safeguards | https://yourstory.com/ai-story/anthropic-protecting-well-being-of-users | Derivative of SRC01, no additional sycophancy content |
| S05-R04 | How Anthropic Built Safety Into Claude AI (2025 Update) | https://www.adwaitx.com/anthropic-claude-ai-user-wellbeing-safety-features-2025/ | Derivative reporting, no new sycophancy data |
| S05-R05 | Alignment Science Blog | https://alignment.anthropic.com/ | Index page, no specific sycophancy content |
| S05-R06 | Bloom: an open source tool for automated behavioral evaluations | https://alignment.anthropic.com/2025/bloom-auto-evals/ | Related evaluation tool, not sycophancy-specific |
| S05-R07 | OpenAI and Anthropic publish joint AI safety evaluation | https://www.edtechinnovationhub.com/news/openai-and-anthropic-cross-test-ai-models-in-rare-joint-safety-evaluation | Joint evaluation, limited sycophancy detail |
| S05-R08 | Anthropic announces Bloom | https://siliconangle.com/2025/12/22/anthropic-announces-bloom-open-source-tool-researchers-evaluating-ai-behavior/ | Bloom tool, not sycophancy-specific |
| S05-R09 | GitHub - safety-research/petri | https://github.com/safety-research/petri | Repository page, code-level detail |
| S05-R10 | Anthropic — Protecting Well-Being of Users | https://www.anthropic.com/news/protecting-well-being-of-users | Duplicate of S01-R01 |

Notes

This search targeted Anthropic's Petri evaluation tool and its sycophancy measurement capabilities. The primary Petri source (S05-R01) provided detailed methodology and cross-model comparison data.