Skip to content

R0055/2026-04-01/C019 — Claim Definition

Claim as Received

Research shows users prefer sycophantic AI, trust it more, and rate it as higher quality

Claim as Clarified

Research shows users prefer sycophantic AI, trust it more, and rate it as higher quality

BLUF

Correct. The Stanford/Science 2026 study found users deemed sycophantic responses more trustworthy and were more likely to return. The Anthropic/ICLR 2024 paper found human preference models prefer sycophantic responses over correct ones. Multiple studies converge on this finding.

Scope

  • Domain: AI alignment, sycophancy, enterprise AI
  • Timeframe: 2022-2026
  • Testability: Verifiable against published research and documentation

Assessment Summary

Probability: Almost certain (95-99%)

Confidence: High

Hypothesis outcome: H1 prevails — see assessment for details.

[Full assessment in assessment.md.]

Status

Field Value
Date created 2026-04-01
Date completed 2026-04-01
Researcher profile Phillip Moore
Prompt version Unified Research Methodology v1
Revisit by 2026-10-01
Revisit trigger Studies finding user segments that actively prefer non-sycophantic AI