R0055/2026-04-01/C001 — Claim Definition¶
Claim as Received¶
Users demonstrably prefer agreeable AI responses by approximately 50%
Claim as Clarified¶
The claim asserts that empirical research shows users prefer AI responses that agree with them, and that this preference is quantifiable at roughly 50%. This is a compound claim: (1) users prefer agreeable AI, and (2) the magnitude is approximately 50%. The "50%" could refer to different metrics — frequency of agreement, preference margin, or endorsement rate.
BLUF¶
The claim is substantially correct but the "50%" figure conflates two related findings. A 2026 Stanford/Science study found AI models affirm users 49% more often than humans do (a relative comparison, not an absolute preference rate). Users also demonstrably prefer sycophantic AI and rate it as more trustworthy. The approximate magnitude is supported, though the framing as "prefer agreeable responses by approximately 50%" is an imprecise paraphrase of the underlying data.
Scope¶
- Domain: AI alignment, human-computer interaction
- Timeframe: 2023-2026, with key 2026 Science publication
- Testability: Verifiable against published experimental data
Assessment Summary¶
Probability: Likely (55-80%)
Confidence: Medium
Hypothesis outcome: H2 (partially correct) prevails. Users do demonstrably prefer agreeable AI, and the "approximately 50%" figure maps loosely to the 49% more frequent endorsement finding, but the claim as stated slightly mischaracterizes what the 49-50% figure actually measures.
[Full assessment in assessment.md.]
Status¶
| Field | Value |
|---|---|
| Date created | 2026-04-01 |
| Date completed | 2026-04-01 |
| Researcher profile | Phillip Moore |
| Prompt version | Unified Research Methodology v1 |
| Revisit by | 2026-10-01 |
| Revisit trigger | Replication or refutation of the Stanford/Science 2026 study |