Skip to content

R0055/2026-04-01/C001/H3

Research R0055 — RLHF Yes-Men Claims
Run 2026-04-01
Claim C001
Hypothesis H3

Statement

The claim is materially wrong — users do not demonstrably prefer agreeable AI by approximately 50%.

Status

Current: Eliminated

Supporting Evidence

Evidence Summary
No evidence supports this hypothesis

Contradicting Evidence

Evidence Summary
SRC01-E01 The 49% figure exists and user preference for sycophantic AI is documented
SRC01-E02 Users rated sycophantic AI as more trustworthy

Reasoning

The evidence clearly shows users do prefer agreeable AI, and a quantifiable ~49-50% figure exists in the literature. H3 is eliminated because the core direction of the claim is supported by peer-reviewed research.

Relationship to Other Hypotheses

H3 is the null hypothesis and is eliminated by the evidence supporting both H1 and H2.