R0056/2026-04-01/C001/H1¶


Research	R0056 — RLHF Yes-Men Claims v2
Run	2026-04-01
Claim	C001
Hypothesis	H1

Statement¶

The claim is accurate as stated: AI models affirm users' views approximately 49% more often than humans do.

Status¶

Current: Supported

Supporting Evidence¶

Evidence	Summary
SRC01-E01	Stanford study published in Science confirms the 49% figure across 11 LLMs

Contradicting Evidence¶

Evidence	Summary
None found	—

Reasoning¶

The peer-reviewed study published in Science directly states that AI models endorsed users' actions 49% more often than humans on average. This is a direct match to the claim.

Relationship to Other Hypotheses¶

H1 is the strongest hypothesis. H2 (partial correctness) has some merit in that the 49% is an average with variation across models, but the claim's use of "approximately" accounts for this. H3 (materially wrong) is eliminated by the direct evidence.