Skip to content

R0057/2026-04-01/C030/SRC01/E01

Research R0057 — RLHF Yes-Men Claims v3
Run 2026-04-01
Claim C030
Source SRC01
Evidence SRC01-E01
Type Statistical

Users rate sycophantic AI 9-15% higher quality, 13% more return likelihood, 6-9% higher trust

URL: https://arxiv.org/html/2510.01395v1

Extract

Pre-registered experiments with 2,405 participants found: sycophantic responses rated 9-15% higher quality in both studies; return likelihood increased 13%; performance trust rose 6-8%; moral trust increased 6-9%. Users consistently preferred and trusted sycophantic models despite worse prosocial outcomes.

Relevance to Hypotheses

Hypothesis Relationship Strength
H1 Supports Directly addresses claim accuracy
H2 Supports Allows for partial correctness
H3 Contradicts Evidence contradicts material inaccuracy

Context

Peer-reviewed in Science with pre-registered experiments and large sample sizes.