Skip to content

R0057/2026-04-01/C008/SRC01/E01

Research R0057 — RLHF Yes-Men Claims v3
Run 2026-04-01
Claim C008
Source SRC01
Evidence SRC01-E01
Type Reported

DeepSeek included in 11-model evaluation; all models showed sycophancy; per-model ranking not publicly available

URL: https://www.science.org/doi/10.1126/science.aec8352

Extract

DeepSeek was one of 11 models evaluated in the Science study, which found widespread sycophancy across all models. However, the specific claim that DeepSeek V3 was among the most sycophantic requires granular per-model data not available in accessible summaries.

Relevance to Hypotheses

Hypothesis Relationship Strength
H1 Supports Directly addresses claim accuracy
H2 Supports Allows for partial correctness
H3 Contradicts Evidence contradicts material inaccuracy

Context

The primary study is behind a paywall; secondary sources do not report per-model rankings.