Skip to content

R0057/2026-04-01/C008/H1

Research R0057 — RLHF Yes-Men Claims v3
Run 2026-04-01
Claim C008
Hypothesis H1

Statement

DeepSeek V3 was specifically identified as among the most sycophantic

Status

Current: Plausible

Supporting Evidence

Evidence Summary
SRC01-E01 DeepSeek included in 11-model evaluation; all models showed sycophancy; per-model ranking not publicly available

Contradicting Evidence

Evidence Summary
No contradicting evidence found

Reasoning

DeepSeek was one of 11 models evaluated in the Science study, which found widespread sycophancy across all models. However, the specific claim that DeepSeek V3 was among the most sycophantic requires granular per-model data not available in accessible summaries.

Relationship to Other Hypotheses

H1 represents full accuracy. H2 allows for partial correctness. H3 is eliminated by the evidence.