C006 — Claim Definition¶


Research	R0057 — RLHF Yes-Men Claims v3
Run	2026-04-01
Claim	C006

Claim as Received¶

At least six major alternatives to RLHF have emerged since 2022 (DPO, KTO, GRPO, Constitutional AI, ORPO, RLVR).

Claim as Clarified¶

At least six major alternatives to RLHF have emerged since 2022 (DPO, KTO, GRPO, Constitutional AI, ORPO, RLVR).

BLUF¶

Confirmed. All six named alternatives are well-documented: DPO (2023), KTO (2024), GRPO (2024), Constitutional AI (2022), ORPO (2024), and RLVR (2024-2025). All are widely adopted or cited.

Scope¶

Domain: AI sycophancy research
Timeframe: Current (2024-2026)
Testability: Verifiable against published research and public records

Assessment Summary¶

Probability: Almost certain (95-99%)

Confidence: High

Hypothesis outcome: H1 is supported based on available evidence.

[Full assessment in assessment.md.]

Status¶

Field	Value
Date created	2026-04-01
Date completed	2026-04-01
Researcher profile	Phillip Moore
Prompt version	Unified Research Methodology v1
Revisit by	2027-04-01
Revisit trigger	If any of the six named alternatives is shown to not qualify as a major alternative