Skip to content

R0055/2026-04-01/C007

Research R0055 — RLHF Yes-Men Claims
Run 2026-04-01
Claim C007

Claim: Six major alternatives to RLHF have emerged since 2022 (DPO, Constitutional AI, GRPO, KTO, ORPO, RLVR)

BLUF: Substantially correct. All six named methods exist and have emerged as alternatives or complements to standard RLHF. DPO (2023), Constitutional AI (2022), GRPO (2025), KTO (2024), ORPO (2024), and RLVR (2024-2025) are all documented. Whether they are all 'major' is debatable — some like DPO and GRPO are widely adopted while others like KTO and ORPO have narrower use.

Probability: Very likely (80-95%) | Confidence: High


Summary

Entity Description
Claim Definition Claim text, scope, status
Assessment Full analytical product with reasoning chain
ACH Matrix Evidence x hypotheses diagnosticity analysis
Self-Audit ROBIS-adapted 5-domain audit

Hypotheses

ID Hypothesis Status
H1 Claim is accurate as stated Inconclusive
H2 Claim is partially correct or correct with caveats Supported
H3 Claim is materially wrong Eliminated

Searches

ID Target Results Selected
S01 RLHF alternatives DPO Constitutional AI GRPO KTO O 10 2

Sources

Source Description Reliability Relevance
SRC01 Post-training survey 2026 Medium High

Revisit Triggers

  • New methods displacing any of these six; changes in adoption patterns