Skip to content
Research R0040 — RLHF Alternatives
Run 2026-03-29
Query Q001 — RLHF Alternatives
Search S04
Result S04-R01

S04-R01 — Constitutional AI: Harmlessness from AI Feedback

Summary

Title Constitutional AI: Harmlessness from AI Feedback
URL https://arxiv.org/abs/2212.08073
Date accessed 2026-03-29
Publication date December 2022
Authors Yuntao Bai et al. (50+ co-authors, Anthropic)
Publication arXiv (Anthropic research report)

Selection Decision

Selected as the primary paper introducing Constitutional AI and RLAIF. Foundational work deployed in Anthropic's Claude.