R06¶

PMC article on sociotechnical limits of RLHF alignment.

Summary¶

Field	Value
Title	Helpful, harmless, honest? Sociotechnical limits of AI alignment and safety through RLHF
URL	https://pmc.ncbi.nlm.nih.gov/articles/PMC12137480/
Date accessed	2026-03-28
Publication date	2025
Author(s)	Multiple
Publication	PMC

Included in evidence base: No

Rationale: Broader scope covering RLHF limitations generally, not sycophancy specifically. Not directly relevant to the query.