R0040/2026-03-28/Q002/S01/R06
PMC article on sociotechnical limits of RLHF alignment.
Summary
| Field | Value |
|---|---|
| Title | Helpful, harmless, honest? Sociotechnical limits of AI alignment and safety through RLHF |
| URL | https://pmc.ncbi.nlm.nih.gov/articles/PMC12137480/ |
| Date accessed | 2026-03-28 |
| Publication date | 2025 |
| Author(s) | Multiple |
| Publication | PMC |
Selection Decision
Included in evidence base: No
Rationale: The article covers the limitations of RLHF in general rather than sycophancy specifically, so it is not directly relevant to the query.