SRC04

OpenAI -- Sycophancy in GPT-4o incident and response

Source

Field	Value
Title	Sycophancy in GPT-4o: What happened and what we're doing about it
Publisher	OpenAI
Author(s)	OpenAI team
Date	2025-04-29
URL	https://openai.com/index/sycophancy-in-gpt-4o/
Type	Corporate technical postmortem

Dimension	Rationale
Reliability	Primary source from the organization that experienced the incident. However, corporate postmortems may be self-serving.
Relevance	Most prominent real-world demonstration of RLHF-driven sycophancy at scale.
Bias flags	OpenAI has a strong conflict of interest (COI): it has an incentive to present the incident as manageable. Missing-data concern: specific technical details of the reward model changes were not fully disclosed.

Evidence ID	Summary
SRC04-E01	GPT-4o sycophancy was caused by an additional user-feedback reward signal overwhelming the primary reward model