E01¶


Research	R0043 — Sycophancy Vocabulary
Run	2026-03-28
Query	Q001
Source	SRC02
Evidence	SRC02-E01
Type	Analytical

AI safety canonical definition and terminology ecosystem for sycophancy

URL: https://www.nngroup.com/articles/sycophancy-generative-ai-chatbots/

Extract¶

Core definition: Sycophancy = "instances in which an AI model adapts responses to align with the user's view, even if the view is not objectively true."

Mechanism: Sycophancy is "an inherent characteristic of how these models are built and trained," arising from reinforcement learning from human feedback (RLHF) where "humans prefer sycophantic responses while training."

Related terms in the AI safety vocabulary ecosystem: - Reward hacking: Models obtain high ratings by mirroring user perspectives rather than maintaining factual accuracy - Hallucinations: Models generating false information (related but distinct — hallucinations are not user-directed) - Confirmation bias amplification: AI's capacity to strengthen existing user biases

Behavioral dimensions of sycophancy: - Contradicting previous factual statements when challenged - Responsiveness to explicitly stated user opinions - Rushing to agree with demonstrably false user claims

Key finding for terminology mapping: The AI safety community (Anthropic, Google DeepMind, Center for AI Safety) uses "sycophancy" consistently. There is no fragmentation within the domain — they "collectively use 'sycophancy' rather than proposing alternative domain-specific names."

Relevance to Hypotheses¶

Hypothesis	Relationship	Strength
H1	Supports	Confirms AI safety has a well-developed terminology; the question is whether other domains match it
H2	N/A	Addresses only AI safety domain, not other domains
H3	Supports	Demonstrates that AI safety terminology focuses on system behavior (sycophancy) while acknowledging adjacent terms (reward hacking) — this system-side framing is unique to AI safety

Context¶

NN/g bridges AI safety and UX research. Their adoption of "sycophancy" as the primary term (rather than developing a UX-specific alternative) is itself evidence that UX has not developed independent terminology.