Q002 — Sycophancy Warnings — Assessment¶


Research	R0048 — Corporate AI Training
Run	2026-03-29
Query	Q002 — Sycophancy Warnings

BLUF¶

No corporate or government AI training material examined specifically warns users about sycophancy — the tendency of AI to agree with users, provide comforting answers, confirm assumptions, or prioritize helpfulness over accuracy. This is despite extensive academic research (including a 2026 Science publication), a major real-world incident (OpenAI GPT-4o rollback, April 2025), and multiple policy analyses identifying sycophancy as a significant risk. The gap extends to equivalent concepts: automation bias, overtrust, and confirmation reinforcement are absent from standard training. Even the NIST AI Risk Management Framework, which addresses "confabulation," does not specifically identify sycophancy. The research-to-practice gap is driven by three factors: publication lag, commercial disincentives (sycophancy drives engagement metrics), and absence of regulatory mandates.

Probability¶

Dimension	Value
Rating	Almost certain (97%)
Confidence	High
Confidence rationale	Absence finding based on comprehensive search across major training providers, government agencies, policy templates, and regulatory frameworks. Corroborated by multiple independent sources identifying sycophancy as "hidden" or unaddressed.

The assessment that no standard training warns about sycophancy is rated "almost certain" based on the complete absence of sycophancy content across all examined training materials, combined with independent sources confirming this gap.

Reasoning Chain¶

Extensive search across corporate training providers (NAVEX, DataCamp), consulting firms (Deloitte, KPMG), government agencies (GSA, DoD, NHS, UK GDS), policy templates (Fisher Phillips), and regulatory frameworks (EU AI Act, NIST AI RMF) found no sycophancy warnings.
The term "sycophancy" does not appear in any training material or policy template examined.
Equivalent terms (automation bias, overtrust, confirmation reinforcement, acquiescence) also do not appear in training materials, though automation bias appears in academic research.
Academic research is extensive: the Stanford study in Science (SRC03-E01), Bayesian analysis (SRC06-E01), Microsoft's ~60-paper review (SRC07-E01), and NN/g UX research (SRC05-E01).
The OpenAI rollback incident (SRC04-E01) demonstrated real-world harm from sycophancy affecting millions of users.
Georgetown Law identifies a structural disincentive: firms are unlikely to self-regulate because sycophancy drives user engagement metrics (SRC02-E01).
The gap is confirmed by IPR calling sycophancy "hidden" (SRC01-E01) and by the absence of sycophancy from NIST's risk framework (SRC08-E01).

Evidence Base Summary¶

Source	Reliability	Relevance	Key Finding
SRC01	Medium-High	High	Sycophancy called "hidden" and "most dangerous"
SRC02	High	High	Firms won't self-regulate; product recalls recommended
SRC03	High	High	Science publication; 50% excess affirmation; users prefer sycophancy
SRC04	Medium-High	High	Real-world incident; RLHF feedback loop mechanism
SRC05	Medium-High	High	Practical mitigations exist in UX literature
SRC06	Medium-High	High	Even rational users misled; biased sampling mechanism
SRC07	High	High	Research recommends training; products do not implement
SRC08	High	High	Confabulation addressed; sycophancy not named
SRC09	Medium	Medium-High	40% zero-scrutiny rate
SRC10	Medium	High	No legislation targets sycophancy

Collection Synthesis¶

Dimension	Assessment
Evidence quality	High: Science publication, NIST framework, Microsoft Research review, Georgetown policy analysis
Source agreement	Complete: all sources confirm sycophancy is not in training
Independence	Strong: academic, government, commercial, legal, and journalistic sources
Outliers	None — no source contradicts the finding

Detail¶

This is the strongest finding in the research run. The absence of sycophancy from training is confirmed from every angle examined. The most important nuance is that the absence is not due to ignorance — the concept is well-researched and well-documented — but to a structural gap between research and practice, compounded by commercial disincentives.

Gaps¶

Gap	Impact on Confidence
Proprietary internal training at tech companies (Google, Meta, Anthropic) not examined	Low — these are not standard corporate training
AI safety bootcamps or specialized courses may address sycophancy	Low — these are not "standard" training
Training may address sycophancy indirectly through "verify outputs" advice	Low — indirect reference without naming the concept is not a warning

Researcher Bias Check¶

The researcher acknowledges that a negative finding (absence) is inherently harder to prove than a positive finding. The comprehensive search strategy mitigates this: 10 independent source types were examined, and the absence is consistent across all. The researcher also sought disconfirming evidence (training that does address sycophancy) and found none.

Cross-References¶

ACH Matrix
Self-Audit
H1, H2, H3
Related: Q001 (general training content), Q003 (hallucination characterization)