R0043/2026-04-01/Q002 — Assessment¶
BLUF¶
No regulatory framework or procurement standard directly addresses "sycophancy" by name. However, four distinct regulatory mechanisms provide indirect coverage: the EU AI Act's automation bias awareness requirement (Article 14), NIST AI 600-1's confabulation and information integrity risk categories, SR 11-7's effective challenge and independent validation requirements in banking, and the FDA's human factors evaluation requirements for AI medical devices. The coverage gap is specifically at the intersection of model behavior and regulatory language — regulations address human responses (automation bias) and output quality (confabulation) but not the model tendency to prioritize agreement over accuracy.
Probability¶
Rating: N/A (open-ended query)
Confidence in assessment: Medium
Confidence rationale: High confidence in the finding that no regulation directly names sycophancy. Medium confidence in the completeness of the indirect coverage inventory — additional regulatory frameworks (DOD-specific, aviation-specific) may contain relevant provisions not discovered in this search.
Reasoning Chain¶
- The EU AI Act Article 14 explicitly requires that high-risk AI systems be designed to enable human oversight with awareness of "automation bias" — the first EU norm to name a cognitive bias. [SRC01-E01, High reliability, High relevance]
- NIST AI 600-1 identifies "confabulation" and "information integrity" as generative AI risks but does not name sycophancy or agreement-seeking behavior as a distinct risk category. [SRC02-E01, High reliability, Medium-High relevance]
- SR 11-7's "effective challenge" requirement mandates independent validation and challenge of model outputs in banking — a governance mechanism that structurally opposes sycophancy but was not designed for it. [SRC03-E01, High reliability, Medium-High relevance]
- FDA requires human factors evaluation of AI medical devices but does not identify sycophancy or acquiescence as specific risk categories. [SRC04-E01, High reliability, Medium relevance]
- IEEE 3119 procurement standard provides structured AI procurement processes but without sycophancy-specific evaluation criteria. [SRC05-E01, High reliability, Medium relevance]
- Georgetown analysis confirms no explicit regulatory framework addresses sycophancy; industry self-regulation (exemplified by OpenAI's voluntary GPT-4o rollback) is the current approach. [SRC06-E01, Medium-High reliability, High relevance]
- JUDGMENT: The regulatory landscape addresses sycophancy-adjacent concerns through four distinct mechanisms — human cognition (EU AI Act/automation bias), output quality (NIST/confabulation), governance process (SR 11-7/effective challenge), and human factors (FDA) — but none targets the model behavior itself. This creates a specific gap where a model could be sycophantic in ways not captured by existing frameworks.
Evidence Base Summary¶
| Source | Description | Reliability | Relevance | Key Finding |
|---|---|---|---|---|
| SRC01 | EU AI Act Article 14 | High | High | Names "automation bias" — closest regulatory provision |
| SRC02 | NIST AI 600-1 | High | Medium-High | Addresses confabulation, not sycophancy |
| SRC03 | SR 11-7 | High | Medium-High | Effective challenge requirement in banking |
| SRC04 | FDA AI guidance | High | Medium | Human factors evaluation without sycophancy |
| SRC05 | IEEE 3119 | High | Medium | Procurement standard without sycophancy |
| SRC06 | Georgetown brief | Medium-High | High | Confirms regulatory gap |
Collection Synthesis¶
| Dimension | Assessment |
|---|---|
| Evidence quality | Medium — the regulatory texts themselves are high quality, but the absence of sycophancy provisions is inferred from what those texts omit rather than positively demonstrated |
| Source agreement | High — all sources converge on indirect-but-incomplete coverage |
| Source independence | High — EU, US federal agencies, standards bodies, and academic institutions |
| Outliers | None — no source found direct sycophancy regulation |
Detail¶
The most significant finding is the structural nature of the gap. Regulations address three aspects of the human-AI interaction: (1) human cognition (automation bias), (2) output quality (confabulation/accuracy), (3) governance process (independent validation). The missing fourth dimension is model behavior — the tendency to prioritize agreement over accuracy. This gap means a model could pass all existing regulatory tests while still being systemically sycophantic.
Gaps¶
| Missing Evidence | Impact on Assessment |
|---|---|
| DOD-specific AI deployment standards | Could contain human-machine teaming requirements addressing sycophancy |
| Aviation-specific AI deployment standards (FAA/ICAO) | May contain automation complacency provisions |
| Actual enterprise procurement RFPs | Would show whether organizations require sycophancy testing in practice |
| ISO/IEC 42001 full text | Could contain relevant AI management system requirements |
Researcher Bias Check¶
Declared biases: The researcher's anti-sycophancy stance and publication incentive could lead to overstating the regulatory gap. The finding that indirect coverage exists through four mechanisms should temper the "no regulation" narrative.
Influence assessment: The assessment is balanced — it documents both the existing indirect coverage and the specific gap. The researcher should note that indirect coverage may be sufficient in practice even if it does not name sycophancy directly.
Cross-References¶
| Entity | ID | File |
|---|---|---|
| Hypotheses | H1, H2, H3 | hypotheses/ |
| Sources | SRC01, SRC02, SRC03, SRC04, SRC05, SRC06 | sources/ |
| ACH Matrix | — | ach-matrix.md |
| Self-Audit | — | self-audit.md |