Skip to content

R0043/2026-04-01

Research R0043 — Sycophancy Vocabulary
Mode Query
Run date 2026-04-01
Queries 3
Prompt Unified Research Methodology v1
Model Claude Opus 4.6 (1M context)

This run investigates the vocabulary landscape around AI sycophancy across eight industries, the regulatory coverage of the phenomenon under domain-specific names, and whether the vocabulary gap has been recognized as a problem requiring solutions.

Queries

Q001 — Vocabulary Mapping — Medium Confidence

Query: What terms do different industries and disciplines use to describe AI behavior that prioritizes user agreement, comfort, or satisfaction over accuracy, correctness, or safety?

Answer: The phenomenon maps to a rich but fragmented vocabulary. No two fields use the same primary term, and the terms describe different facets of the same system: sycophancy (model behavior), automation bias (human cognition), overreliance (human behavior), and domain-specific terms for manifestations in each context.

Hypothesis Status Probability
H1: Comprehensive map constructable Supported
H2: Some domains lack terminology Partially Supported
H3: Terms describe different phenomena Partially Supported

Confidence: Medium · Sources: 9 · Searches: 5

Full analysis

Q002 — Enterprise Requirements — Medium Confidence

Query: Search for enterprise requirements, procurement specifications, regulatory guidance, or deployment standards that address the sycophancy phenomenon under its domain-specific names.

Answer: No regulatory framework directly addresses "sycophancy" by name. Four indirect mechanisms provide partial coverage: EU AI Act (automation bias), NIST AI 600-1 (confabulation), SR 11-7 (effective challenge), FDA (human factors). The gap is at the intersection of model behavior and regulatory language.

Hypothesis Status Probability
H1: Direct requirements exist Eliminated
H2: No requirements at all Eliminated
H3: Indirect coverage only Supported

Confidence: Medium · Sources: 6 · Searches: 2

Full analysis

Q003 — Vocabulary Gap as Problem — Medium-High Confidence

Query: Has the vocabulary gap itself been identified as a problem in the AI safety or AI governance literature?

Answer: The broader AI terminology gap is well-recognized, with multiple organizations proposing solutions. However, the specific sycophancy vocabulary gap has not been prioritized in any identified taxonomy, glossary, or framework — even the most comprehensive efforts (53-threat taxonomy, 100+ term glossary) exclude sycophancy.

Hypothesis Status Probability
H1: Gap recognized, active efforts Supported (broad level)
H2: Gap not recognized Eliminated
H3: Broader gap recognized, sycophancy excluded Supported (best fit)

Confidence: Medium-High · Sources: 4 · Searches: 1

Full analysis


Collection Analysis

Cross-Cutting Patterns

Pattern Queries Affected Significance
Sycophancy falls between taxonomy categories Q001, Q003 Not a governance concept, not a security threat, not a process term — a behavioral model property that current frameworks miss
Human-side vs model-side framing divide Q001, Q002 Regulations address human responses (automation bias) but not the model behavior that triggers them
Indirect regulatory coverage is the norm Q002, Q003 Four distinct regulatory mechanisms provide partial coverage, but none names sycophancy directly
Structural research isolation impedes diffusion Q001, Q003 83% homophily between AI safety and ethics communities limits terminology diffusion

Collection Statistics

Metric Value
Queries investigated 3
Answered with high confidence 0
Answered with medium-high confidence 1 (Q003)
Answered with medium confidence 2 (Q001, Q002)

Source Independence Assessment

Sources span 5 countries (US, UK, Australia, EU, Germany), 4 institutional types (academic, government, professional association, journalism), and 8+ domains. No two sources share a common upstream origin. The convergence on vocabulary fragmentation is derived from independent observations across multiple communities, strengthening the finding.

The Roytburg & Miller network analysis provides structural evidence for WHY the sources are independent — the 83% homophily finding means the communities that coined these terms rarely interact, producing genuinely independent vocabulary development.

Collection Gaps

Gap Impact Mitigation
NIST AI 600-1 full text inaccessible (PDF) Could not verify sycophancy-related content Used secondary sources and NIST website summaries
DOD-specific AI deployment standards not fully investigated May contain sycophancy-relevant human-machine teaming requirements Covered through CSET brief and DOD CaTE references
Non-English terminology not searched EU, Asian, and other language communities may use different terms English-language search covers the primary research literature
Actual procurement RFPs not accessible Cannot confirm whether organizations require sycophancy testing in practice Used regulatory frameworks as proxy
No sources defending the status quo found Cannot fully test whether vocabulary fragmentation is viewed as appropriate by some communities This absence may reflect genuine consensus or search bias

Collection Self-Audit

Domain Rating Notes
Eligibility criteria Low risk Clear criteria: domain-specific terminology, regulatory provisions, taxonomy efforts
Search comprehensiveness Some concerns 8 search queries across 16 WebSearch calls; some domains received more attention than others
Evaluation consistency Low risk All 19 sources across 3 queries scored using identical framework
Synthesis fairness Low risk Counter-evidence included (SRC06-braun acquiescence reversal); nuanced hypotheses (H3) in all queries

Resources

Summary

Metric Value
Queries investigated 3
Files produced 176
Sources scored 19
Evidence extracts 19
Results dispositioned 13 + 62 = 75 (Q001) + 6 + 44 = 50 (Q002) + 4 + 16 = 20 (Q003) = 145 total

Tool Breakdown

Tool Uses Purpose
WebSearch 16 Search queries across domains
WebFetch 13 Page content retrieval (3 failed — PDF/403/redirect)
Write 176 File creation
Read 2 Methodology and output format reading
Edit 0 No file modifications
Bash 6 Directory creation, file generation, validation

Token Distribution

Category Tokens
Input (context) ~250,000
Output (generation) ~120,000
Total ~370,000