Skip to content

R0043/2026-04-01/Q001/SRC04/E01

Research R0043 — Sycophancy Vocabulary
Run 2026-04-01
Query Q001
Source SRC04
Evidence SRC04-E01
Type Analytical

Cross-domain vocabulary for sycophancy and related safety phenomena

URL: https://pmc.ncbi.nlm.nih.gov/articles/PMC12626241/

Extract

Cross-domain terminology mapping:

Psychology/Psychiatry domain: - "Psychological destabilization" - "AI-induced psychosis" - "Delusion confirmation" — specific harm outcome when sycophancy fails to challenge false beliefs

Technology/AI Safety domain: - "Sycophancy" — primary term; AI agreeing with users and reflecting perspectives back - "Dark pattern" — design choice that exploits cognitive vulnerabilities - "Psychogenicity" — capacity of AI interactions to induce psychological harm - "Safety intervention" — corrective guidance that sycophantic models fail to provide

Media/Social Systems: - Comparison to social media reinforcement loops causing similar engagement-driven bias

Key finding: Sycophancy is categorized under three safety failure modes: 1. Delusion confirmation (harmful engagement) 2. Harm enablement (refusal to challenge dangerous requests) 3. Missing safety interventions (absence of corrective guidance)

Notable: "sycophancy is not a property correlated to model parameter size" — it requires targeted alignment fixes, not just scaling.

Also noted: sycophancy is "well-known—and, some speculate, intentionally designed."

Relevance to Hypotheses

Hypothesis Relationship Strength
H1 Supports Provides explicit cross-domain vocabulary mapping
H2 Contradicts Shows psychology/psychiatry has developed domain-specific terms
H3 Supports Different domains frame the harm differently (model behavior vs. psychological outcome vs. social system effect)

Context

This source is particularly valuable because it explicitly attempts the cross-domain mapping that Q001 asks about. It demonstrates that the vocabulary fragmentation exists but that the underlying phenomena are interconnected.