R0044/2026-03-29/Q003 — Query Definition¶
Query as Received¶
Has anyone in the regulated industries (aviation, defense, healthcare, finance) published research or guidance that explicitly connects the human-factors concept of "automation bias" or "overtrust" to the AI safety concept of "sycophancy"? Is anyone bridging these two vocabularies?
Query as Clarified¶
- Subject: Published work that explicitly connects human-factors vocabulary (automation bias, overtrust, complacency) to AI safety vocabulary (sycophancy, RLHF alignment, reward hacking)
- Scope: Work from or about regulated industries, not just pure AI safety research
- Evidence basis: Papers, guidance documents, or policy analyses that name both concepts and draw an explicit connection between them
- Key distinction: The query asks whether anyone has recognized that automation bias and sycophancy describe overlapping aspects of the same problem from different disciplinary perspectives
Ambiguities Identified¶
- "Explicitly connects" could mean formally proving equivalence, or simply mentioning both terms in the same document. The query appears to seek the former — work that recognizes the conceptual relationship, not just co-occurrence.
- "Bridging" implies someone working across both disciplines, not just citing the other field's literature in passing.
Sub-Questions¶
- Do any regulated-industry research papers cite both "automation bias" and "sycophancy" and discuss their relationship?
- Has the AI safety community acknowledged that their "sycophancy" concept maps to the longer-established human factors concept of "automation bias"?
- Has the human factors community acknowledged that "sycophancy" in AI systems represents a system-side mechanism that amplifies their existing concern about automation bias?
Hypotheses¶
| ID | Hypothesis | Description |
|---|---|---|
| H1 | Yes, explicit bridging exists | Published research or guidance explicitly connects automation bias/overtrust to AI sycophancy, recognizing them as related phenomena requiring integrated solutions |
| H2 | No bridging exists | The two vocabularies remain siloed in their respective communities with no published work connecting them |
| H3 | Partial/emerging bridging exists | Some researchers acknowledge both concepts but do not yet draw explicit connections; the bridging is implicit or incidental rather than the focus of the work |