Skip to content

R0055/2026-04-01/C018

Research R0055 — RLHF Yes-Men Claims
Run 2026-04-01
Claim C018

Claim: 40% of users apply zero scrutiny to AI outputs

BLUF: Partially correct. A Microsoft/CMU CHI 2025 study found participants self-reported using no critical thinking for 40% of their tasks when using AI. The nuance: this is self-reported behavior for a percentage of tasks, not 40% of users applying zero scrutiny to all outputs. The distinction matters.

Probability: Likely (55-80%) | Confidence: Medium


Summary

Entity Description
Claim Definition Claim text, scope, status
Assessment Full analytical product with reasoning chain
ACH Matrix Evidence x hypotheses diagnosticity analysis
Self-Audit ROBIS-adapted 5-domain audit

Hypotheses

ID Hypothesis Status
H1 Claim is accurate as stated Inconclusive
H2 Claim is partially correct or correct with caveats Supported
H3 Claim is materially wrong Eliminated

Searches

ID Target Results Selected
S01 40% users zero critical thinking AI outputs Micros 10 3

Sources

Source Description Reliability Relevance
SRC01 Lee et al. 2025 (Microsoft/CMU) High High

Revisit Triggers

  • Replication with larger sample; observational (not self-reported) measurement of scrutiny