R0028/2026-03-26/C012¶
Claim: Research from Wharton's Generative AI Lab (GAIL), presented at EMNLP 2024, found that expert persona prompting ("act as an expert in X") actually degrades factual accuracy.
BLUF: Partially correct. Wharton GAIL did publish research finding that expert persona prompting does not reliably improve accuracy and can degrade performance (low-knowledge personas hurt, and domain-mismatched expert personas sometimes degraded results). However, the work was published as a technical report on SSRN, not presented at EMNLP 2024. The finding is also more nuanced than "degrades accuracy": expert personas showed no reliable improvement rather than consistent degradation.
Probability: Likely (55-80%) | Confidence: Medium
Correction needed: The research was published as a GAIL technical report (Prompting Science Report 4) on SSRN, not at EMNLP 2024. The finding is that expert personas show "no reliable improvement," not consistent degradation; low-knowledge personas, however, do degrade performance.
Summary¶
| Entity | Description |
|---|---|
| Claim Definition | Claim text, scope, status |
| Assessment | Full analytical product with reasoning chain |
| ACH Matrix | Evidence × hypotheses diagnosticity analysis |
| Self-Audit | ROBIS-adapted 4-domain process audit |
Hypotheses¶
| ID | Hypothesis | Status |
|---|---|---|
| H1 | Claim is accurate, including the EMNLP 2024 venue | Inconclusive |
| H2 | The research finding is real but was not presented at EMNLP 2024, and the claim overstates the degradation effect | Supported |
| H3 | Claim is materially wrong | Eliminated |
Searches¶
| ID | Target | Results | Selected |
|---|---|---|---|
| S01 | Primary search | 10 | 3 |
Sources¶
| Source | Description | Reliability | Relevance |
|---|---|---|---|
| SRC01 | Wharton GAIL — Playing Pretend: Expert Personas | High | High |