R0023/2026-03-25/Q001/SRC05
Wharton GAIL foundational study: prompt engineering is complicated and contingent
Source
| Field | Value |
| --- | --- |
| Title | Prompting Science Report 1: Prompt Engineering is Complicated and Contingent |
| Publisher | SSRN / Wharton Generative AI Labs |
| Author(s) | Lennart Meincke, Ethan Mollick, Lilach Mollick, Dan Shapiro |
| Date | 2025-03-04 |
| URL | https://gail.wharton.upenn.edu/research-and-insights/tech-report-prompt-engineering-is-complicated-and-contingent/ |
| Type | Research paper (technical report) |
Summary
| Dimension | Rating |
| --- | --- |
| Reliability | High |
| Relevance | High |
| Bias: Missing data | Low risk |
| Bias: Measurement | Low risk |
| Bias: Selective reporting | Low risk |
| Bias: Randomization | N/A (not an RCT) |
| Bias: Protocol deviation | N/A (not an RCT) |
| Bias: COI/Funding | Low risk |
Rationale
| Dimension | Rationale |
| --- | --- |
| Reliability | 100 repetitions per condition, GPQA Diamond benchmark, multiple correctness thresholds. Foundational methodology paper for the series. |
| Relevance | Establishes that prompt engineering effects are measurement-dependent and highly variable: the meta-finding that explains why popular advice appears to work in demos but fails in practice. |
| Bias flags | Low risk. Academic institution, no vendor affiliation, transparent methodology. |
| Evidence ID | Summary |
| --- | --- |
| SRC05-E01 | Prompt tweaks produce 60-point swings on individual questions that average out across datasets, masking critical variability |