C023¶


Research	R0028 — Prompt Engineering Claims
Run	2026-03-26
Claim	C023

Claim: Approximately 72-87% of cross-language failures are attributable to model limitations (primarily tokenization inefficiency) rather than to the linguistic structures themselves, with only about 2% of failures tracing to direct linguistic nuances like word order or inflection.

BLUF: Confirmed. The LILT analysis explicitly states: 'Fundamental model limitations drive 72.1% to 87.3% of errors, while data artifacts account for 10.6% to 25.6%, and inherent language nuances represent approximately 2% of the gap.' These exact figures match the claim.

Probability: Very likely (80-95%) | Confidence: High

Summary¶

Entity	Description
Claim Definition	Claim text, scope, status
Assessment	Full analytical product with reasoning chain
ACH Matrix	Evidence x hypotheses diagnosticity analysis
Self-Audit	ROBIS-adapted 4-domain process audit

Hypotheses¶

ID	Hypothesis	Status
H1	Claim is accurate — exact percentages confirmed	Supported
H2	Partially correct — percentages may vary by study	Inconclusive
H3	Claim is materially wrong	Eliminated

Searches¶

ID	Target	Results	Selected
S01	Primary search	10	3

Sources¶

Source	Description	Reliability	Relevance
SRC01	LILT Multilingual LLM Performance Gap Analysis	High	High