R0028/2026-03-26/C023
Claim: Approximately 72-87% of cross-language failures are attributable to model limitations (primarily tokenization inefficiency) rather than to the linguistic structures themselves, with only about 2% of failures tracing to direct linguistic nuances like word order or inflection.
BLUF: Confirmed. The LILT analysis explicitly states: 'Fundamental model limitations drive 72.1% to 87.3% of errors, while data artifacts account for 10.6% to 25.6%, and inherent language nuances represent approximately 2% of the gap.' These exact figures match the claim.
Probability: Very likely (80-95%) | Confidence: High
Summary
Hypotheses
| ID |
Hypothesis |
Status |
| H1 |
Claim is accurate — exact percentages confirmed |
Supported |
| H2 |
Partially correct — percentages may vary by study |
Inconclusive |
| H3 |
Claim is materially wrong |
Eliminated |
Searches
| ID |
Target |
Results |
Selected |
| S01 |
Primary search |
10 |
3 |
Sources
| Source |
Description |
Reliability |
Relevance |
| SRC01 |
LILT Multilingual LLM Performance Gap Analysis |
High |
High |