R0027/2026-03-26/Q001/SRC03/E01¶
XLT prompting achieves 10+ point improvement and reduces cross-language performance gaps
URL: https://aclanthology.org/2023.findings-emnlp.826/
Extract¶
XLT "brings over 10 points of average improvement in arithmetic reasoning and open-domain question-answering tasks" and "significantly reduces the gap between the average performance and the best performance of each task in different languages." The paper's title — "Not All Languages Are Created Equal in LLMs" — itself confirms that language inequality is a documented phenomenon.
Relevance to Hypotheses¶
| Hypothesis | Relationship | Strength |
|---|---|---|
| H1 | Supports | The existence of a 10+ point gap that can be reduced by prompting technique confirms the gap exists |
| H2 | Contradicts | Clear quantified performance differences across languages |
| H3 | Supports | The gap is reducible through technique choice, confirming conditionality |
Context¶
This is a foundational, peer-reviewed paper in the field. It establishes both the problem (language inequality) and a mitigation (XLT prompting). Highly cited in subsequent multilingual LLM research.