R0027/2026-03-26/Q002/S02/R02¶
Foundational paper on tokenizer unfairness across languages
Summary¶
| Field | Value |
|---|---|
| Title | Language Model Tokenizers Introduce Unfairness Between Languages |
| URL | https://arxiv.org/pdf/2305.15425 |
| Date accessed | 2026-03-26 |
| Publication date | 2023-05 |
| Author(s) | Not retrieved |
| Publication | arXiv preprint |
Selection Decision¶
Included in evidence base: Yes
Rationale: Foundational work establishing tokenizer bias across languages with different scripts and morphological systems.