
Research R0040 — RLHF Alternatives
Run 2026-04-01
Query Q001
Search S04
Result S04-R01
Source SRC04

Ethayarajh et al. -- KTO: Model Alignment as Prospect Theoretic Optimization

Source

Title: KTO: Model Alignment as Prospect Theoretic Optimization
Venue: ICML 2024
Authors: Kawin Ethayarajh, Winnie Xu, Niklas Muennighoff, Dan Jurafsky, Douwe Kiela
Date: 2024-02-02 (arXiv submission)
URL: https://arxiv.org/abs/2402.01306
Type: Research paper (peer-reviewed)

Summary

Reliability: High
Relevance: High
Bias, missing data: Low risk
Bias, measurement: Low risk
Bias, selective reporting: Low risk
Bias, randomization: N/A (not an RCT)
Bias, protocol deviation: N/A (not an RCT)
Bias, COI/funding: Some concerns

Rationale

Reliability: Peer-reviewed at ICML 2024, a top-tier machine-learning venue; the authors are affiliated with Stanford and Contextual AI.
Relevance: Introduces KTO, a methodologically novel alignment approach that reduces data-collection requirements by replacing preference pairs with per-example binary labels.
Bias flags: Author affiliations include Contextual AI, which commercializes KTO, a potential conflict of interest; however, the paper underwent peer review at ICML.

Evidence Extracts

SRC04-E01: KTO derives its alignment objective from Kahneman-Tversky prospect theory, requires only a binary desirable/undesirable label per example rather than paired preferences, and matches or exceeds DPO performance at 1B-30B parameter scales (see the loss sketch below).
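For orientation, below is a minimal sketch of the KTO objective as described in the paper: each completion's implicit reward is its log-probability ratio under the policy versus a frozen reference model, and that reward is scored by a prospect-theoretic value function relative to a reference point z0. The function name, tensor layout, hyperparameter defaults, and the batch-mean stand-in for z0 are illustrative assumptions, not the authors' reference implementation (the paper estimates z0 as a KL term over mismatched prompt/completion pairs in the batch).

```python
import torch

def kto_loss(policy_logps, ref_logps, is_desirable,
             beta=0.1, lambda_d=1.0, lambda_u=1.0):
    """Illustrative sketch of the KTO objective (Ethayarajh et al., 2024).

    policy_logps, ref_logps: shape (B,) summed log-probabilities of each
        completion under the trainable policy and the frozen reference model.
    is_desirable: shape (B,) bool tensor; True marks desirable completions.
    Hyperparameter defaults are placeholders, not values from the paper.
    """
    # Implicit reward: log-ratio of policy to reference likelihood.
    rewards = policy_logps - ref_logps

    # Reference point z0. The paper estimates it as a KL divergence over
    # mismatched pairs in the batch; the clamped batch-mean reward used here
    # is a simplifying assumption. No gradient flows through it.
    z0 = rewards.mean().clamp(min=0).detach()

    # Prospect-theoretic value function: gains and losses are measured
    # relative to z0, with separate weights for the two label classes.
    v_desirable = lambda_d * torch.sigmoid(beta * (rewards - z0))
    v_undesirable = lambda_u * torch.sigmoid(beta * (z0 - rewards))
    value = torch.where(is_desirable, v_desirable, v_undesirable)

    lam = torch.where(is_desirable,
                      torch.full_like(rewards, lambda_d),
                      torch.full_like(rewards, lambda_u))
    # Minimizing (lambda_y - value) pushes desirable completions above z0
    # and undesirable ones below it.
    return (lam - value).mean()
```

Per the paper, the ratio lambda_d/lambda_u is the main lever for handling class imbalance between desirable and undesirable examples, while beta controls how quickly the value function saturates away from the reference point.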