arXiv Open Access 2025

Can you map it to English? The Role of Cross-Lingual Alignment in Multilingual Performance of LLMs

Kartik Ravisankar, Hyojung Han, Sarah Wiegreffe, Marine Carpuat

Abstract

Large language models (LLMs) can answer prompts in many languages, despite being trained predominantly on English; yet, the mechanisms driving this generalization remain poorly understood. This work asks: How does an LLM's ability to align representations of non-English inputs to English impact its performance on natural language understanding (NLU) tasks? We study the role of representation alignment in instance-level task decisions, complementing prior analyses conducted both at the language level and task-independently. We introduce the Discriminative Alignment Index (DALI) to quantify instance-level alignment across 24 languages other than English and three distinct NLU tasks. Results show that incorrect NLU predictions are strongly associated with lower representation alignment with English in the model's middle layers. Through activation patching, we show that incorrect predictions in languages other than English can be fixed by patching their parallel English activations in the middle layers, thereby demonstrating the causal role of representation (mis)alignment in cross-lingual correctness.
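
The activation patching idea mentioned in the abstract can be illustrated with a toy sketch. This is not the paper's setup: the three-layer NumPy network, the random inputs standing in for an English sentence and its parallel non-English sentence, and the names `forward`, `patch_layer`, and `patch_value` are all hypothetical, chosen only to show the mechanics of caching an activation from one run and substituting it into another.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy model: three dense tanh layers standing in for an LLM's layer stack.
weights = [rng.normal(size=(4, 4)) for _ in range(3)]

def forward(x, patch_layer=None, patch_value=None):
    """Run the toy model; optionally overwrite one layer's output with a cached activation."""
    activations = []
    h = x
    for i, w in enumerate(weights):
        h = np.tanh(h @ w)
        if i == patch_layer:
            h = patch_value  # activation patching: substitute the cached activation
        activations.append(h)
    return h, activations

x_en = rng.normal(size=4)     # stand-in for an English input (assumption)
x_other = rng.normal(size=4)  # stand-in for the parallel non-English input (assumption)

out_en, acts_en = forward(x_en)   # clean English run, activations cached
out_other, _ = forward(x_other)   # unpatched non-English run

# Patch the middle layer (index 1) of the non-English run with the English activation.
out_patched, _ = forward(x_other, patch_layer=1, patch_value=acts_en[1])
```

In this toy the patch replaces the entire hidden state, so the patched output coincides with the English output. In a transformer, a patch usually targets selected layers and token positions, so its effect on the prediction is partial and has to be measured.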

Authors (4)

Kartik Ravisankar

Hyojung Han

Sarah Wiegreffe

Marine Carpuat

Citation Format

Ravisankar, K., Han, H., Wiegreffe, S., Carpuat, M. (2025). Can you map it to English? The Role of Cross-Lingual Alignment in Multilingual Performance of LLMs. https://arxiv.org/abs/2504.09378

Journal Information
Publication Year
2025
Language
en
Source Database
arXiv
Access
Open Access ✓