arXiv Open Access 2025

Can you map it to English? The Role of Cross-Lingual Alignment in Multilingual Performance of LLMs

Kartik Ravisankar Hyojung Han Sarah Wiegreffe Marine Carpuat

Lihat Sumber

Abstrak

Large language models (LLMs) can answer prompts in many languages, despite being trained predominantly on English; yet, the mechanisms driving this generalization remain poorly understood. This work asks: How does an LLM's ability to align representations of non-English inputs to English impact its performance on natural language understanding (NLU) tasks? We study the role of representation alignment in instance-level task decisions, complementing prior analyses conducted both at the language level and task-independently. We introduce the Discriminative Alignment Index ($\DALI$) to quantify instance-level alignment across 24 languages other than English and three distinct NLU tasks. Results show that incorrect NLU predictions are strongly associated with lower representation alignment with English in the model's middle layers. Through activation patching, we show that incorrect predictions in languages other than English can be fixed by patching their parallel English activations in the middle layers, thereby demonstrating the causal role of representation (mis)alignment in cross-lingual correctness.

Topik & Kata Kunci

cs.CL

Penulis (4)

Kartik Ravisankar

Hyojung Han

Sarah Wiegreffe

Marine Carpuat

Format Sitasi

APA MLA BibTeX

Ravisankar, K., Han, H., Wiegreffe, S., Carpuat, M. (2025). Can you map it to English? The Role of Cross-Lingual Alignment in Multilingual Performance of LLMs. https://arxiv.org/abs/2504.09378

Akses Cepat

Lihat di Sumber

Informasi Jurnal

Tahun Terbit: 2025
Bahasa: en
Sumber Database: arXiv
Akses: Open Access ✓