arXiv Open Access 2025

Breaking Physical and Linguistic Borders: Multilingual Federated Prompt Tuning for Low-Resource Languages

Wanru Zhao Yihong Chen Royson Lee Xinchi Qiu Yan Gao +2 lainnya
Lihat Sumber

Abstrak

Pre-trained large language models (LLMs) have become a cornerstone of modern natural language processing, with their capabilities extending across a wide range of applications and languages. However, the fine-tuning of multilingual LLMs, especially for low-resource languages, faces significant challenges arising from data-sharing restrictions (the physical border) and inherent linguistic differences (the linguistic border). These barriers hinder users of various languages, particularly those in low-resource regions, from fully benefiting from the advantages of LLMs. To address these challenges, we propose the Federated Prompt Tuning Paradigm for multilingual scenarios, which utilizes parameter-efficient fine-tuning while adhering to data sharing restrictions. We design a comprehensive set of experiments and analyze them using a novel notion of language distance to highlight the strengths of our paradigm: Even under computational constraints, our method not only improves data efficiency but also facilitates mutual enhancements across languages, particularly benefiting low-resource ones. Compared to traditional local cross-lingual transfer tuning methods, our approach achieves 6.9\% higher accuracy with improved data efficiency, and demonstrates greater stability and generalization. These findings underscore the potential of our approach to promote social equality and champion linguistic diversity, ensuring that no language is left behind.

Topik & Kata Kunci

Penulis (7)

W

Wanru Zhao

Y

Yihong Chen

R

Royson Lee

X

Xinchi Qiu

Y

Yan Gao

H

Hongxiang Fan

N

Nicholas D. Lane

Format Sitasi

Zhao, W., Chen, Y., Lee, R., Qiu, X., Gao, Y., Fan, H. et al. (2025). Breaking Physical and Linguistic Borders: Multilingual Federated Prompt Tuning for Low-Resource Languages. https://arxiv.org/abs/2507.03003

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓