arXiv Open Access 2025

Multilingual Question Answering in Low-Resource Settings: A Dzongkha-English Benchmark for Foundation Models

Md. Tanzib Hosain Rajan Das Gupta Md. Kishor Morol

Lihat Sumber

Abstrak

In this work, we provide DZEN, a dataset of parallel Dzongkha and English test questions for Bhutanese middle and high school students. The over 5K questions in our collection span a variety of scientific topics and include factual, application, and reasoning-based questions. We use our parallel dataset to test a number of Large Language Models (LLMs) and find a significant performance difference between the models in English and Dzongkha. We also look at different prompting strategies and discover that Chain-of-Thought (CoT) prompting works well for reasoning questions but less well for factual ones. We also find that adding English translations enhances the precision of Dzongkha question responses. Our results point to exciting avenues for further study to improve LLM performance in Dzongkha and, more generally, in low-resource languages. We release the dataset at: https://github.com/kraritt/llm_dzongkha_evaluation.

Topik & Kata Kunci

cs.CL

Penulis (3)

Md. Tanzib Hosain

Rajan Das Gupta

Md. Kishor Morol

Format Sitasi

APA MLA BibTeX

Hosain, M.T., Gupta, R.D., Morol, M.K. (2025). Multilingual Question Answering in Low-Resource Settings: A Dzongkha-English Benchmark for Foundation Models. https://arxiv.org/abs/2505.18638

Akses Cepat

Lihat di Sumber

Informasi Jurnal

Tahun Terbit: 2025
Bahasa: en
Sumber Database: arXiv
Akses: Open Access ✓