
MLQA: Evaluating Cross-lingual Extractive Question Answering

Patrick Lewis Barlas Oğuz Ruty Rinott Sebastian Riedel Holger Schwenk

Abstract

Question answering (QA) models have shown rapid progress enabled by the availability of large, high-quality benchmark datasets. Such annotated datasets are difficult and costly to collect, and rarely exist in languages other than English, making building QA systems that work well in other languages challenging. In order to develop such systems, it is crucial to invest in high quality multilingual evaluation benchmarks to measure progress. We present MLQA, a multi-way aligned extractive QA evaluation benchmark intended to spur research in this area. MLQA contains QA instances in 7 languages, English, Arabic, German, Spanish, Hindi, Vietnamese and Simplified Chinese. MLQA has over 12K instances in English and 5K in each other language, with each instance parallel between 4 languages on average. We evaluate state-of-the-art cross-lingual models and machine-translation-based baselines on MLQA. In all cases, transfer results are shown to be significantly behind training-language performance.
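The abstract frames MLQA as extractive QA: each answer is a span of its passage, recoverable from a character offset. As a minimal sketch of what one instance looks like, the snippet below parses a local copy of a split, assuming the release uses SQuAD-v1-style JSON and the file naming of the official repository (both are assumptions about the distribution, not stated in the abstract):

```python
import json

def load_mlqa_split(path):
    """Parse one MLQA split, assuming SQuAD-v1-style JSON:
    a top-level "data" list of articles, each holding "paragraphs"
    that pair a "context" string with a list of "qas"."""
    with open(path, encoding="utf-8") as f:
        squad = json.load(f)
    examples = []
    for article in squad["data"]:
        for paragraph in article["paragraphs"]:
            context = paragraph["context"]
            for qa in paragraph["qas"]:
                gold = qa["answers"][0]  # first gold answer span
                examples.append({
                    "id": qa["id"],
                    "question": qa["question"],
                    "context": context,
                    "answer_text": gold["text"],
                    "answer_start": gold["answer_start"],
                })
    return examples

# Hypothetical local path to the English-question/English-context test file.
examples = load_mlqa_split("MLQA_V1/test/test-context-en-question-en.json")
ex = examples[0]

# Extractive QA invariant: the character offset recovers the answer exactly.
end = ex["answer_start"] + len(ex["answer_text"])
assert ex["context"][ex["answer_start"]:end] == ex["answer_text"]
```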


Authors (5)

Patrick Lewis
Barlas Oğuz
Ruty Rinott
Sebastian Riedel
Holger Schwenk

Citation Format

Lewis, P., Oğuz, B., Rinott, R., Riedel, S., & Schwenk, H. (2020). MLQA: Evaluating Cross-lingual Extractive Question Answering. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. https://doi.org/10.18653/v1/2020.acl-main.653


Journal Information

Year Published: 2019
Language: English (en)
Total Citations: 589
Source Database: Semantic Scholar
DOI: 10.18653/v1/2020.acl-main.653
Access: Open Access ✓