arXiv Open Access 2024

RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs

Ekaterina Taktasheva, Maxim Bazhukov, Kirill Koncha, Alena Fenogenova, Ekaterina Artemova, and 1 more

Abstract

Minimal pairs are a well-established approach to evaluating the grammatical knowledge of language models. However, existing resources for minimal pairs address a limited number of languages and lack diversity of language-specific grammatical phenomena. This paper introduces the Russian Benchmark of Linguistic Minimal Pairs (RuBLiMP), which includes 45k pairs of sentences that differ in grammaticality and isolate a morphological, syntactic, or semantic phenomenon. In contrast to existing benchmarks of linguistic minimal pairs, RuBLiMP is created by applying linguistic perturbations to automatically annotated sentences from open text corpora and carefully curating test data. We describe the data collection protocol and present the results of evaluating 25 language models in various scenarios. We find that the widely used language models for Russian are sensitive to morphological and agreement-oriented contrasts but fall behind humans on phenomena requiring understanding of structural relations, negation, transitivity, and tense. RuBLiMP, the codebase, and other materials are publicly available.
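
The minimal-pair evaluation described in the abstract can be illustrated with a small sketch: a model is counted as correct on a pair when it assigns a higher score to the grammatical sentence than to the ungrammatical one, and accuracy is the fraction of pairs scored correctly. Everything below is an assumption made for illustration. The add-one-smoothed bigram model is a toy stand-in for a real language model, the English sentences are hypothetical and not taken from RuBLiMP, and the function names (`train_bigram`, `log_prob`, `minimal_pair_accuracy`) are not from the paper's codebase.

```python
import math
from collections import Counter


def train_bigram(corpus):
    """Count bigram contexts and bigrams over whitespace-tokenized sentences."""
    contexts, bigrams, vocab = Counter(), Counter(), {"<s>", "</s>"}
    for sentence in corpus:
        tokens = ["<s>"] + sentence.lower().split() + ["</s>"]
        vocab.update(tokens)
        contexts.update(tokens[:-1])             # every token that has a successor
        bigrams.update(zip(tokens, tokens[1:]))  # adjacent token pairs
    return contexts, bigrams, len(vocab)


def log_prob(sentence, model):
    """Sentence log-probability under the bigram model with add-one smoothing."""
    contexts, bigrams, vocab_size = model
    tokens = ["<s>"] + sentence.lower().split() + ["</s>"]
    return sum(
        math.log((bigrams[(a, b)] + 1) / (contexts[a] + vocab_size))
        for a, b in zip(tokens, tokens[1:])
    )


def minimal_pair_accuracy(pairs, score):
    """Fraction of (grammatical, ungrammatical) pairs where the grammatical one scores higher."""
    correct = sum(score(good) > score(bad) for good, bad in pairs)
    return correct / len(pairs)


# Hypothetical toy training corpus and minimal pairs (subject-verb agreement).
corpus = ["the cat sleeps", "the cats sleep", "the dog sleeps", "the dogs sleep"]
pairs = [
    ("the cat sleeps", "the cat sleep"),
    ("the dogs sleep", "the dogs sleeps"),
]

model = train_bigram(corpus)
accuracy = minimal_pair_accuracy(pairs, lambda s: log_prob(s, model))
print(f"accuracy = {accuracy:.2f}")
```

To evaluate an actual language model, the `score` callable would be replaced by that model's sentence log-probability (or a pseudo-log-likelihood for masked models). `minimal_pair_accuracy` does not need to change.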

Authors (6)

Ekaterina Taktasheva
Maxim Bazhukov
Kirill Koncha
Alena Fenogenova
Ekaterina Artemova
Vladislav Mikhailov

Citation Format

Taktasheva, E., Bazhukov, M., Koncha, K., Fenogenova, A., Artemova, E., Mikhailov, V. (2024). RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs. https://arxiv.org/abs/2406.19232

Journal Information
Publication Year: 2024
Language: en
Source Database: arXiv
Access: Open Access