A Systematic Assessment of Language Models with Linguistic Minimal Pairs in Chinese
Abstract
We present ZhoBLiMP, the largest linguistic minimal pair benchmark for Chinese, with over 100 paradigms, ranging from topicalization to the Ba construction. We then train from scratch a suite of Chinese language models (LMs) with different tokenizers, parameter sizes, and token volumes, to study the learning curves of LMs on Chinese. To mitigate the biases introduced by unequal lengths of the sentences in a minimal pair, we propose a new metric named sub-linear length normalized log-probabilities (SLLN-LP). Using SLLN-LP as the metric, our results show that Anaphor, Quantifiers, and Ellipsis in Chinese are difficult for LMs even up to 32B parameters, and that SLLN-LP successfully mitigates biases in ZhoBLiMP, JBLiMP and BLiMP. We conclude that future evaluations should be more carefully designed to consider the intricate relations between linking functions, LMs, and targeted minimal pairs.
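Minimal-pair evaluation scores an LM by whether it assigns a higher score to the grammatical sentence of a pair than to the ungrammatical one; length normalization matters because the two sentences may differ in token count. The abstract does not give the exact SLLN-LP formula, so the sketch below is only illustrative: it assumes sub-linear normalization means dividing the summed log-probability by the sentence length raised to a power below 1 (the function name `slln_lp`, the exponent `alpha`, and the toy log-probabilities are all hypothetical).

```python
def slln_lp(token_logprobs, alpha=0.5):
    """Illustrative sub-linear length-normalized log-probability.

    Assumption (not the paper's verified definition): divide the total
    log-probability by len(sentence) ** alpha with alpha < 1, so longer
    sentences are penalized less than under plain per-token averaging.
    """
    total = sum(token_logprobs)
    return total / (len(token_logprobs) ** alpha)

# Toy minimal pair: per-token log-probs from a hypothetical LM.
good = [-1.2, -0.8, -1.5, -0.9]        # grammatical sentence (4 tokens)
bad = [-1.2, -0.8, -3.1, -2.7, -0.9]   # ungrammatical sentence (5 tokens)

# The LM "chooses" the grammatical sentence if its score is higher.
prefers_good = slln_lp(good) > slln_lp(bad)
```

With `alpha=1` this reduces to mean log-probability, and with `alpha=0` to the raw sum, so the exponent interpolates between the two standard linking functions the abstract contrasts.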
Topics & Keywords
Authors (12)
Yikang Liu
Yeting Shen
Hongao Zhu
Lilong Xu
Zhiheng Qian
Siyuan Song
Kejia Zhang
Jialong Tang
Pei Zhang
Baosong Yang
Rui Wang
Hai Hu
Quick Access
- Publication Year
- 2024
- Language
- en
- Source Database
- arXiv
- Access
- Open Access ✓