arXiv Open Access 2025

Linguistic Generalizations are not Rules: Impacts on Evaluation of LMs

Leonie Weissweiler, Kyle Mahowald, Adele Goldberg

Abstract

Linguistic evaluations of how well LMs generalize to produce or understand language often implicitly take for granted that natural languages are generated by symbolic rules. According to this perspective, grammaticality is determined by whether sentences obey such rules. Interpretation is compositionally generated by syntactic rules operating on meaningful words. Semantic parsing maps sentences into formal logic. Failures of LMs to obey strict rules are presumed to reveal that LMs do not produce or understand language like humans. Here we suggest that LMs' failures to obey symbolic rules may be a feature rather than a bug, because natural languages are not based on neatly separable, compositional rules. Rather, new utterances are produced and understood by a combination of flexible, interrelated, and context-dependent constructions. Considering gradient factors such as frequencies, context, and function will help us reimagine new benchmarks and analyses to probe whether and how LMs capture the rich, flexible generalizations that comprise natural languages.

Authors (3)

Leonie Weissweiler
Kyle Mahowald
Adele Goldberg

Citation Format

Weissweiler, L., Mahowald, K., Goldberg, A. (2025). Linguistic Generalizations are not Rules: Impacts on Evaluation of LMs. https://arxiv.org/abs/2502.13195

Journal Information
Publication Year: 2025
Language: en
Source Database: arXiv
Access: Open Access