DOAJ Open Access 2025

In Pursuit of the Trivial

Steinþór Steingrímsson Bjarki Ármannsson

Abstrak

We compare the performance of state-of-the-art Large Language Models on a recently released benchmarking set for automated question answering for Icelandic and compare it with performance on questions from an Icelandic trivia game. We find that the models perform worse for questions on Icelandic subjects, specifically Icelandic culture, but somewhat surprisingly do better on a trivia game for people than on the benchmark set meant for language models, built around data that the model has seen during training. We also call into question some aspects of the benchmarking set and discuss what  playing trivia games can tell us - if anything - about the capabilities of these models.

Penulis (2)

S

Steinþór Steingrímsson

B

Bjarki Ármannsson

Format Sitasi

Steingrímsson, S., Ármannsson, B. (2025). In Pursuit of the Trivial. https://doi.org/10.5617/dhnbpub.12302

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →
Lihat di Sumber doi.org/10.5617/dhnbpub.12302
Informasi Jurnal
Tahun Terbit
2025
Sumber Database
DOAJ
DOI
10.5617/dhnbpub.12302
Akses
Open Access ✓