arXiv Open Access 2025

Assessment of L2 Oral Proficiency using Speech Large Language Models

Rao Ma Mengjie Qian Siyuan Tang Stefano Bannò Kate M. Knill +1 lainnya

Lihat Sumber

Abstrak

The growing population of L2 English speakers has increased the demand for developing automatic graders for spoken language assessment (SLA). Historically, statistical models, text encoders, and self-supervised speech models have been utilised for this task. However, cascaded systems suffer from the loss of information, while E2E graders also have limitations. With the recent advancements of multi-modal large language models (LLMs), we aim to explore their potential as L2 oral proficiency graders and overcome these issues. In this work, we compare various training strategies using regression and classification targets. Our results show that speech LLMs outperform all previous competitive baselines, achieving superior performance on two datasets. Furthermore, the trained grader demonstrates strong generalisation capabilities in the cross-part or cross-task evaluation, facilitated by the audio understanding knowledge acquired during LLM pre-training.

Topik & Kata Kunci

cs.CL cs.SD eess.AS

Penulis (6)

Rao Ma

Mengjie Qian

Siyuan Tang

Stefano Bannò

Kate M. Knill

Mark J. F. Gales

Format Sitasi

APA MLA BibTeX

Ma, R., Qian, M., Tang, S., Bannò, S., Knill, K.M., Gales, M.J.F. (2025). Assessment of L2 Oral Proficiency using Speech Large Language Models. https://arxiv.org/abs/2505.21148

Akses Cepat

Lihat di Sumber

Informasi Jurnal

Tahun Terbit: 2025
Bahasa: en
Sumber Database: arXiv
Akses: Open Access ✓