arXiv Open Access 2023

Spaiche: Extending State-of-the-Art ASR Models to Swiss German Dialects

Clement Sicard Kajetan Pyszkowski Victor Gillioz

Abstract

Recent breakthroughs in NLP have greatly increased the presence of ASR systems in our daily lives. However, for many low-resource languages, ASR models still need improvement, due in part to the difficulty of acquiring pertinent data. This project aims to advance research on ASR models for Swiss German dialects by providing insights into the performance of state-of-the-art ASR models on recently published Swiss German speech datasets. We propose a novel loss that takes into account the semantic distance between the predicted and the ground-truth labels. We outperform current state-of-the-art results by fine-tuning OpenAI's Whisper model on Swiss German datasets.

Authors (3)

Clement Sicard

Kajetan Pyszkowski

Victor Gillioz

Citation Format

Sicard, C., Pyszkowski, K., Gillioz, V. (2023). Spaiche: Extending State-of-the-Art ASR Models to Swiss German Dialects. https://arxiv.org/abs/2304.11075

Journal Information

Publication Year: 2023
Language: en
Source Database: arXiv
Access: Open Access