Semantic Scholar Open Access 2020 733 sitasi

MLS: A Large-Scale Multilingual Dataset for Speech Research

Vineel Pratap Qiantong Xu Anuroop Sriram Gabriel Synnaeve R. Collobert

Lihat Sumber DOI

Abstrak

This paper introduces Multilingual LibriSpeech (MLS) dataset, a large multilingual corpus suitable for speech research. The dataset is derived from read audiobooks from LibriVox and consists of 8 languages, including about 44.5K hours of English and a total of about 6K hours for other languages. Additionally, we provide Language Models (LM) and baseline Automatic Speech Recognition (ASR) models and for all the languages in our dataset. We believe such a large transcribed dataset will open new avenues in ASR and Text-To-Speech (TTS) research. The dataset will be made freely available for anyone at this http URL.

Topik & Kata Kunci

Engineering Computer Science

Penulis (5)

Vineel Pratap

Qiantong Xu

Anuroop Sriram

Gabriel Synnaeve

R. Collobert

Format Sitasi

APA MLA BibTeX

Pratap, V., Xu, Q., Sriram, A., Synnaeve, G., Collobert, R. (2020). MLS: A Large-Scale Multilingual Dataset for Speech Research. https://doi.org/10.21437/Interspeech.2020-2826

Akses Cepat

Lihat di Sumber doi.org/10.21437/Interspeech.2020-2826

Informasi Jurnal

Tahun Terbit: 2020
Bahasa: en
Total Sitasi: 733×
Sumber Database: Semantic Scholar
DOI: 10.21437/Interspeech.2020-2826
Akses: Open Access ✓