arXiv Open Access 2021

The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and Methods

Xian Shi Fan Yu Yizhou Lu Yuhao Liang Qiangze Feng +3 lainnya

Lihat Sumber

Abstrak

The variety of accents has posed a big challenge to speech recognition. The Accented English Speech Recognition Challenge (AESRC2020) is designed for providing a common testbed and promoting accent-related research. Two tracks are set in the challenge -- English accent recognition (track 1) and accented English speech recognition (track 2). A set of 160 hours of accented English speech collected from 8 countries is released with labels as the training set. Another 20 hours of speech without labels is later released as the test set, including two unseen accents from another two countries used to test the model generalization ability in track 2. We also provide baseline systems for the participants. This paper first reviews the released dataset, track setups, baselines and then summarizes the challenge results and major techniques used in the submissions.

Topik & Kata Kunci

cs.SD eess.AS

Penulis (8)

Xian Shi

Fan Yu

Yizhou Lu

Yuhao Liang

Qiangze Feng

Daliang Wang

Yanmin Qian

Lei Xie

Format Sitasi

APA MLA BibTeX

Shi, X., Yu, F., Lu, Y., Liang, Y., Feng, Q., Wang, D. et al. (2021). The Accented English Speech Recognition Challenge 2020: Open Datasets, Tracks, Baselines, Results and Methods. https://arxiv.org/abs/2102.10233

Akses Cepat

Lihat di Sumber

Informasi Jurnal

Tahun Terbit: 2021
Bahasa: en
Sumber Database: arXiv
Akses: Open Access ✓