DOAJ Open Access 2023

An enhanced method for dialect transcription via error‐correcting thesaurus

Xiaoliang Ma Congjian Deng Dequan Du Qingqi Pei

Abstrak

Abstract Automatic speech recognition (ASR) has been widely used in the field of customer service, but the performance of general ASR in dialect transcription is not satisfactory, especially in Guangdong Province. Targeted training of ASR transcription engine will produce effect, but the training cost is high, and it is not suitable for small‐scale training with multiple dialects and frequencies. The complaint problems in the customer service field have obvious clustering and are suitable for few‐shot and multi‐frequency training. In view of this, in the actual engineering application, the method of ASR transcribed into the dialect error correction thesaurus is tried to be used to replace the wrong words, and have achieved good results. The optimization technology after automatic speech transcription proposed in this study can improve the recognition accuracy of general ASR by 13.75% for dialect words.

Topik & Kata Kunci

Penulis (4)

X

Xiaoliang Ma

C

Congjian Deng

D

Dequan Du

Q

Qingqi Pei

Format Sitasi

Ma, X., Deng, C., Du, D., Pei, Q. (2023). An enhanced method for dialect transcription via error‐correcting thesaurus. https://doi.org/10.1049/cmu2.12671

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →
Lihat di Sumber doi.org/10.1049/cmu2.12671
Informasi Jurnal
Tahun Terbit
2023
Sumber Database
DOAJ
DOI
10.1049/cmu2.12671
Akses
Open Access ✓