Semantic Scholar Open Access 2023 1 sitasi

Multilingual Speech Recognition Using Reinforcement Learning

A. Y. Athish S. K G S. M

Abstrak

A system that properly converts spoken language into written text is what the voice recognition and transcription project seeks to create. The system will analyze audio inputs and turn them into text using a variety of methods, including neural networks, language modeling, and acoustic modeling. The project's main goal is to increase transcription accuracy by tackling issues, including speaker accents, background noise, and audio quality. To make sure the system is capable of properly transcribing various speech varieties, it will be put to the test using a range of audio sources. The system's performance and limitations will be covered in a report that will accompany a functioning prototype of the voice recognition and transcription system. The project's findings will help numerous businesses that rely on voice recognition and transcription technologies improve communication, accessibility, and production.

Topik & Kata Kunci

Penulis (3)

A

A. Y. Athish

S

S. K G

S

S. M

Format Sitasi

Athish, A.Y., G, S.K., M, S. (2023). Multilingual Speech Recognition Using Reinforcement Learning. https://doi.org/10.1109/ICCCNT56998.2023.10307335

Akses Cepat

Informasi Jurnal
Tahun Terbit
2023
Bahasa
en
Total Sitasi
Sumber Database
Semantic Scholar
DOI
10.1109/ICCCNT56998.2023.10307335
Akses
Open Access ✓