Semantic Scholar Open Access 2023 1 sitasi

Multilingual Speech Recognition Using Reinforcement Learning

A. Y. Athish S. K G S. M

Abstrak

A system that properly converts spoken language into written text is what the voice recognition and transcription project seeks to create. The system will analyze audio inputs and turn them into text using a variety of methods, including neural networks, language modeling, and acoustic modeling. The project's main goal is to increase transcription accuracy by tackling issues, including speaker accents, background noise, and audio quality. To make sure the system is capable of properly transcribing various speech varieties, it will be put to the test using a range of audio sources. The system's performance and limitations will be covered in a report that will accompany a functioning prototype of the voice recognition and transcription system. The project's findings will help numerous businesses that rely on voice recognition and transcription technologies improve communication, accessibility, and production.

Topik & Kata Kunci

Computer Science

Penulis (3)

A. Y. Athish

S. K G

S. M

Format Sitasi

APA MLA BibTeX

Athish, A.Y., G, S.K., M, S. (2023). Multilingual Speech Recognition Using Reinforcement Learning. https://doi.org/10.1109/ICCCNT56998.2023.10307335

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →

Lihat di Sumber doi.org/10.1109/ICCCNT56998.2023.10307335

Informasi Jurnal

Tahun Terbit: 2023
Bahasa: en
Total Sitasi: 1×
Sumber Database: Semantic Scholar
DOI: 10.1109/ICCCNT56998.2023.10307335
Akses: Open Access ✓