arXiv Open Access 2026

Speech Emotion Recognition with ASR Integration

Yuanchao Li

Abstract

Speech Emotion Recognition (SER) plays a pivotal role in understanding human communication, enabling emotionally intelligent systems, and serving as a fundamental component in the development of Artificial General Intelligence (AGI). However, deploying SER in real-world, spontaneous, and low-resource scenarios remains a significant challenge due to the complexity of emotional expression and the limitations of current speech and language technologies. This thesis investigates the integration of Automatic Speech Recognition (ASR) into SER, with the goal of enhancing the robustness, scalability, and practical applicability of emotion recognition from spoken language.

Authors (1)

Yuanchao Li

Citation Format

Li, Y. (2026). Speech Emotion Recognition with ASR Integration. https://arxiv.org/abs/2601.17901

Journal Information
Publication Year
2026
Language
en
Source Database
arXiv
Access
Open Access ✓