arXiv Open Access 2026

Linguistic Signatures for Enhanced Emotion Detection

Florian Lecourt Madalina Croitoru Konstantin Todorov
Lihat Sumber

Abstrak

Emotion detection is a central problem in NLP, with recent progress driven by transformer-based models trained on established datasets. However, little is known about the linguistic regularities that characterize how emotions are expressed across different corpora and labels. This study examines whether linguistic features can serve as reliable interpretable signals for emotion recognition in text. We extract emotion-specific linguistic signatures from 13 English datasets and evaluate how incorporating these features into transformer models impacts performance. Our RoBERTa-based models enriched with high level linguistic features achieve consistent performance gains of up to +2.4 macro F1 on the GoEmotions benchmark, showing that explicit lexical cues can complement neural representations and improve robustness in predicting emotion categories.

Topik & Kata Kunci

Penulis (3)

F

Florian Lecourt

M

Madalina Croitoru

K

Konstantin Todorov

Format Sitasi

Lecourt, F., Croitoru, M., Todorov, K. (2026). Linguistic Signatures for Enhanced Emotion Detection. https://arxiv.org/abs/2603.20222

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2026
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓