arXiv Open Access 2022

Automatic Identification of Motivation for Code-Switching in Speech Transcripts

Ritu Belani Jeffrey Flanigan
Lihat Sumber

Abstrak

Code-switching, or switching between languages, occurs for many reasons and has important linguistic, sociological, and cultural implications. Multilingual speakers code-switch for a variety of purposes, such as expressing emotions, borrowing terms, making jokes, introducing a new topic, etc. The reason for code-switching may be quite useful for analysis, but is not readily apparent. To remedy this situation, we annotate a new dataset of motivations for code-switching in Spanish-English. We build the first system (to our knowledge) to automatically identify a wide range of motivations that speakers code-switch in everyday speech, achieving an accuracy of 75% across all motivations. Additionally, we show that the system can be adapted to new language pairs, achieving 66% accuracy on a new language pair (Hindi-English), demonstrating the cross-lingual applicability of our annotation scheme

Topik & Kata Kunci

Penulis (2)

R

Ritu Belani

J

Jeffrey Flanigan

Format Sitasi

Belani, R., Flanigan, J. (2022). Automatic Identification of Motivation for Code-Switching in Speech Transcripts. https://arxiv.org/abs/2212.08565

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2022
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓