Semantic Scholar · Open Access · 2020 · 8 citations

Applications of Natural Language Processing in Bilingual Language Teaching: An Indonesian-English Case Study

Zara Maxwell-Smith Simón González Ochoa Ben Foley H. Suominen

Abstract

Multilingual corpora are difficult to compile and a classroom setting adds pedagogy to the mix of factors which make this data so rich and problematic to classify. In this paper, we set out methodological considerations of using automated speech recognition to build a corpus of teacher speech in an Indonesian language classroom. Our preliminary results (64% word error rate) suggest these tools have the potential to speed data collection in this context. We provide practical examples of our data structure, details of our piloted computer-assisted processes, and fine-grained error analysis. Our study is informed and directed by genuine research questions and discussion in both the education and computational linguistics fields. We highlight some of the benefits and risks of using these emerging technologies to analyze the complex work of language teachers and in education more generally.
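For readers unfamiliar with the metric quoted in the abstract, word error rate (WER) is conventionally the word-level edit distance between a reference transcript and the recognizer's output, divided by the number of reference words. The sketch below is a minimal illustration of that standard definition, not the authors' evaluation code; the function name `word_error_rate` and the example sentences are invented for illustration, and whitespace tokenization is an assumption.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER = (substitutions + deletions + insertions) / reference length.

    Assumes simple whitespace tokenization; real evaluations usually
    normalize case and punctuation first.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words (row-by-row Levenshtein distance).
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # i deletions turn i reference words into an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,                      # deletion of a reference word
                curr[j - 1] + 1,                  # insertion of a hypothesis word
                prev[j - 1] + substitution_cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical mixed Indonesian-English utterance: one substituted word out of four.
print(word_error_rate("please open buku kalian", "please open book kalian"))
```

Because insertions count as errors, WER can exceed 100% when the hypothesis is much longer than the reference, so a figure such as 64% should be read as an error count relative to reference length rather than as a share of words that were wrong.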

Authors (4)

Zara Maxwell-Smith

Simón González Ochoa

Ben Foley

H. Suominen

Citation Format

Maxwell-Smith, Z., Ochoa, S.G., Foley, B., Suominen, H. (2020). Applications of Natural Language Processing in Bilingual Language Teaching: An Indonesian-English Case Study. https://doi.org/10.18653/v1/2020.bea-1.12

Quick Access

View at Source: doi.org/10.18653/v1/2020.bea-1.12
Journal Information
Year of Publication: 2020
Language: en
Total Citations: 8
Source Database: Semantic Scholar
DOI: 10.18653/v1/2020.bea-1.12
Access: Open Access ✓