arXiv Open Access 2025

On the robustness of ChatGPT in teaching Korean Mathematics

Phuong-Nam Nguyen Quang Nguyen-The An Vu-Minh Diep-Anh Nguyen Xuan-Lam Pham
Lihat Sumber

Abstrak

ChatGPT, an Artificial Intelligence model, has the potential to revolutionize education. However, its effectiveness in solving non-English questions remains uncertain. This study evaluates ChatGPT's robustness using 586 Korean mathematics questions. ChatGPT achieves 66.72% accuracy, correctly answering 391 out of 586 questions. We also assess its ability to rate mathematics questions based on eleven criteria and perform a topic analysis. Our findings show that ChatGPT's ratings align with educational theory and test-taker perspectives. While ChatGPT performs well in question classification, it struggles with non-English contexts, highlighting areas for improvement. Future research should address linguistic biases and enhance accuracy across diverse languages. Domain-specific optimizations and multilingual training could improve ChatGPT's role in personalized education.

Topik & Kata Kunci

Penulis (5)

P

Phuong-Nam Nguyen

Q

Quang Nguyen-The

A

An Vu-Minh

D

Diep-Anh Nguyen

X

Xuan-Lam Pham

Format Sitasi

Nguyen, P., Nguyen-The, Q., Vu-Minh, A., Nguyen, D., Pham, X. (2025). On the robustness of ChatGPT in teaching Korean Mathematics. https://arxiv.org/abs/2502.11915

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓