arXiv Open Access 2025

On the robustness of ChatGPT in teaching Korean Mathematics

Phuong-Nam Nguyen Quang Nguyen-The An Vu-Minh Diep-Anh Nguyen Xuan-Lam Pham

Lihat Sumber

Abstrak

ChatGPT, an Artificial Intelligence model, has the potential to revolutionize education. However, its effectiveness in solving non-English questions remains uncertain. This study evaluates ChatGPT's robustness using 586 Korean mathematics questions. ChatGPT achieves 66.72% accuracy, correctly answering 391 out of 586 questions. We also assess its ability to rate mathematics questions based on eleven criteria and perform a topic analysis. Our findings show that ChatGPT's ratings align with educational theory and test-taker perspectives. While ChatGPT performs well in question classification, it struggles with non-English contexts, highlighting areas for improvement. Future research should address linguistic biases and enhance accuracy across diverse languages. Domain-specific optimizations and multilingual training could improve ChatGPT's role in personalized education.

Topik & Kata Kunci

cs.AI math.HO

Penulis (5)

Phuong-Nam Nguyen

Quang Nguyen-The

An Vu-Minh

Diep-Anh Nguyen

Xuan-Lam Pham

Format Sitasi

APA MLA BibTeX

Nguyen, P., Nguyen-The, Q., Vu-Minh, A., Nguyen, D., Pham, X. (2025). On the robustness of ChatGPT in teaching Korean Mathematics. https://arxiv.org/abs/2502.11915

Akses Cepat

Lihat di Sumber

Informasi Jurnal

Tahun Terbit: 2025
Bahasa: en
Sumber Database: arXiv
Akses: Open Access ✓