arXiv Open Access 2024

ChatGPT as a Solver and Grader of Programming Exams written in Spanish

Pablo Saborido-Fernández Marcos Fernández-Pichel David E. Losada
Lihat Sumber

Abstrak

Evaluating the capabilities of Large Language Models (LLMs) to assist teachers and students in educational tasks is receiving increasing attention. In this paper, we assess ChatGPT's capacities to solve and grade real programming exams, from an accredited BSc degree in Computer Science, written in Spanish. Our findings suggest that this AI model is only effective for solving simple coding tasks. Its proficiency in tackling complex problems or evaluating solutions authored by others are far from effective. As part of this research, we also release a new corpus of programming tasks and the corresponding prompts for solving the problems or grading the solutions. This resource can be further exploited by other research teams.

Topik & Kata Kunci

Penulis (3)

P

Pablo Saborido-Fernández

M

Marcos Fernández-Pichel

D

David E. Losada

Format Sitasi

Saborido-Fernández, P., Fernández-Pichel, M., Losada, D.E. (2024). ChatGPT as a Solver and Grader of Programming Exams written in Spanish. https://arxiv.org/abs/2409.15112

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2024
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓