DOAJ Open Access 2025

Student-Generated User Story Quality: A Study on Practitioner and ChatGPT Evaluation

Muhammad Ihsan Zul Suhaila Mohd. Yasin Dadang Syarif Sihabudin Sahid

Abstrak

Evaluating the quality of student-generated user stories is important in software engineering education, but only a limited number of industry practitioners can assist. The integration of generative AI can facilitate this process. To do so, the INVEST quality evaluation framework is widely recognized for assessing user story quality; however, prior research has not explored its use in conjunction with generative AI. This study investigated ChatGPT's ability to evaluate user stories using the INVEST framework. This study compares two ChatGPT-based evaluation approaches with those of experienced practitioners, focusing on student-generated user stories. Discrepancies between ChatGPT and practitioner evaluations were measured using Mean Absolute Deviation (MAD), Mean Squared Error (MSE), and Root Mean Squared Error (RMSE). Statistical significance was tested using the Mann-Whitney U Test. The results indicate that ChatGPT’s 1st approach yielded lower discrepancies than practitioner evaluations. Moreover, significance testing showed no statistically significant differences between the ChatGPT and practitioner results for the two INVEST criteria- Independent and Estimable. These findings suggest that the 1st approach can assist in the evaluation process, although practitioners must ensure comprehensive and accurate evaluations. ChatGPT can provide preliminary evaluations in educational contexts, enabling students to receive formative feedback and allowing educators to streamline evaluation processes. Although practitioner validation is still required, their role may shift toward verifying AI-generated results, thus reducing the overall workload and accelerating quality evaluation

Penulis (3)

M

Muhammad Ihsan Zul

S

Suhaila Mohd. Yasin

D

Dadang Syarif Sihabudin Sahid

Format Sitasi

Zul, M.I., Yasin, S.M., Sahid, D.S.S. (2025). Student-Generated User Story Quality: A Study on Practitioner and ChatGPT Evaluation. https://doi.org/10.29207/resti.v9i5.6950

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →
Lihat di Sumber doi.org/10.29207/resti.v9i5.6950
Informasi Jurnal
Tahun Terbit
2025
Sumber Database
DOAJ
DOI
10.29207/resti.v9i5.6950
Akses
Open Access ✓