DOAJ Open Access 2026

AI-Driven Code Documentation: Comparative Evaluation of LLMs for Commit Message Generation

Mohamed Mehdi Trigui Wasfi G. Al-Khatib Mohammad Amro Fatma Mallouli

Abstrak

Commit messages are essential for understanding software evolution and maintaining traceability of projects; however, their quality varies across repositories. Recent Large Language Models provide a promising path to automate this task by generating concise context-sensitive commit messages directly from code diffs. This paper provides a comparative study of three paradigms of large language models: zero-shot prompting, retrieval-augmented generation, and fine-tuning, using the large-scale CommitBench dataset that spans six programming languages. We assess the performance of the models with automatic metrics, namely BLEU, ROUGE-L, METEOR, and Adequacy, and a human assessment of 100 commits. In the latter, experienced developers rated each generated commit message for Adequacy and Fluency on a five-point Likert scale. The results show that fine-tuning and domain adaptation yield models that perform consistently better than general-purpose baselines across all evaluation metrics, thus generating commit messages with higher semantic adequacy and clearer phrasing than zero-shot approaches. The correlation analysis suggests that the Adequacy and BLEU scores are closer to human judgment, while ROUGE-L and METEOR tend to underestimate the quality in cases where the models generate stylistically diverse or paraphrased outputs. Finally, the study outlines a conceptual integration pathway for incorporating such models into software development workflows, emphasizing a human-in-the-loop approach for quality assurance.

Penulis (4)

M

Mohamed Mehdi Trigui

W

Wasfi G. Al-Khatib

M

Mohammad Amro

F

Fatma Mallouli

Format Sitasi

Trigui, M.M., Al-Khatib, W.G., Amro, M., Mallouli, F. (2026). AI-Driven Code Documentation: Comparative Evaluation of LLMs for Commit Message Generation. https://doi.org/10.3390/computers15020087

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →
Lihat di Sumber doi.org/10.3390/computers15020087
Informasi Jurnal
Tahun Terbit
2026
Sumber Database
DOAJ
DOI
10.3390/computers15020087
Akses
Open Access ✓