Evaluation of Artificial Intelligence Answers for Short Stature in Paediatric Endocrinology by Paediatric Endocrinologists
Abstrak
Objective: Artificial intelligence (AI) is increasingly used in medicine, including pediatric endocrinology. AI models have the potential to support clinical decision-making, patient education, and guidance. However, their accuracy, reliability, and effectiveness in providing medical information and recommendations remain unclear. The aim was to evaluate and compare the performance of four AI models, ChatGPT, Bard, Microsoft Copilot, and Pi, in answering frequently asked questions related to pediatric endocrinology. Methods: Nine questions commonly asked by parents regarding short stature in pediatric endocrinology were selected, based on literature reviews and expert opinions. These questions were posed to four AI models in both Turkish and English. The AI-generated responses were evaluated by 10 pediatric endocrinologists using a 12-item Likert-scale questionnaire assessing medical accuracy, completeness, guidance, and informativeness. Statistical analyses, including Kruskal-Wallis and post-hoc tests, were conducted to determine significant differences between AI models. Results: Bard outperformed other models in guidance and recommendation categories, excelling in directing users to medical consultation. Microsoft Copilot demonstrated strong medical accuracy but lacked guidance capacity. ChatGPT showed consistent performance in knowledge dissemination, making it effective for patient education. Pi scored the lowest in guidance and recommendations, indicating limited applicability in clinical settings. Significant differences were observed between AI models (p<0.05), particularly in completeness and guidance-related categories. Conclusion: The present study highlights the varying strengths and weaknesses of AI models in an area of pediatric endocrinology. While Bard was effective in guidance, Microsoft Copilot excelled at accuracy, and ChatGPT was informative. Future AI improvements should focus on balancing accuracy and guidance to enhance clinical decision-support and patient education. Tailored AI applications may optimize the role of AI in specialized medical fields.
Topik & Kata Kunci
Penulis (10)
Kamber Kaşali
Özgür Fırat Özpolat
Merve Ülkü
Ayşe Sena Dönmez
Serap Kılıç Kaya
Esra Dişçi
Serkan Bilge Koca
Ufuk Özkaya
Hüseyin Demirbilek
Atilla Çayır
Akses Cepat
PDF tidak tersedia langsung
Cek di sumber asli →- Tahun Terbit
- 2026
- Sumber Database
- DOAJ
- DOI
- 10.4274/jcrpe.galenos.2025.2025-6-14
- Akses
- Open Access ✓