Large language model bias auditing for periodontal diagnosis using an ambiguity-probe methodology: a pilot study
Abstract
Background
The use of Large Language Models (LLMs) in healthcare holds immense promise yet carries the risk of perpetuating social biases. While artificial intelligence (AI) fairness is a growing concern, a gap remains in understanding how these models perform under conditions of clinical ambiguity, a common feature of real-world practice.
Methods
We conducted a pilot study using an ambiguity-probe methodology with 42 sociodemographic personas and 15 clinical vignettes based on the 2018 classification of periodontal diseases. Ten vignettes were clear-cut scenarios with established ground truths, while five were intentionally ambiguous. OpenAI's GPT-4o and Google's Gemini 2.5 Pro were prompted to provide periodontal stage and grade assessments across 630 vignette-persona combinations per model.
Results
In clear-cut scenarios, GPT-4o demonstrated significantly higher combined (stage and grade) accuracy (70.5%) than Gemini 2.5 Pro (33.3%). However, a robust fairness analysis using cumulative link models with false discovery rate correction revealed no statistically significant sociodemographic bias in either model. This finding held across both clear-cut and ambiguous clinical scenarios.
Conclusion
To our knowledge, this is among the first studies to use simulated clinical ambiguity to reveal the distinct ethical fingerprints of LLMs in a dental context. While LLM performance gaps exist, our analysis decouples accuracy from fairness, demonstrating that both models maintain sociodemographic neutrality. We find that the observed errors reflect not bias but diagnostic boundary instability, highlighting a critical need for future research to distinguish between these two distinct types of model failure in order to build genuinely reliable AI.
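The factorial prompt design described above (42 personas fully crossed with 15 vignettes, yielding 630 prompts per model) can be sketched as follows. This is a minimal illustration only; the persona and vignette labels are hypothetical placeholders, not the study's actual materials.

```python
from itertools import product

# Hypothetical placeholders standing in for the study's materials:
# 42 sociodemographic personas and 15 clinical vignettes
# (10 clear-cut with established ground truths + 5 intentionally ambiguous).
personas = [f"persona_{i}" for i in range(1, 43)]
vignettes = (
    [f"clear_cut_{j}" for j in range(1, 11)]
    + [f"ambiguous_{j}" for j in range(1, 6)]
)

# A full crossing of personas and vignettes gives one prompt per combination.
prompts = [
    {"persona": p, "vignette": v, "ambiguous": v.startswith("ambiguous")}
    for p, v in product(personas, vignettes)
]

print(len(prompts))                                 # 630 combinations per model
print(sum(1 for pr in prompts if pr["ambiguous"]))  # 210 involve ambiguous vignettes
```

Crossing every persona with every vignette is what lets a fairness analysis attribute any accuracy difference to the persona dimension alone, since each sociodemographic profile sees an identical set of clinical scenarios.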
Topics & Keywords
Author (1)
Teerachate Nantakeeratipat
Quick Access
- Publication Year
- 2026
- Source Database
- DOAJ
- DOI
- 10.3389/fdgth.2025.1687820
- Access
- Open Access ✓