arXiv Open Access 2025

Asking the Right Questions: Benchmarking Large Language Models in the Development of Clinical Consultation Templates

Liam G. McCoy Fateme Nateghi Haredasht Kanav Chopra David Wu David JH Wu +13 lainnya
Lihat Sumber

Abstrak

This study evaluates the capacity of large language models (LLMs) to generate structured clinical consultation templates for electronic consultation. Using 145 expert-crafted templates developed and routinely used by Stanford's eConsult team, we assess frontier models -- including o3, GPT-4o, Kimi K2, Claude 4 Sonnet, Llama 3 70B, and Gemini 2.5 Pro -- for their ability to produce clinically coherent, concise, and prioritized clinical question schemas. Through a multi-agent pipeline combining prompt optimization, semantic autograding, and prioritization analysis, we show that while models like o3 achieve high comprehensiveness (up to 92.2\%), they consistently generate excessively long templates and fail to correctly prioritize the most clinically important questions under length constraints. Performance varies across specialties, with significant degradation in narrative-driven fields such as psychiatry and pain medicine. Our findings demonstrate that LLMs can enhance structured clinical information exchange between physicians, while highlighting the need for more robust evaluation methods that capture a model's ability to prioritize clinically salient information within the time constraints of real-world physician communication.

Topik & Kata Kunci

Penulis (18)

L

Liam G. McCoy

F

Fateme Nateghi Haredasht

K

Kanav Chopra

D

David Wu

D

David JH Wu

A

Abass Conteh

S

Sarita Khemani

S

Saloni Kumar Maharaj

V

Vishnu Ravi

A

Arth Pahwa

Y

Yingjie Weng

L

Leah Rosengaus

L

Lena Giang

K

Kelvin Zhenghao Li

O

Olivia Jee

D

Daniel Shirvani

E

Ethan Goh

J

Jonathan H. Chen

Format Sitasi

McCoy, L.G., Haredasht, F.N., Chopra, K., Wu, D., Wu, D.J., Conteh, A. et al. (2025). Asking the Right Questions: Benchmarking Large Language Models in the Development of Clinical Consultation Templates. https://arxiv.org/abs/2508.01159

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓