arXiv Open Access 2025

Medicine on the Edge: Comparative Performance Analysis of On-Device LLMs for Clinical Reasoning

Leon Nissen Philipp Zagar Vishnu Ravi Aydin Zahedivash Lara Marie Reimer +3 lainnya

Lihat Sumber

Abstrak

The deployment of Large Language Models (LLM) on mobile devices offers significant potential for medical applications, enhancing privacy, security, and cost-efficiency by eliminating reliance on cloud-based services and keeping sensitive health data local. However, the performance and accuracy of on-device LLMs in real-world medical contexts remain underexplored. In this study, we benchmark publicly available on-device LLMs using the AMEGA dataset, evaluating accuracy, computational efficiency, and thermal limitation across various mobile devices. Our results indicate that compact general-purpose models like Phi-3 Mini achieve a strong balance between speed and accuracy, while medically fine-tuned models such as Med42 and Aloe attain the highest accuracy. Notably, deploying LLMs on older devices remains feasible, with memory constraints posing a greater challenge than raw processing power. Our study underscores the potential of on-device LLMs for healthcare while emphasizing the need for more efficient inference and models tailored to real-world clinical reasoning.

Topik & Kata Kunci

cs.CL

Penulis (8)

Leon Nissen

Philipp Zagar

Vishnu Ravi

Aydin Zahedivash

Lara Marie Reimer

Stephan Jonas

Oliver Aalami

Paul Schmiedmayer

Format Sitasi

APA MLA BibTeX

Nissen, L., Zagar, P., Ravi, V., Zahedivash, A., Reimer, L.M., Jonas, S. et al. (2025). Medicine on the Edge: Comparative Performance Analysis of On-Device LLMs for Clinical Reasoning. https://arxiv.org/abs/2502.08954

Akses Cepat

Lihat di Sumber

Informasi Jurnal

Tahun Terbit: 2025
Bahasa: en
Sumber Database: arXiv
Akses: Open Access ✓