arXiv Open Access 2025

Medicine on the Edge: Comparative Performance Analysis of On-Device LLMs for Clinical Reasoning

Leon Nissen Philipp Zagar Vishnu Ravi Aydin Zahedivash Lara Marie Reimer +3 lainnya
Lihat Sumber

Abstrak

The deployment of Large Language Models (LLM) on mobile devices offers significant potential for medical applications, enhancing privacy, security, and cost-efficiency by eliminating reliance on cloud-based services and keeping sensitive health data local. However, the performance and accuracy of on-device LLMs in real-world medical contexts remain underexplored. In this study, we benchmark publicly available on-device LLMs using the AMEGA dataset, evaluating accuracy, computational efficiency, and thermal limitation across various mobile devices. Our results indicate that compact general-purpose models like Phi-3 Mini achieve a strong balance between speed and accuracy, while medically fine-tuned models such as Med42 and Aloe attain the highest accuracy. Notably, deploying LLMs on older devices remains feasible, with memory constraints posing a greater challenge than raw processing power. Our study underscores the potential of on-device LLMs for healthcare while emphasizing the need for more efficient inference and models tailored to real-world clinical reasoning.

Topik & Kata Kunci

Penulis (8)

L

Leon Nissen

P

Philipp Zagar

V

Vishnu Ravi

A

Aydin Zahedivash

L

Lara Marie Reimer

S

Stephan Jonas

O

Oliver Aalami

P

Paul Schmiedmayer

Format Sitasi

Nissen, L., Zagar, P., Ravi, V., Zahedivash, A., Reimer, L.M., Jonas, S. et al. (2025). Medicine on the Edge: Comparative Performance Analysis of On-Device LLMs for Clinical Reasoning. https://arxiv.org/abs/2502.08954

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓