DOAJ
Open Access
2025
Evaluating base and retrieval augmented LLMs with document or online support for evidence based neurology
Lars Masanneck
Sven G. Meuth
Marc Pawlitzki
Abstrak
Abstract Effectively managing evidence-based information is increasingly challenging. This study tested large language models (LLMs), including document- and online-enabled retrieval-augmented generation (RAG) systems, using 13 recent neurology guidelines across 130 questions. Results showed substantial variability. RAG improved accuracy compared to base models but still produced potentially harmful answers. RAG-based systems performed worse on case-based than knowledge-based questions. Further refinement and improved regulation is needed for safe clinical integration of RAG-enhanced LLMs.
Topik & Kata Kunci
Penulis (3)
L
Lars Masanneck
S
Sven G. Meuth
M
Marc Pawlitzki
Akses Cepat
Informasi Jurnal
- Tahun Terbit
- 2025
- Sumber Database
- DOAJ
- DOI
- 10.1038/s41746-025-01536-y
- Akses
- Open Access ✓