arXiv Open Access 2025

MIRAGE: Multimodal foundation model and benchmark for comprehensive retinal OCT image analysis

José Morano, Botond Fazekas, Emese Sükei, Ronald Fecso, Taha Emre, +5 others

Abstract

Artificial intelligence (AI) has become a fundamental tool for assisting clinicians in analyzing ophthalmic images, such as optical coherence tomography (OCT). However, developing AI models often requires extensive annotation, and existing models tend to underperform on independent, unseen data. Foundation models (FMs), large AI models trained on vast unlabeled datasets, have shown promise in overcoming these challenges. Nonetheless, available FMs for ophthalmology lack extensive validation, especially for segmentation tasks, and focus on a single imaging modality. In this context, we propose MIRAGE, a novel multimodal FM for the analysis of OCT and scanning laser ophthalmoscopy (SLO) images. Additionally, we propose a new evaluation benchmark with OCT/SLO classification and segmentation tasks. The comparison with general and specialized FMs and segmentation methods shows the superiority of MIRAGE in both types of tasks, highlighting its suitability as a basis for the development of robust AI systems for retinal OCT image analysis. Both MIRAGE and the evaluation benchmark are publicly available: https://github.com/j-morano/MIRAGE.

Authors (10)

José Morano
Botond Fazekas
Emese Sükei
Ronald Fecso
Taha Emre
Markus Gumpinger
Georg Faustmann
Marzieh Oghbaie
Ursula Schmidt-Erfurth
Hrvoje Bogunović

Citation Format

Morano, J., Fazekas, B., Sükei, E., Fecso, R., Emre, T., Gumpinger, M. et al. (2025). MIRAGE: Multimodal foundation model and benchmark for comprehensive retinal OCT image analysis. https://arxiv.org/abs/2506.08900

Journal Information
Publication Year: 2025
Language: en
Source Database: arXiv
Access: Open Access