arXiv Open Access 2024

A Multimodal Vision Foundation Model for Clinical Dermatology

Siyuan Yan Zhen Yu Clare Primiero Cristina Vico-Alonso Zhonghua Wang +20 lainnya

Lihat Sumber

Abstrak

Diagnosing and treating skin diseases require advanced visual skills across domains and the ability to synthesize information from multiple imaging modalities. While current deep learning models excel at specific tasks like skin cancer diagnosis from dermoscopic images, they struggle to meet the complex, multimodal requirements of clinical practice. Here, we introduce PanDerm, a multimodal dermatology foundation model pretrained through self-supervised learning on over 2 million real-world skin disease images from 11 clinical institutions across 4 imaging modalities. We evaluated PanDerm on 28 diverse benchmarks, including skin cancer screening, risk stratification, differential diagnosis of common and rare skin conditions, lesion segmentation, longitudinal monitoring, and metastasis prediction and prognosis. PanDerm achieved state-of-the-art performance across all evaluated tasks, often outperforming existing models when using only 10% of labeled data. We conducted three reader studies to assess PanDerm's potential clinical utility. PanDerm outperformed clinicians by 10.2% in early-stage melanoma detection through longitudinal analysis, improved clinicians' skin cancer diagnostic accuracy by 11% on dermoscopy images, and enhanced non-dermatologist healthcare providers' differential diagnosis by 16.5% across 128 skin conditions on clinical photographs. These results demonstrate PanDerm's potential to improve patient care across diverse clinical scenarios and serve as a model for developing multimodal foundation models in other medical specialties, potentially accelerating the integration of AI support in healthcare. The code can be found at https://github.com/SiyuanYan1/PanDerm.

Topik & Kata Kunci

cs.CV cs.AI

Penulis (25)

Siyuan Yan

Zhen Yu

Clare Primiero

Cristina Vico-Alonso

Zhonghua Wang

Litao Yang

Philipp Tschandl

Ming Hu

Lie Ju

Gin Tan

Vincent Tang

Aik Beng Ng

David Powell

Paul Bonnington

Simon See

Elisabetta Magnaterra

Peter Ferguson

Jennifer Nguyen

Pascale Guitera

Jose Banuls

Monika Janda

Victoria Mar

Harald Kittler

H. Peter Soyer

Zongyuan Ge

Format Sitasi

APA MLA BibTeX

Yan, S., Yu, Z., Primiero, C., Vico-Alonso, C., Wang, Z., Yang, L. et al. (2024). A Multimodal Vision Foundation Model for Clinical Dermatology. https://arxiv.org/abs/2410.15038

Akses Cepat

Lihat di Sumber

Informasi Jurnal

Tahun Terbit: 2024
Bahasa: en
Sumber Database: arXiv
Akses: Open Access ✓