arXiv Open Access 2024

A Multimodal Vision Foundation Model for Clinical Dermatology

Siyuan Yan Zhen Yu Clare Primiero Cristina Vico-Alonso Zhonghua Wang +20 lainnya
Lihat Sumber

Abstrak

Diagnosing and treating skin diseases require advanced visual skills across domains and the ability to synthesize information from multiple imaging modalities. While current deep learning models excel at specific tasks like skin cancer diagnosis from dermoscopic images, they struggle to meet the complex, multimodal requirements of clinical practice. Here, we introduce PanDerm, a multimodal dermatology foundation model pretrained through self-supervised learning on over 2 million real-world skin disease images from 11 clinical institutions across 4 imaging modalities. We evaluated PanDerm on 28 diverse benchmarks, including skin cancer screening, risk stratification, differential diagnosis of common and rare skin conditions, lesion segmentation, longitudinal monitoring, and metastasis prediction and prognosis. PanDerm achieved state-of-the-art performance across all evaluated tasks, often outperforming existing models when using only 10% of labeled data. We conducted three reader studies to assess PanDerm's potential clinical utility. PanDerm outperformed clinicians by 10.2% in early-stage melanoma detection through longitudinal analysis, improved clinicians' skin cancer diagnostic accuracy by 11% on dermoscopy images, and enhanced non-dermatologist healthcare providers' differential diagnosis by 16.5% across 128 skin conditions on clinical photographs. These results demonstrate PanDerm's potential to improve patient care across diverse clinical scenarios and serve as a model for developing multimodal foundation models in other medical specialties, potentially accelerating the integration of AI support in healthcare. The code can be found at https://github.com/SiyuanYan1/PanDerm.

Topik & Kata Kunci

Penulis (25)

S

Siyuan Yan

Z

Zhen Yu

C

Clare Primiero

C

Cristina Vico-Alonso

Z

Zhonghua Wang

L

Litao Yang

P

Philipp Tschandl

M

Ming Hu

L

Lie Ju

G

Gin Tan

V

Vincent Tang

A

Aik Beng Ng

D

David Powell

P

Paul Bonnington

S

Simon See

E

Elisabetta Magnaterra

P

Peter Ferguson

J

Jennifer Nguyen

P

Pascale Guitera

J

Jose Banuls

M

Monika Janda

V

Victoria Mar

H

Harald Kittler

H

H. Peter Soyer

Z

Zongyuan Ge

Format Sitasi

Yan, S., Yu, Z., Primiero, C., Vico-Alonso, C., Wang, Z., Yang, L. et al. (2024). A Multimodal Vision Foundation Model for Clinical Dermatology. https://arxiv.org/abs/2410.15038

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2024
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓