arXiv Open Access 2025

Vision Foundation Models for Computed Tomography

Suraj Pai Ibrahim Hadzic Dennis Bontempi Keno Bressem Benjamin H. Kann +3 lainnya
Lihat Sumber

Abstrak

Foundation models (FMs) have shown transformative potential in radiology by performing diverse, complex tasks across imaging modalities. Here, we developed CT-FM, a large-scale 3D image-based pre-trained model designed explicitly for various radiological tasks. CT-FM was pre-trained using 148,000 computed tomography (CT) scans from the Imaging Data Commons through label-agnostic contrastive learning. We evaluated CT-FM across four categories of tasks, namely, whole-body and tumor segmentation, head CT triage, medical image retrieval, and semantic understanding, showing superior performance against state-of-the-art models. Beyond quantitative success, CT-FM demonstrated the ability to cluster regions anatomically and identify similar anatomical and structural concepts across scans. Furthermore, it remained robust across test-retest settings and indicated reasonable salient regions attached to its embeddings. This study demonstrates the value of large-scale medical imaging foundation models and by open-sourcing the model weights, code, and data, aims to support more adaptable, reliable, and interpretable AI solutions in radiology.

Topik & Kata Kunci

Penulis (8)

S

Suraj Pai

I

Ibrahim Hadzic

D

Dennis Bontempi

K

Keno Bressem

B

Benjamin H. Kann

A

Andriy Fedorov

R

Raymond H. Mak

H

Hugo J. W. L. Aerts

Format Sitasi

Pai, S., Hadzic, I., Bontempi, D., Bressem, K., Kann, B.H., Fedorov, A. et al. (2025). Vision Foundation Models for Computed Tomography. https://arxiv.org/abs/2501.09001

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓