arXiv Open Access 2025

A Synthetic Data-Driven Radiology Foundation Model for Pan-tumor Clinical Diagnosis

Wenhui Lei Hanyu Chen Zitian Zhang Luyang Luo Qiong Xiao +11 lainnya
Lihat Sumber

Abstrak

AI-assisted imaging made substantial advances in tumor diagnosis and management. However, a major barrier to developing robust oncology foundation models is the scarcity of large-scale, high-quality annotated datasets, which are limited by privacy restrictions and the high cost of manual labeling. To address this gap, we present PASTA, a pan-tumor radiology foundation model built on PASTA-Gen, a synthetic data framework that generated 30,000 3D CT scans with pixel-level lesion masks and structured reports of tumors across ten organ systems. Leveraging this resource, PASTA achieves state-of-the-art performance on 45 of 46 oncology tasks, including non-contrast CT tumor screening, lesion segmentation, structured reporting, tumor staging, survival prediction, and MRI-modality transfer. To assess clinical applicability, we developed PASTA-AID, a clinical decision support system, and ran a retrospective simulated clinical trial across two scenarios. For pan-tumor screening on plain CT with fixed reading time, PASTA-AID increased radiologists' throughput by 11.1-25.1% and improved sensitivity by 17.0-31.4% and precision by 10.5-24.9%; additionally, in a diagnosis-aid workflow, it reduced segmentation time by up to 78.2% and reporting time by up to 36.5%. Beyond gains in accuracy and efficiency, PASTA-AID narrowed the expertise gap, enabling less-experienced radiologists to approach expert-level performance. Together, this work establishes an end-to-end, synthetic data-driven pipeline spanning data generation, model development, and clinical validation, thereby demonstrating substantial potential for pan-tumor research and clinical translation.

Topik & Kata Kunci

Penulis (16)

W

Wenhui Lei

H

Hanyu Chen

Z

Zitian Zhang

L

Luyang Luo

Q

Qiong Xiao

Y

Yannian Gu

P

Peng Gao

Y

Yankai Jiang

C

Ci Wang

G

Guangtao Wu

T

Tongjia Xu

Y

Yingjie Zhang

P

Pranav Rajpurkar

X

Xiaofan Zhang

S

Shaoting Zhang

Z

Zhenning Wang

Format Sitasi

Lei, W., Chen, H., Zhang, Z., Luo, L., Xiao, Q., Gu, Y. et al. (2025). A Synthetic Data-Driven Radiology Foundation Model for Pan-tumor Clinical Diagnosis. https://arxiv.org/abs/2502.06171

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓