Semantic Scholar Open Access 2024 381 sitasi

A multimodal generative AI copilot for human pathology

Ming Y. Lu Bowen Chen Drew F. K. Williamson Richard J. Chen Melissa Zhao +15 lainnya

Abstrak

Computational pathology1,2 has witnessed considerable progress in the development of both task-specific predictive models and task-agnostic self-supervised vision encoders3,4. However, despite the explosive growth of generative artificial intelligence (AI), there have been few studies on building general-purpose multimodal AI assistants and copilots5 tailored to pathology. Here we present PathChat, a vision-language generalist AI assistant for human pathology. We built PathChat by adapting a foundational vision encoder for pathology, combining it with a pretrained large language model and fine-tuning the whole system on over 456,000 diverse visual-language instructions consisting of 999,202 question and answer turns. We compare PathChat with several multimodal vision-language AI assistants and GPT-4V, which powers the commercially available multimodal general-purpose AI assistant ChatGPT-4 (ref. 6). PathChat achieved state-of-the-art performance on multiple-choice diagnostic questions from cases with diverse tissue origins and disease models. Furthermore, using open-ended questions and human expert evaluation, we found that overall PathChat produced more accurate and pathologist-preferable responses to diverse queries related to pathology. As an interactive vision-language AI copilot that can flexibly handle both visual and natural language inputs, PathChat may potentially find impactful applications in pathology education, research and human-in-the-loop clinical decision-making. PathChat, a multimodal generative AI copilot for human pathology, has been trained on a large dataset of visual-language instructions to interactively assist users with diverse pathology tasks.

Topik & Kata Kunci

Penulis (20)

M

Ming Y. Lu

B

Bowen Chen

D

Drew F. K. Williamson

R

Richard J. Chen

M

Melissa Zhao

A

Aaron K Chow

K

Kenji Ikemura

A

Ahrong Kim

D

Dimitra Pouli

A

Ankush U Patel

A

Amr Soliman

C

Chengkuan Chen

T

Tong Ding

J

Judy J. Wang

G

Georg K. Gerber

I

Ivy Liang

L

L. Le

A

Anil V. Parwani

L

Luca L. Weishaupt

F

Faisal Mahmood

Format Sitasi

Lu, M.Y., Chen, B., Williamson, D.F.K., Chen, R.J., Zhao, M., Chow, A.K. et al. (2024). A multimodal generative AI copilot for human pathology. https://doi.org/10.1038/s41586-024-07618-3

Akses Cepat

Lihat di Sumber doi.org/10.1038/s41586-024-07618-3
Informasi Jurnal
Tahun Terbit
2024
Bahasa
en
Total Sitasi
381×
Sumber Database
Semantic Scholar
DOI
10.1038/s41586-024-07618-3
Akses
Open Access ✓