arXiv Open Access 2025

Pathology Report Generation and Multimodal Representation Learning for Cutaneous Melanocytic Lesions

Ruben T. Lucassen, Sander P. J. Moonemans, Tijn van de Luijtgaarden, Gerben E. Breimer, Willeke A. M. Blokx, Mitko Veta
Abstract

Millions of melanocytic skin lesions are examined by pathologists each year, the majority of which are common nevi (i.e., ordinary moles). While most of these lesions can be diagnosed in seconds, writing the corresponding pathology report is much more time-consuming. Automating part of the report writing could, therefore, alleviate the increasing workload of pathologists. In this work, we develop a vision-language model specifically for the pathology domain of cutaneous melanocytic lesions. The model follows the Contrastive Captioner framework and was trained and evaluated using a melanocytic lesion dataset of 42,512 H&E-stained whole slide images and 19,645 corresponding pathology reports. Our results show that the quality scores of model-generated reports were on par with those of pathologist-written reports for common nevi, as assessed by an expert pathologist in a reader study. While report generation proved to be more difficult for rare melanocytic lesion subtypes, the cross-modal retrieval performance for these cases was considerably better.

Authors (6)

Ruben T. Lucassen
Sander P. J. Moonemans
Tijn van de Luijtgaarden
Gerben E. Breimer
Willeke A. M. Blokx
Mitko Veta

Citation Format

Lucassen, R.T., Moonemans, S.P.J., Luijtgaarden, T.v.d., Breimer, G.E., Blokx, W.A.M., Veta, M. (2025). Pathology Report Generation and Multimodal Representation Learning for Cutaneous Melanocytic Lesions. https://arxiv.org/abs/2502.19293

Journal Information
Publication Year: 2025
Language: en
Source Database: arXiv
Access: Open Access ✓