arXiv Open Access 2026

Fine-tuned Vision Language Model for Localization of Parasitic Eggs in Microscopic Images

Chan Hao Sien Hezerul Abdul Karim Nouar AlDahoul
Lihat Sumber

Abstrak

Soil-transmitted helminth (STH) infections continuously affect a large proportion of the global population, particularly in tropical and sub-tropical regions, where access to specialized diagnostic expertise is limited. Although manual microscopic diagnosis of parasitic eggs remains the diagnostic gold standard, the approach can be labour-intensive, time-consuming, and prone to human error. This paper aims to utilize a vision language model (VLM) such as Microsoft Florence that was fine-tuned to localize all parasitic eggs within microscopic images. The preliminary results show that our localization VLM performs comparatively better than the other object detection methods, such as EfficientDet, with an mIOU of 0.94. This finding demonstrates the potential of the proposed VLM to serve as a core component of an automated framework, offering a scalable engineering solution for intelligent parasitological diagnosis.

Topik & Kata Kunci

Penulis (3)

C

Chan Hao Sien

H

Hezerul Abdul Karim

N

Nouar AlDahoul

Format Sitasi

Sien, C.H., Karim, H.A., AlDahoul, N. (2026). Fine-tuned Vision Language Model for Localization of Parasitic Eggs in Microscopic Images. https://arxiv.org/abs/2602.13712

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2026
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓