arXiv Open Access 2025

LogicQA: Logical Anomaly Detection with Vision Language Model Generated Questions

Yejin Kwon Daeun Moon Youngje Oh Hyunsoo Yoon
Lihat Sumber

Abstrak

Anomaly Detection (AD) focuses on detecting samples that differ from the standard pattern, making it a vital tool in process control. Logical anomalies may appear visually normal yet violate predefined constraints on object presence, arrangement, or quantity, depending on reasoning and explainability. We introduce LogicQA, a framework that enhances AD by providing industrial operators with explanations for logical anomalies. LogicQA compiles automatically generated questions into a checklist and collects responses to identify violations of logical constraints. LogicQA is training-free, annotation-free, and operates in a few-shot setting. We achieve state-of-the-art (SOTA) Logical AD performance on public benchmarks, MVTec LOCO AD, with an AUROC of 87.6 percent and an F1-max of 87.0 percent along with the explanations of anomalies. Also, our approach has shown outstanding performance on semiconductor SEM corporate data, further validating its effectiveness in industrial applications.

Topik & Kata Kunci

Penulis (4)

Y

Yejin Kwon

D

Daeun Moon

Y

Youngje Oh

H

Hyunsoo Yoon

Format Sitasi

Kwon, Y., Moon, D., Oh, Y., Yoon, H. (2025). LogicQA: Logical Anomaly Detection with Vision Language Model Generated Questions. https://arxiv.org/abs/2503.20252

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓