arXiv Open Access 2026

Referring Industrial Anomaly Segmentation

Pengfei Yue Xiaokang Jiang Yilin Lu Jianghang Lin Shengchuan Zhang +1 lainnya
Lihat Sumber

Abstrak

Industrial Anomaly Detection (IAD) is vital for manufacturing, yet traditional methods face significant challenges: unsupervised approaches yield rough localizations requiring manual thresholds, while supervised methods overfit due to scarce, imbalanced data. Both suffer from the "One Anomaly Class, One Model" limitation. To address this, we propose Referring Industrial Anomaly Segmentation (RIAS), a paradigm leveraging language to guide detection. RIAS generates precise masks from text descriptions without manual thresholds and uses universal prompts to detect diverse anomalies with a single model. We introduce the MVTec-Ref dataset to support this, designed with diverse referring expressions and focusing on anomaly patterns, notably with 95% small anomalies. We also propose the Dual Query Token with Mask Group Transformer (DQFormer) benchmark, enhanced by Language-Gated Multi-Level Aggregation (LMA) to improve multi-scale segmentation. Unlike traditional methods using redundant queries, DQFormer employs only "Anomaly" and "Background" tokens for efficient visual-textual integration. Experiments demonstrate RIAS's effectiveness in advancing IAD toward open-set capabilities. Code: https://github.com/swagger-coder/RIAS-MVTec-Ref.

Topik & Kata Kunci

Penulis (6)

P

Pengfei Yue

X

Xiaokang Jiang

Y

Yilin Lu

J

Jianghang Lin

S

Shengchuan Zhang

L

Liujuan Cao

Format Sitasi

Yue, P., Jiang, X., Lu, Y., Lin, J., Zhang, S., Cao, L. (2026). Referring Industrial Anomaly Segmentation. https://arxiv.org/abs/2602.03673

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2026
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓