Semantic Scholar
Open Access
2021
464 sitasi
VT-ADL: A Vision Transformer Network for Image Anomaly Detection and Localization
P. Mishra
Riccardo Verk
Daniele Fornasier
C. Piciarelli
G. Foresti
Abstrak
We present a transformer-based image anomaly detection and localization network. Our proposed model is a combination of a reconstruction-based approach and patch embedding. The use of transformer networks helps preserving the spatial information of the embedded patches, which is later processed by a Gaussian mixture density network to localize the anomalous areas. In addition, we also publish BTAD, a real-world industrial anomaly dataset. Our results are compared with other state-of-the-art algorithms using publicly available datasets like MNIST and MVTec.
Topik & Kata Kunci
Penulis (5)
P
P. Mishra
R
Riccardo Verk
D
Daniele Fornasier
C
C. Piciarelli
G
G. Foresti
Akses Cepat
Informasi Jurnal
- Tahun Terbit
- 2021
- Bahasa
- en
- Total Sitasi
- 464×
- Sumber Database
- Semantic Scholar
- DOI
- 10.1109/ISIE45552.2021.9576231
- Akses
- Open Access ✓