arXiv Open Access 2023

Fine-Grained Product Classification on Leaflet Advertisements

Daniel Ladwig Bianca Lamm Janis Keuper
Lihat Sumber

Abstrak

In this paper, we describe a first publicly available fine-grained product recognition dataset based on leaflet images. Using advertisement leaflets, collected over several years from different European retailers, we provide a total of 41.6k manually annotated product images in 832 classes. Further, we investigate three different approaches for this fine-grained product classification task, Classification by Image, by Text, as well as by Image and Text. The approach "Classification by Text" uses the text extracted directly from the leaflet product images. We show, that the combination of image and text as input improves the classification of visual difficult to distinguish products. The final model leads to an accuracy of 96.4% with a Top-3 score of 99.2%. We release our code at https://github.com/ladwigd/Leaflet-Product-Classification.

Topik & Kata Kunci

Penulis (3)

D

Daniel Ladwig

B

Bianca Lamm

J

Janis Keuper

Format Sitasi

Ladwig, D., Lamm, B., Keuper, J. (2023). Fine-Grained Product Classification on Leaflet Advertisements. https://arxiv.org/abs/2305.03706

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2023
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓