DOAJ Open Access 2022

Cor<i>Deep</i> and the Sacrobosco Dataset: Detection of Visual Elements in Historical Documents

Jochen Büttner Julius Martinetz Hassan El-Hajj Matteo Valleriani

Abstrak

Recent advances in object detection facilitated by deep learning have led to numerous solutions in a myriad of fields ranging from medical diagnosis to autonomous driving. However, historical research is yet to reap the benefits of such advances. This is generally due to the low number of large, coherent, and annotated datasets of historical documents, as well as the overwhelming focus on Optical Character Recognition to support the analysis of historical documents. In this paper, we highlight the importance of visual elements, in particular illustrations in historical documents, and offer a public multi-class historical visual element dataset based on the <i>Sphaera</i> corpus. Additionally, we train an image extraction model based on YOLO architecture and publish it through a publicly available web-service to detect and extract multi-class images from historical documents in an effort to bridge the gap between traditional and computational approaches in historical studies.

Penulis (4)

J

Jochen Büttner

J

Julius Martinetz

H

Hassan El-Hajj

M

Matteo Valleriani

Format Sitasi

Büttner, J., Martinetz, J., El-Hajj, H., Valleriani, M. (2022). Cor<i>Deep</i> and the Sacrobosco Dataset: Detection of Visual Elements in Historical Documents. https://doi.org/10.3390/jimaging8100285

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →
Lihat di Sumber doi.org/10.3390/jimaging8100285
Informasi Jurnal
Tahun Terbit
2022
Sumber Database
DOAJ
DOI
10.3390/jimaging8100285
Akses
Open Access ✓