arXiv Open Access 2023

A tailored Handwritten-Text-Recognition System for Medieval Latin

Philipp Koch Gilary Vera Nuñez Esteban Garces Arias Christian Heumann Matthias Schöffel +2 lainnya

Lihat Sumber

Abstrak

The Bavarian Academy of Sciences and Humanities aims to digitize its Medieval Latin Dictionary. This dictionary entails record cards referring to lemmas in medieval Latin, a low-resource language. A crucial step of the digitization process is the Handwritten Text Recognition (HTR) of the handwritten lemmas found on these record cards. In our work, we introduce an end-to-end pipeline, tailored to the medieval Latin dictionary, for locating, extracting, and transcribing the lemmas. We employ two state-of-the-art (SOTA) image segmentation models to prepare the initial data set for the HTR task. Furthermore, we experiment with different transformer-based models and conduct a set of experiments to explore the capabilities of different combinations of vision encoders with a GPT-2 decoder. Additionally, we also apply extensive data augmentation resulting in a highly competitive model. The best-performing setup achieved a Character Error Rate (CER) of 0.015, which is even superior to the commercial Google Cloud Vision model, and shows more stable performance.

Topik & Kata Kunci

cs.CV cs.CL cs.CY cs.LG stat.ML

Penulis (7)

Philipp Koch

Gilary Vera Nuñez

Esteban Garces Arias

Christian Heumann

Matthias Schöffel

Alexander Häberlin

Matthias Aßenmacher

Format Sitasi

APA MLA BibTeX

Koch, P., Nuñez, G.V., Arias, E.G., Heumann, C., Schöffel, M., Häberlin, A. et al. (2023). A tailored Handwritten-Text-Recognition System for Medieval Latin. https://arxiv.org/abs/2308.09368

Akses Cepat

Lihat di Sumber

Informasi Jurnal

Tahun Terbit: 2023
Bahasa: en
Sumber Database: arXiv
Akses: Open Access ✓