arXiv Open Access 2023

A tailored Handwritten-Text-Recognition System for Medieval Latin

Philipp Koch Gilary Vera Nuñez Esteban Garces Arias Christian Heumann Matthias Schöffel +2 lainnya
Lihat Sumber

Abstrak

The Bavarian Academy of Sciences and Humanities aims to digitize its Medieval Latin Dictionary. This dictionary entails record cards referring to lemmas in medieval Latin, a low-resource language. A crucial step of the digitization process is the Handwritten Text Recognition (HTR) of the handwritten lemmas found on these record cards. In our work, we introduce an end-to-end pipeline, tailored to the medieval Latin dictionary, for locating, extracting, and transcribing the lemmas. We employ two state-of-the-art (SOTA) image segmentation models to prepare the initial data set for the HTR task. Furthermore, we experiment with different transformer-based models and conduct a set of experiments to explore the capabilities of different combinations of vision encoders with a GPT-2 decoder. Additionally, we also apply extensive data augmentation resulting in a highly competitive model. The best-performing setup achieved a Character Error Rate (CER) of 0.015, which is even superior to the commercial Google Cloud Vision model, and shows more stable performance.

Penulis (7)

P

Philipp Koch

G

Gilary Vera Nuñez

E

Esteban Garces Arias

C

Christian Heumann

M

Matthias Schöffel

A

Alexander Häberlin

M

Matthias Aßenmacher

Format Sitasi

Koch, P., Nuñez, G.V., Arias, E.G., Heumann, C., Schöffel, M., Häberlin, A. et al. (2023). A tailored Handwritten-Text-Recognition System for Medieval Latin. https://arxiv.org/abs/2308.09368

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2023
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓