arXiv Open Access 2024

Docling Technical Report

Christoph Auer, Maksym Lysak, Ahmed Nassar, Michele Dolfi, Nikolaos Livathinos, +14 others

Abstract

This technical report introduces Docling, an easy-to-use, self-contained, MIT-licensed open-source package for PDF document conversion. It is powered by state-of-the-art specialized AI models for layout analysis (DocLayNet) and table structure recognition (TableFormer), and runs efficiently on commodity hardware within a small resource budget. The code interface allows for easy extensibility and the addition of new features and models.

Authors (19)

Christoph Auer
Maksym Lysak
Ahmed Nassar
Michele Dolfi
Nikolaos Livathinos
Panos Vagenas
Cesar Berrospi Ramis
Matteo Omenetti
Fabian Lindlbauer
Kasper Dinkla
Lokesh Mishra
Yusik Kim
Shubham Gupta
Rafael Teixeira de Lima
Valery Weber
Lucas Morin
Ingmar Meijer
Viktor Kuropiatnyk
Peter W. J. Staar

Citation Format

Auer, C., Lysak, M., Nassar, A., Dolfi, M., Livathinos, N., Vagenas, P. et al. (2024). Docling Technical Report. https://arxiv.org/abs/2408.09869

Journal Information
Publication Year: 2024
Language: en
Source Database: arXiv
Access: Open Access