arXiv
Open Access
2024
Docling Technical Report
Christoph Auer
Maksym Lysak
Ahmed Nassar
Michele Dolfi
Nikolaos Livathinos
+14 others
Abstract
This technical report introduces Docling, an easy-to-use, self-contained, MIT-licensed open-source package for PDF document conversion. It is powered by state-of-the-art specialized AI models for layout analysis (DocLayNet) and table structure recognition (TableFormer), and runs efficiently on commodity hardware within a small resource budget. The code interface allows for easy extensibility and the addition of new features and models.
Authors (19)
Christoph Auer
Maksym Lysak
Ahmed Nassar
Michele Dolfi
Nikolaos Livathinos
Panos Vagenas
Cesar Berrospi Ramis
Matteo Omenetti
Fabian Lindlbauer
Kasper Dinkla
Lokesh Mishra
Yusik Kim
Shubham Gupta
Rafael Teixeira de Lima
Valery Weber
Lucas Morin
Ingmar Meijer
Viktor Kuropiatnyk
Peter W. J. Staar
Journal Information
- Publication Year: 2024
- Language: en
- Database Source: arXiv
- Access: Open Access ✓