arXiv Open Access 2021

Document Collection Visual Question Answering

Rubèn Tito Dimosthenis Karatzas Ernest Valveny

Lihat Sumber

Abstrak

Current tasks and methods in Document Understanding aims to process documents as single elements. However, documents are usually organized in collections (historical records, purchase invoices), that provide context useful for their interpretation. To address this problem, we introduce Document Collection Visual Question Answering (DocCVQA) a new dataset and related task, where questions are posed over a whole collection of document images and the goal is not only to provide the answer to the given question, but also to retrieve the set of documents that contain the information needed to infer the answer. Along with the dataset we propose a new evaluation metric and baselines which provide further insights to the new dataset and task.

Topik & Kata Kunci

cs.IR

Penulis (3)

Rubèn Tito

Dimosthenis Karatzas

Ernest Valveny

Format Sitasi

APA MLA BibTeX

Tito, R., Karatzas, D., Valveny, E. (2021). Document Collection Visual Question Answering. https://arxiv.org/abs/2104.14336

Akses Cepat

Lihat di Sumber

Informasi Jurnal

Tahun Terbit: 2021
Bahasa: en
Sumber Database: arXiv
Akses: Open Access ✓