arXiv Open Access 2025

DomainCQA: Crafting Knowledge-Intensive QA from Domain-Specific Charts

Yujing Lu Ling Zhong Jing Yang Weiming Li Peng Wei +3 lainnya
Lihat Sumber

Abstrak

Chart Question Answering (CQA) evaluates Multimodal Large Language Models (MLLMs) on visual understanding and reasoning over chart data. However, existing benchmarks mostly test surface-level parsing, such as reading labels and legends, while overlooking deeper scientific reasoning. We propose DomainCQA, a framework for constructing domain-specific CQA benchmarks that emphasize both visual comprehension and knowledge-intensive reasoning. It integrates complexity-aware chart selection, multitier QA generation, and expert validation. Applied to astronomy, DomainCQA yields AstroChart, a benchmark of 1,690 QA pairs over 482 charts, exposing persistent weaknesses in fine-grained perception, numerical reasoning, and domain knowledge integration across 21 MLLMs. Fine-tuning on AstroChart improves performance across fundamental and advanced tasks. Pilot QA sets in biochemistry, economics, medicine, and social science further demonstrate DomainCQA's generality. Together, our results establish DomainCQA as a unified pipeline for constructing and augmenting domain-specific chart reasoning benchmarks.

Topik & Kata Kunci

Penulis (8)

Y

Yujing Lu

L

Ling Zhong

J

Jing Yang

W

Weiming Li

P

Peng Wei

Y

Yongheng Wang

M

Manni Duan

Q

Qing Zhang

Format Sitasi

Lu, Y., Zhong, L., Yang, J., Li, W., Wei, P., Wang, Y. et al. (2025). DomainCQA: Crafting Knowledge-Intensive QA from Domain-Specific Charts. https://arxiv.org/abs/2503.19498

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓