arXiv Open Access 2023

CausalVLR: A Toolbox and Benchmark for Visual-Linguistic Causal Reasoning

Yang Liu Weixing Chen Guanbin Li Liang Lin
Lihat Sumber

Abstrak

We present CausalVLR (Causal Visual-Linguistic Reasoning), an open-source toolbox containing a rich set of state-of-the-art causal relation discovery and causal inference methods for various visual-linguistic reasoning tasks, such as VQA, image/video captioning, medical report generation, model generalization and robustness, etc. These methods have been included in the toolbox with PyTorch implementations under NVIDIA computing system. It not only includes training and inference codes, but also provides model weights. We believe this toolbox is by far the most complete visual-linguitic causal reasoning toolbox. We wish that the toolbox and benchmark could serve the growing research community by providing a flexible toolkit to re-implement existing methods and develop their own new causal reasoning methods. Code and models are available at https://github.com/HCPLab-SYSU/CausalVLR. The project is under active development by HCP-Lab's contributors and we will keep this document updated.

Topik & Kata Kunci

Penulis (4)

Y

Yang Liu

W

Weixing Chen

G

Guanbin Li

L

Liang Lin

Format Sitasi

Liu, Y., Chen, W., Li, G., Lin, L. (2023). CausalVLR: A Toolbox and Benchmark for Visual-Linguistic Causal Reasoning. https://arxiv.org/abs/2306.17462

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2023
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓