arXiv Open Access 2024

Behavioral Bias of Vision-Language Models: A Behavioral Finance View

Yuhang Xiao Yudi Lin Ming-Chang Chiu

Lihat Sumber

Abstrak

Large Vision-Language Models (LVLMs) evolve rapidly as Large Language Models (LLMs) was equipped with vision modules to create more human-like models. However, we should carefully evaluate their applications in different domains, as they may possess undesired biases. Our work studies the potential behavioral biases of LVLMs from a behavioral finance perspective, an interdisciplinary subject that jointly considers finance and psychology. We propose an end-to-end framework, from data collection to new evaluation metrics, to assess LVLMs' reasoning capabilities and the dynamic behaviors manifested in two established human financial behavioral biases: recency bias and authority bias. Our evaluations find that recent open-source LVLMs such as LLaVA-NeXT, MobileVLM-V2, Mini-Gemini, MiniCPM-Llama3-V 2.5 and Phi-3-vision-128k suffer significantly from these two biases, while the proprietary model GPT-4o is negligibly impacted. Our observations highlight directions in which open-source models can improve. The code is available at https://github.com/mydcxiao/vlm_behavioral_fin.

Topik & Kata Kunci

cs.CL cs.AI

Penulis (3)

Yuhang Xiao

Yudi Lin

Ming-Chang Chiu

Format Sitasi

APA MLA BibTeX

Xiao, Y., Lin, Y., Chiu, M. (2024). Behavioral Bias of Vision-Language Models: A Behavioral Finance View. https://arxiv.org/abs/2409.15256

Akses Cepat

Lihat di Sumber

Informasi Jurnal

Tahun Terbit: 2024
Bahasa: en
Sumber Database: arXiv
Akses: Open Access ✓