arXiv Open Access 2025

Finding Culture-Sensitive Neurons in Vision-Language Models

Xiutian Zhao Rochelle Choenni Rohit Saxena Ivan Titov

Abstract

Despite their impressive performance, vision-language models (VLMs) still struggle on culturally situated inputs. To understand how VLMs process culturally grounded information, we study the presence of culture-sensitive neurons, i.e., neurons whose activations show preferential sensitivity to inputs associated with particular cultural contexts. We examine whether such neurons are important for culturally diverse visual question answering and where they are located. Using the CVQA benchmark, we identify culture-selective neurons and perform causal tests by deactivating the neurons flagged by different identification methods. Experiments on three VLMs across 25 cultural groups demonstrate the existence of neurons whose ablation disproportionately harms performance on questions about the corresponding cultures while having minimal effect on others. Moreover, we propose a new margin-based selector, Contrastive Activation Selection (CAS), and show that it outperforms existing probability- and entropy-based methods in identifying culture-sensitive neurons. Finally, our layer-wise analyses reveal that such neurons tend to cluster in certain decoder layers. Overall, our findings shed new light on the internal organization of multimodal representations.
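
The abstract does not define how CAS computes its margin, so the following is only a minimal illustrative sketch of what a margin-based selector followed by an ablation step might look like. The scoring rule (a neuron's mean activation on the target culture minus its highest mean activation on any other culture), the function names, and the toy activations are all assumptions made for illustration; they are not taken from the paper.

```python
from statistics import mean

def margin_scores(activations):
    """Hypothetical margin score per neuron and culture.

    activations: dict mapping culture name -> list of activation vectors
    (one list of floats per example, same length for every example).
    """
    # Mean activation of each neuron within each culture.
    means = {
        culture: [mean(column) for column in zip(*rows)]
        for culture, rows in activations.items()
    }
    scores = {}
    for culture, mu in means.items():
        others = [m for name, m in means.items() if name != culture]
        # Assumed margin: target mean minus the strongest competing culture's mean.
        scores[culture] = [
            mu[j] - max(other[j] for other in others) for j in range(len(mu))
        ]
    return scores

def top_k_neurons(scores, culture, k):
    """Indices of the k neurons with the largest margin for one culture."""
    s = scores[culture]
    return sorted(range(len(s)), key=lambda j: s[j], reverse=True)[:k]

def ablate(vector, neuron_ids):
    """Return a copy of an activation vector with the selected neurons zeroed."""
    dead = set(neuron_ids)
    return [0.0 if j in dead else v for j, v in enumerate(vector)]

# Toy data (invented): two cultures, two examples each, three neurons.
toy = {
    "culture_a": [[1.0, 0.2, 0.5], [0.8, 0.1, 0.5]],
    "culture_b": [[0.1, 0.9, 0.5], [0.2, 1.0, 0.5]],
}
scores = margin_scores(toy)
selected = top_k_neurons(scores, "culture_a", k=1)
ablated = ablate(toy["culture_a"][0], selected)
```

In a real VLM the zeroing would typically be applied inside the forward pass of the chosen decoder layers, and the causal test would compare accuracy on questions about the target culture against accuracy on the others, before and after ablation.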

Citation

Zhao, X., Choenni, R., Saxena, R., Titov, I. (2025). Finding Culture-Sensitive Neurons in Vision-Language Models. https://arxiv.org/abs/2510.24942

Publication Information
Publication Year: 2025
Language: en
Source Database: arXiv
Access: Open Access