arXiv Open Access 2025

Poisoned-MRAG: Knowledge Poisoning Attacks to Multimodal Retrieval Augmented Generation

Yinuo Liu Zenghui Yuan Guiyao Tie Jiawen Shi Pan Zhou +2 lainnya
Lihat Sumber

Abstrak

Multimodal retrieval-augmented generation (RAG) enhances the visual reasoning capability of vision-language models (VLMs) by dynamically accessing information from external knowledge bases. In this work, we introduce \textit{Poisoned-MRAG}, the first knowledge poisoning attack on multimodal RAG systems. Poisoned-MRAG injects a few carefully crafted image-text pairs into the multimodal knowledge database, manipulating VLMs to generate the attacker-desired response to a target query. Specifically, we formalize the attack as an optimization problem and propose two cross-modal attack strategies, dirty-label and clean-label, tailored to the attacker's knowledge and goals. Our extensive experiments across multiple knowledge databases and VLMs show that Poisoned-MRAG outperforms existing methods, achieving up to 98\% attack success rate with just five malicious image-text pairs injected into the InfoSeek database (481,782 pairs). Additionally, We evaluate 4 different defense strategies, including paraphrasing, duplicate removal, structure-driven mitigation, and purification, demonstrating their limited effectiveness and trade-offs against Poisoned-MRAG. Our results highlight the effectiveness and scalability of Poisoned-MRAG, underscoring its potential as a significant threat to multimodal RAG systems.

Topik & Kata Kunci

Penulis (7)

Y

Yinuo Liu

Z

Zenghui Yuan

G

Guiyao Tie

J

Jiawen Shi

P

Pan Zhou

L

Lichao Sun

N

Neil Zhenqiang Gong

Format Sitasi

Liu, Y., Yuan, Z., Tie, G., Shi, J., Zhou, P., Sun, L. et al. (2025). Poisoned-MRAG: Knowledge Poisoning Attacks to Multimodal Retrieval Augmented Generation. https://arxiv.org/abs/2503.06254

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓