arXiv Open Access 2025

Poisoned-MRAG: Knowledge Poisoning Attacks to Multimodal Retrieval Augmented Generation

Yinuo Liu Zenghui Yuan Guiyao Tie Jiawen Shi Pan Zhou +2 lainnya

Lihat Sumber

Abstrak

Multimodal retrieval-augmented generation (RAG) enhances the visual reasoning capability of vision-language models (VLMs) by dynamically accessing information from external knowledge bases. In this work, we introduce \textit{Poisoned-MRAG}, the first knowledge poisoning attack on multimodal RAG systems. Poisoned-MRAG injects a few carefully crafted image-text pairs into the multimodal knowledge database, manipulating VLMs to generate the attacker-desired response to a target query. Specifically, we formalize the attack as an optimization problem and propose two cross-modal attack strategies, dirty-label and clean-label, tailored to the attacker's knowledge and goals. Our extensive experiments across multiple knowledge databases and VLMs show that Poisoned-MRAG outperforms existing methods, achieving up to 98\% attack success rate with just five malicious image-text pairs injected into the InfoSeek database (481,782 pairs). Additionally, We evaluate 4 different defense strategies, including paraphrasing, duplicate removal, structure-driven mitigation, and purification, demonstrating their limited effectiveness and trade-offs against Poisoned-MRAG. Our results highlight the effectiveness and scalability of Poisoned-MRAG, underscoring its potential as a significant threat to multimodal RAG systems.

Topik & Kata Kunci

cs.CR cs.LG

Penulis (7)

Yinuo Liu

Zenghui Yuan

Guiyao Tie

Jiawen Shi

Pan Zhou

Lichao Sun

Neil Zhenqiang Gong

Format Sitasi

APA MLA BibTeX

Liu, Y., Yuan, Z., Tie, G., Shi, J., Zhou, P., Sun, L. et al. (2025). Poisoned-MRAG: Knowledge Poisoning Attacks to Multimodal Retrieval Augmented Generation. https://arxiv.org/abs/2503.06254

Akses Cepat

Lihat di Sumber

Informasi Jurnal

Tahun Terbit: 2025
Bahasa: en
Sumber Database: arXiv
Akses: Open Access ✓