arXiv Open Access 2025

Practical Poisoning Attacks against Retrieval-Augmented Generation

Baolei Zhang Yuxi Chen Zhuqing Liu Lihai Nie Tong Li +2 others

Abstract

Large language models (LLMs) have demonstrated impressive natural language processing abilities but face challenges such as hallucination and outdated knowledge. Retrieval-Augmented Generation (RAG) has emerged as a state-of-the-art approach to mitigate these issues. While RAG enhances LLM outputs, it remains vulnerable to poisoning attacks. Recent studies show that injecting poisoned text into the knowledge database can compromise RAG systems, but most existing attacks assume that the attacker can insert a sufficient number of poisoned texts per query to outnumber correct-answer texts in retrieval, an assumption that is often unrealistic. To address this limitation, we propose CorruptRAG, a practical poisoning attack against RAG systems in which the attacker injects only a single poisoned text, enhancing both feasibility and stealth. Extensive experiments conducted on multiple large-scale datasets demonstrate that CorruptRAG achieves higher attack success rates than existing baselines.
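The majority assumption that the abstract calls unrealistic can be made concrete with a toy count. The sketch below is illustrative only and is not the CorruptRAG method: the `Retrieved` record, the similarity scores, and the helper names are all hypothetical, and every non-poisoned text is assumed to be a correct-answer text. It only shows that a single injected text cannot outnumber the correct-answer texts in a top-k result, which is the setting CorruptRAG targets.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Retrieved:
    text_id: str
    score: float    # hypothetical similarity score to the query
    poisoned: bool  # True if the attacker injected this text


def top_k(candidates, k):
    """Toy retriever: return the k highest-scoring candidates."""
    return sorted(candidates, key=lambda c: c.score, reverse=True)[:k]


def poisoned_outnumber_correct(retrieved):
    """Majority assumption: more poisoned than correct-answer texts retrieved."""
    n_poisoned = sum(1 for r in retrieved if r.poisoned)
    return n_poisoned > len(retrieved) - n_poisoned


def build_candidates(n_poisoned, n_correct=4):
    # Assumption for illustration: poisoned texts score slightly higher
    # than correct-answer texts, so all of them are retrieved first.
    poisoned = [Retrieved(f"p{i}", 0.95 - 0.01 * i, True) for i in range(n_poisoned)]
    correct = [Retrieved(f"c{i}", 0.90 - 0.01 * i, False) for i in range(n_correct)]
    return poisoned + correct


# One poisoned text among four correct ones, top-5 retrieval.
single = top_k(build_candidates(1), k=5)
# Three poisoned texts among four correct ones, top-5 retrieval.
many = top_k(build_candidates(3), k=5)

print(poisoned_outnumber_correct(single))
print(poisoned_outnumber_correct(many))
```

With these made-up scores, the single-text case retrieves one poisoned and four correct texts, so the majority condition fails; the three-text case retrieves three poisoned and two correct texts, so it holds.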

Topics & Keywords

cs.CR cs.IR cs.LG

Authors (7)

Baolei Zhang

Yuxi Chen

Zhuqing Liu

Lihai Nie

Tong Li

Zheli Liu

Minghong Fang

Citation Format

Zhang, B., Chen, Y., Liu, Z., Nie, L., Li, T., Liu, Z., & Fang, M. (2025). Practical Poisoning Attacks against Retrieval-Augmented Generation. https://arxiv.org/abs/2504.03957

Journal Information

Publication Year: 2025
Language: en
Source Database: arXiv
Access: Open Access ✓