arXiv Open Access 2025

Defending Against Beta Poisoning Attacks in Machine Learning Models

Nilufer Gulciftci M. Emre Gursoy
Lihat Sumber

Abstrak

Poisoning attacks, in which an attacker adversarially manipulates the training dataset of a machine learning (ML) model, pose a significant threat to ML security. Beta Poisoning is a recently proposed poisoning attack that disrupts model accuracy by making the training dataset linearly nonseparable. In this paper, we propose four defense strategies against Beta Poisoning attacks: kNN Proximity-Based Defense (KPB), Neighborhood Class Comparison (NCC), Clustering-Based Defense (CBD), and Mean Distance Threshold (MDT). The defenses are based on our observations regarding the characteristics of poisoning samples generated by Beta Poisoning, e.g., poisoning samples have close proximity to one another, and they are centered near the mean of the target class. Experimental evaluations using MNIST and CIFAR-10 datasets demonstrate that KPB and MDT can achieve perfect accuracy and F1 scores, while CBD and NCC also provide strong defensive capabilities. Furthermore, by analyzing performance across varying parameters, we offer practical insights regarding defenses' behaviors under varying conditions.

Topik & Kata Kunci

Penulis (2)

N

Nilufer Gulciftci

M

M. Emre Gursoy

Format Sitasi

Gulciftci, N., Gursoy, M.E. (2025). Defending Against Beta Poisoning Attacks in Machine Learning Models. https://arxiv.org/abs/2508.01276

Akses Cepat

Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2025
Bahasa
en
Sumber Database
arXiv
Akses
Open Access ✓