Semantic Scholar Open Access 2020 227 sitasi

MetaPoison: Practical General-purpose Clean-label Data Poisoning

W. R. Huang Jonas Geiping Liam H. Fowl Gavin Taylor T. Goldstein

Abstrak

Data poisoning--the process by which an attacker takes control of a model by making imperceptible changes to a subset of the training data--is an emerging threat in the context of neural networks. Existing attacks for data poisoning have relied on hand-crafted heuristics. Instead, we pose crafting poisons more generally as a bi-level optimization problem, where the inner level corresponds to training a network on a poisoned dataset and the outer level corresponds to updating those poisons to achieve a desired behavior on the trained model. We then propose MetaPoison, a first-order method to solve this optimization quickly. MetaPoison is effective: it outperforms previous clean-label poisoning methods by a large margin under the same setting. MetaPoison is robust: its poisons transfer to a variety of victims with unknown hyperparameters and architectures. MetaPoison is also general-purpose, working not only in fine-tuning scenarios, but also for end-to-end training from scratch with remarkable success, e.g. causing a target image to be misclassified 90% of the time via manipulating just 1% of the dataset. Additionally, MetaPoison can achieve arbitrary adversary goals not previously possible--like using poisons of one class to make a target image don the label of another arbitrarily chosen class. Finally, MetaPoison works in the real-world. We demonstrate successful data poisoning of models trained on Google Cloud AutoML Vision. Code and premade poisons are provided at this https URL

Penulis (5)

W

W. R. Huang

J

Jonas Geiping

L

Liam H. Fowl

G

Gavin Taylor

T

T. Goldstein

Format Sitasi

Huang, W.R., Geiping, J., Fowl, L.H., Taylor, G., Goldstein, T. (2020). MetaPoison: Practical General-purpose Clean-label Data Poisoning. https://www.semanticscholar.org/paper/8465338724f00a1f57a86717e4c898256c522be0

Akses Cepat

PDF tidak tersedia langsung

Cek di sumber asli →
Lihat di Sumber
Informasi Jurnal
Tahun Terbit
2020
Bahasa
en
Total Sitasi
227×
Sumber Database
Semantic Scholar
Akses
Open Access ✓