Generative Poisoning Attack Method Against Neural Networks

Poisoning attacks are recognized as a severe security threat to machine learning algorithms. In many applications, deep neural network (DNN) models are re-trained on publicly collected data, and this input data can be poisoned. Although poisoning attacks against support vector machines (SVMs) have been studied extensively, little is known about how such attacks can be mounted against neural networks (NNs), especially DNNs. In this work, we first examine the feasibility of applying a traditional gradient-based method (referred to as the direct gradient method) to generate poisoned data against NNs by leveraging the gradient of the target model with respect to the normal data. We then propose a generative method that accelerates the generation of poisoned data: an auto-encoder (the generator) that produces the poisoned data is updated by a reward function of the loss, while the target NN model (the discriminator) receives the poisoned data and computes the loss with respect to the normal data. Our experimental results show that the generative method speeds up poisoned data generation by up to 239.38x compared with the direct gradient method, at the cost of slightly lower model accuracy degradation. We also design a countermeasure that detects such poisoning attacks by monitoring the loss of the target model.
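The generator/discriminator loop described above can be sketched in code. The following is a minimal, hypothetical PyTorch sketch, not the authors' implementation: an auto-encoder generator crafts poisoned samples, the target model performs its usual re-training step on them, and the generator is then updated to drive up the target's loss on clean data. The architectures, layer sizes, and the differentiable proxy used for the generator update are illustrative assumptions standing in for the paper's reward-based update.

```python
# Hypothetical sketch of the generative poisoning loop (PyTorch).
# Shapes, architectures, and the generator's update rule are assumptions;
# the paper's actual reward-based formulation is only approximated here.
import torch
import torch.nn as nn
import torch.nn.functional as F

class AutoEncoder(nn.Module):
    """Generator: maps a clean sample to a same-shaped poisoned sample."""
    def __init__(self, dim=784):
        super().__init__()
        self.enc = nn.Sequential(nn.Linear(dim, 128), nn.ReLU())
        self.dec = nn.Sequential(nn.Linear(128, dim), nn.Sigmoid())

    def forward(self, x):
        return self.dec(self.enc(x))

class TargetNet(nn.Module):
    """Victim model that keeps re-training on incoming (possibly poisoned) data."""
    def __init__(self, dim=784, classes=10):
        super().__init__()
        self.fc1, self.fc2 = nn.Linear(dim, 256), nn.Linear(256, classes)

    def forward(self, x):
        return self.fc2(F.relu(self.fc1(x)))

def poisoning_step(gen, target, gen_opt, tgt_opt,
                   x_clean, y_clean, x_val, y_val):
    """One attack iteration: poison -> victim re-trains -> generator rewarded."""
    # 1) Generator crafts poisoned data from clean inputs.
    x_poison = gen(x_clean)

    # 2) Victim performs its usual re-training step on the poisoned batch.
    tgt_opt.zero_grad()
    F.cross_entropy(target(x_poison.detach()), y_clean).backward()
    tgt_opt.step()

    # 3) Reward signal: the victim's loss on held-out clean data, which the
    #    attacker wants to grow. The generator update below uses a simple
    #    differentiable proxy: ascend the victim's loss on the poisoned batch.
    with torch.no_grad():
        reward = F.cross_entropy(target(x_val), y_val).item()

    gen_opt.zero_grad()
    (-F.cross_entropy(target(gen(x_clean)), y_clean)).backward()
    gen_opt.step()
    return reward

# Example usage with random stand-in data (MNIST-like 784-dim inputs assumed).
gen, target = AutoEncoder(), TargetNet()
gen_opt = torch.optim.Adam(gen.parameters(), lr=1e-3)
tgt_opt = torch.optim.SGD(target.parameters(), lr=0.1)
x, y = torch.rand(64, 784), torch.randint(0, 10, (64,))
xv, yv = torch.rand(64, 784), torch.randint(0, 10, (64,))
print(poisoning_step(gen, target, gen_opt, tgt_opt, x, y, xv, yv))
```

The key point of the generative approach is that each new poisoned batch costs only a forward pass through the generator, whereas the direct gradient method must recompute the target model's input gradients for every sample, which is what produces the reported speedup. The monitored reward (the target's loss on clean data) is also the quantity the proposed countermeasure would watch to detect the attack.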
