SoK: Security and Privacy in Machine Learning

Advances in machine learning (ML) in recent years have enabled a dizzying array of applications, including data analytics, autonomous systems, and security diagnostics. ML is now pervasive: new systems and models are being deployed in every domain imaginable, leading to widespread use of software-based inference and decision making. There is growing recognition that ML exposes new vulnerabilities in software systems, yet the technical community's understanding of the nature and extent of these vulnerabilities remains limited. We systematize findings on ML security and privacy, focusing on attacks identified on these systems and on defenses crafted to date. We articulate a comprehensive threat model for ML and categorize attacks and defenses within an adversarial framework. We identify key insights from work in both the ML and security communities, and relate the effectiveness of existing approaches to structural elements of ML algorithms and the data used to train them. In particular, it is apparent that constructing a theoretical understanding of the sensitivity of modern ML algorithms to the data they analyze, à la PAC theory, will foster a science of security and privacy in ML.
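
To make the PAC-style guarantee alluded to above concrete, recall the classical Vapnik–Chervonenkis generalization bound (a standard textbook formulation, included here for context rather than as a result of this work): for a hypothesis class $\mathcal{H}$ with VC dimension $d$, with probability at least $1 - \delta$ over a draw of $m$ i.i.d. training examples, every hypothesis $h \in \mathcal{H}$ satisfies

$$ R(h) \;\le\; \hat{R}(h) \;+\; O\!\left( \sqrt{ \frac{d \ln(m/d) + \ln(1/\delta)}{m} } \right), $$

where $R(h)$ denotes the true risk and $\hat{R}(h)$ the empirical risk on the training sample. An analogous theory, one that bounds a model's sensitivity to adversarially chosen training or test data rather than to benign sampling noise, is the kind of foundation the abstract argues is needed for a science of ML security and privacy.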
