论文信息 - Robustness to Adversarial Perturbations in Learning from Incomplete Data - 字舞流文

Robustness to Adversarial Perturbations in Learning from Incomplete Data

What is the role of unlabeled data in an inference problem, when the presumed underlying distribution is adversarially perturbed? To provide a concrete answer to this question, this paper unifies two major learning frameworks: Semi-Supervised Learning (SSL) and Distributionally Robust Learning (DRL). We develop a generalization theory for our framework based on a number of novel complexity measures, such as an adversarial extension of Rademacher complexity and its semi-supervised analogue. Moreover, our analysis is able to quantify the role of unlabeled data in the generalization under a more general condition compared to the existing theoretical works in SSL. Based on our framework, we also present a hybrid of DRL and EM algorithms that has a guaranteed convergence rate. When implemented with deep neural networks, our method shows a comparable performance to those of the state-of-the-art on a number of real-world benchmark datasets.

Amir Najafi | Takeru Miyato | Shin-ichi Maeda | Masanori Koyama | Takeru Miyato | Masanori Koyama | S. Maeda | Amir Najafi | Amir Najafi

[1] Shin Ishii,et al. Virtual Adversarial Training: A Regularization Method for Supervised and Semi-Supervised Learning , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2] P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[3] Ludwig Schmidt,et al. Unlabeled Data Improves Adversarial Robustness , 2019, NeurIPS.

[4] Di He,et al. Adversarially Robust Generalization Just Requires More Unlabeled Data , 2019, ArXiv.

[5] Daniel Kuhn,et al. Data-driven distributionally robust optimization using the Wasserstein metric: performance guarantees and tractable reformulations , 2015, Mathematical Programming.

[6] Alex Krizhevsky,et al. Learning Multiple Layers of Features from Tiny Images , 2009 .

[7] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[8] Prateek Mittal,et al. PAC-learning in the presence of evasion adversaries , 2018, NIPS 2018.

[9] Neil Genzlinger. A. and Q , 2006 .

[10] Marco Loog,et al. Contrastive Pessimistic Likelihood Estimation for Semi-Supervised Classification , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11] W. Marsden. I and J , 2012 .

[12] Karthyek R. A. Murthy,et al. Quantifying Distributional Model Risk Via Optimal Transport , 2016, Math. Oper. Res..

[13] Xiaojin Zhu,et al. --1 CONTENTS , 2006 .

[14] O. Chapelle,et al. Semi-Supervised Learning (Chapelle, O. et al., Eds.; 2006) [Book reviews] , 2009, IEEE Transactions on Neural Networks.

[15] Yang Kang,et al. Distributionally Robust Semi-supervised Learning , 2017 .

[16] Joan Bruna,et al. Intriguing properties of neural networks , 2013, ICLR.

[17] Sepp Hochreiter,et al. Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs) , 2015, ICLR.

[18] Jason Yosinski,et al. Deep neural networks are easily fooled: High confidence predictions for unrecognizable images , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[19] Yoshua Bengio,et al. Semi-supervised Learning by Entropy Minimization , 2004, CAP.

[20] Ameet Talwalkar,et al. Foundations of Machine Learning , 2012, Adaptive computation and machine learning.

[21] Massih-Reza Amini,et al. Semi Supervised Logistic Regression , 2002, ECAI.

[22] Fan Yang,et al. Good Semi-supervised Learning That Requires a Bad GAN , 2017, NIPS.

[23] M. Dresher. Games of Strategy: Theory and Applications , 2007 .

[24] Saeed Ghadimi,et al. Optimal Stochastic Approximation Algorithms for Strongly Convex Stochastic Composite Optimization I: A Generic Algorithmic Framework , 2012, SIAM J. Optim..

[25] Yoav Freund,et al. Scalable Semi-Supervised Aggregation of Classifiers , 2015, NIPS.

[26] Richard Nock,et al. Monge beats Bayes: Hardness Results for Adversarial Training , 2018, ICML.

[27] John Duchi,et al. Statistics of Robust Optimization: A Generalized Empirical Likelihood Approach , 2016, Math. Oper. Res..

[28] John C. Duchi,et al. Certifying Some Distributional Robustness with Principled Adversarial Training , 2017, ICLR.

[29] Po-Sen Huang,et al. Are Labels Required for Improving Adversarial Robustness? , 2019, NeurIPS.

[30] Robert D. Nowak,et al. Unlabeled data: Now it helps, now it doesn't , 2008, NIPS.

[31] Gang Niu,et al. Does Distributionally Robust Supervised Learning Give Robust Classifiers? , 2016, ICML.

[32] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[33] John C. Duchi,et al. Certifiable Distributional Robustness with Principled Adversarial Training , 2017, ArXiv.

[34] James Bailey,et al. On the Convergence and Robustness of Adversarial Training , 2021, ICML.

[35] Ivor W. Tsang,et al. Robust Semi-Supervised Learning through Label Aggregation , 2016, AAAI.

[36] Yang Kang,et al. Semi‐supervised Learning Based on Distributionally Robust Optimization , 2020 .

[37] M. Staib,et al. Distributionally Robust Deep Learning as a Generalization of Adversarial Training , 2017 .

[38] Philippe Rigollet,et al. Generalization Error Bounds in Semi-supervised Classification Under the Cluster Assumption , 2006, J. Mach. Learn. Res..

[39] Dong-Hyun Lee,et al. Pseudo-Label : The Simple and Efficient Semi-Supervised Learning Method for Deep Neural Networks , 2013 .

[40] Aditi Raghunathan,et al. Adversarial Training Can Hurt Generalization , 2019, ArXiv.

[41] Can Yang,et al. On the Convergence of the EM Algorithm: From the Statistical Perspective , 2016 .

[42] Arindam Banerjee,et al. Semi-supervised Clustering by Seeding , 2002, ICML.

[43] J. Frédéric Bonnans,et al. Perturbation Analysis of Optimization Problems , 2000, Springer Series in Operations Research.

[44] Aleksander Madry,et al. Towards Deep Learning Models Resistant to Adversarial Attacks , 2017, ICLR.

[45] Jonathon Shlens,et al. Explaining and Harnessing Adversarial Examples , 2014, ICLR.

[46] Daniel Kuhn,et al. Distributionally Robust Logistic Regression , 2015, NIPS.

[47] Ananthram Swami,et al. Distillation as a Defense to Adversarial Perturbations Against Deep Neural Networks , 2015, 2016 IEEE Symposium on Security and Privacy (SP).

[48] Aleksander Madry,et al. Adversarially Robust Generalization Requires More Data , 2018, NeurIPS.

[49] Andrew Y. Ng,et al. Reading Digits in Natural Images with Unsupervised Feature Learning , 2011 .

[50] Anja De Waegenaere,et al. Robust Solutions of Optimization Problems Affected by Uncertain Probabilities , 2011, Manag. Sci..