Smoothness Analysis of Adversarial Training

Deep neural networks are vulnerable to adversarial attacks. Recent studies on adversarial robustness focus on the loss landscape in parameter space, since it is closely tied to optimization and generalization performance. These studies conclude that the difficulty of adversarial training stems from the non-smoothness of the loss function, i.e., its gradient is not Lipschitz continuous. However, this analysis ignores the dependence of adversarial attacks on the model parameters: because attacks are optimized against a specific model, they necessarily depend on its parameters. Taking this dependence into account, we analyze in more detail the smoothness of the adversarial training loss evaluated at the attack that is optimal for the current model parameters. We show that the constraint on adversarial perturbations is one cause of the non-smoothness, and that the degree of smoothness depends on the type of constraint; in particular, the L∞ constraint induces non-smoothness more readily than the L2 constraint. Moreover, our analysis implies that flattening the loss function with respect to the input data tends to increase the Lipschitz constant of the gradient of the adversarial loss. To address the non-smoothness, we show that EntropySGD smooths the non-smooth loss and improves the performance of adversarial training.
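To make the setting concrete, the sketch below (not taken from the paper; model, ε, and step sizes are hypothetical placeholders) approximates the adversarial loss max over ||δ||_p ≤ ε of the cross-entropy at x + δ with projected gradient descent, under either an L∞ or an L2 constraint. Because the inner maximizer is recomputed for the current parameters, the attack depends on the model parameters, which is exactly the dependence the analysis accounts for.

```python
import torch
import torch.nn.functional as F

def pgd_adversarial_loss(model, x, y, eps=8 / 255, steps=10, norm="linf"):
    """Approximate max_{||delta||_p <= eps} CE(model(x + delta), y) with PGD.

    Assumes x is a 4D image batch (N, C, H, W); eps, steps, and the step-size
    heuristic are illustrative choices, not values from the paper.
    """
    delta = torch.zeros_like(x, requires_grad=True)
    step = 2.5 * eps / steps  # common heuristic inner step size
    for _ in range(steps):
        loss = F.cross_entropy(model(x + delta), y)
        grad, = torch.autograd.grad(loss, delta)
        with torch.no_grad():
            if norm == "linf":
                # L-infinity constraint: sign-gradient ascent step, box projection
                delta += step * grad.sign()
                delta.clamp_(-eps, eps)
            else:
                # L2 constraint: normalized-gradient step, projection onto the L2 ball
                g = grad / (grad.flatten(1).norm(dim=1).view(-1, 1, 1, 1) + 1e-12)
                delta += step * g
                norms = delta.flatten(1).norm(dim=1).view(-1, 1, 1, 1) + 1e-12
                delta *= (eps / norms).clamp(max=1.0)
        delta.requires_grad_(True)
    # Outer loss evaluated at the (approximately) optimal attack; gradients flow
    # only to the model parameters, giving the adversarial training objective.
    return F.cross_entropy(model(x + delta.detach()), y)
```

In adversarial training, this loss replaces the clean loss at each optimizer step; the smoothness analysis in the abstract concerns the Lipschitz continuity of its gradient with respect to the model parameters, and EntropySGD would be applied as the outer optimizer to smooth this objective.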
