Exploring Architectural Ingredients of Adversarially Robust Deep Neural Networks

Deep neural networks (DNNs) are known to be vulnerable to adversarial attacks. A range of defense methods have been proposed to train adversarially robust DNNs, among which adversarial training has demonstrated promising results. However, despite the preliminary understanding developed for adversarial training, it remains unclear, from an architectural perspective, which configurations lead to more robust DNNs. In this paper, we address this gap via a comprehensive investigation of the impact of network width and depth on the robustness of adversarially trained DNNs. Specifically, we make the following key observations: 1) more parameters (higher model capacity) do not necessarily help adversarial robustness; 2) reducing capacity at the last stage (the last group of blocks) of the network can actually improve adversarial robustness; and 3) under the same parameter budget, there exists an optimal architectural configuration for adversarial robustness. We also provide a theoretical analysis explaining why such a network configuration can help robustness. These architectural insights can help design adversarially robust DNNs. Code is available at https://github.com/HanxunH/RobustWRN.
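To make observation 2) concrete, below is a minimal sketch (not the authors' implementation; their code is in the linked repository) of a WideResNet-style network whose per-stage width multipliers can be set independently, so the last stage can be made narrower than the others. All names here (`WideResNetConfigurable`, `widths`, `BasicBlock`) are hypothetical and for illustration only.

```python
# Hypothetical sketch: a WideResNet-style model with an independent width
# multiplier per stage, so capacity at the last stage can be reduced.
import torch
import torch.nn as nn
import torch.nn.functional as F


class BasicBlock(nn.Module):
    """Pre-activation residual block, as used in WideResNet-style networks."""
    def __init__(self, in_planes, out_planes, stride):
        super().__init__()
        self.bn1 = nn.BatchNorm2d(in_planes)
        self.conv1 = nn.Conv2d(in_planes, out_planes, 3, stride, 1, bias=False)
        self.bn2 = nn.BatchNorm2d(out_planes)
        self.conv2 = nn.Conv2d(out_planes, out_planes, 3, 1, 1, bias=False)
        # 1x1 projection shortcut when the spatial size or channel count changes.
        self.shortcut = (nn.Conv2d(in_planes, out_planes, 1, stride, bias=False)
                         if stride != 1 or in_planes != out_planes else nn.Identity())

    def forward(self, x):
        out = self.conv1(F.relu(self.bn1(x)))
        out = self.conv2(F.relu(self.bn2(out)))
        return out + self.shortcut(x)


class WideResNetConfigurable(nn.Module):
    """WRN-depth network where `widths` gives one width multiplier per stage."""
    def __init__(self, depth=34, widths=(10, 10, 4), num_classes=10):
        super().__init__()
        n = (depth - 4) // 6  # blocks per stage, following the WRN-d-k convention
        # Channel counts per stage; e.g. widths=(10, 10, 10) recovers WRN-34-10.
        planes = [16] + [16 * w * (2 ** i) for i, w in enumerate(widths)]
        self.conv1 = nn.Conv2d(3, planes[0], 3, 1, 1, bias=False)
        stages, in_p = [], planes[0]
        for i, out_p in enumerate(planes[1:]):
            stride = 1 if i == 0 else 2  # downsample at stages 2 and 3
            blocks = [BasicBlock(in_p, out_p, stride)]
            blocks += [BasicBlock(out_p, out_p, 1) for _ in range(n - 1)]
            stages.append(nn.Sequential(*blocks))
            in_p = out_p
        self.stages = nn.Sequential(*stages)
        self.bn = nn.BatchNorm2d(in_p)
        self.fc = nn.Linear(in_p, num_classes)

    def forward(self, x):
        out = self.stages(self.conv1(x))
        out = F.relu(self.bn(out))
        out = F.adaptive_avg_pool2d(out, 1).flatten(1)
        return self.fc(out)


if __name__ == "__main__":
    # A WRN-34 variant with a narrower last stage (widths=(10, 10, 4)),
    # versus the standard WRN-34-10 which would use widths=(10, 10, 10).
    model = WideResNetConfigurable(depth=34, widths=(10, 10, 4))
    print(model(torch.randn(2, 3, 32, 32)).shape)  # torch.Size([2, 10])
```

Under the paper's observations, such a last-stage-reduced variant would be trained with standard adversarial training and compared against the uniform-width baseline at a matched parameter budget; the exact widths above are illustrative, not the configurations reported in the paper.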
