Towards Large Yet Imperceptible Adversarial Image Perturbations With Perceptual Color Distance

The success of image perturbations that are designed to fool image classifiers is assessed in terms of both adversarial effect and visual imperceptibility. The conventional assumption on imperceptibility is that perturbations should strive for tight Lp-norm bounds in RGB space. In this work, we drop this assumption and instead pursue an approach that exploits human color perception, specifically, minimizing perturbation size with respect to perceptual color distance. Our first approach, Perceptual Color distance C&W (PerC-C&W), extends the widely-used C&W approach and produces larger RGB perturbations. PerC-C&W maintains adversarial strength while contributing to imperceptibility. Our second approach, Perceptual Color distance Alternating Loss (PerC-AL), achieves the same outcome more efficiently by alternating between the classification loss and the perceptual color difference when updating perturbations. Experimental evaluation shows that PerC approaches outperform conventional Lp approaches in terms of robustness and transferability, and also demonstrates that the PerC distance can provide added value on top of existing structure-based methods for creating image perturbations.
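Below is a minimal sketch of the PerC-AL alternating-update idea described in the abstract, assuming a PyTorch classifier that takes a single image in [0, 1]. The paper measures perceptual color distance with CIEDE2000; for brevity this sketch substitutes the simpler CIE76 ΔE (Euclidean distance in CIELAB), which preserves the alternating structure of the algorithm. The function names (`rgb_to_lab`, `delta_e_cie76`, `perc_al_sketch`) and the step sizes `alpha_cls`/`alpha_col` are illustrative assumptions, not the authors' implementation.

```python
# Illustrative sketch (not the authors' code) of the PerC-AL alternation:
# while the image is still correctly classified, step to raise the
# classification loss; once it is misclassified, step to shrink the
# perceptual color distance to the original image instead.
import torch
import torch.nn.functional as F


def rgb_to_lab(rgb: torch.Tensor) -> torch.Tensor:
    """sRGB (N, 3, H, W) in [0, 1] -> CIELAB, D65 white point."""
    # sRGB gamma expansion to linear RGB.
    lin = torch.where(rgb > 0.04045,
                      ((rgb + 0.055) / 1.055) ** 2.4, rgb / 12.92)
    r, g, b = lin[:, 0], lin[:, 1], lin[:, 2]
    # Linear RGB -> XYZ (sRGB matrix), normalized by the D65 white point.
    x = (0.4124 * r + 0.3576 * g + 0.1805 * b) / 0.95047
    y = 0.2126 * r + 0.7152 * g + 0.0722 * b
    z = (0.0193 * r + 0.1192 * g + 0.9505 * b) / 1.08883
    xyz = torch.stack([x, y, z], dim=1)
    # XYZ -> Lab nonlinearity (clamp keeps the cube-root gradient finite).
    d = 6.0 / 29.0
    f = torch.where(xyz > d ** 3, xyz.clamp(min=1e-8) ** (1.0 / 3.0),
                    xyz / (3 * d ** 2) + 4.0 / 29.0)
    L = 116.0 * f[:, 1] - 16.0
    a = 500.0 * (f[:, 0] - f[:, 1])
    b_ = 200.0 * (f[:, 1] - f[:, 2])
    return torch.stack([L, a, b_], dim=1)


def delta_e_cie76(lab1: torch.Tensor, lab2: torch.Tensor) -> torch.Tensor:
    """Per-pixel CIE76 color difference (stand-in for CIEDE2000)."""
    return torch.sqrt(((lab1 - lab2) ** 2).sum(dim=1) + 1e-12)


def perc_al_sketch(model, x, label, steps=100, alpha_cls=0.5, alpha_col=0.05):
    """Alternating-loss attack sketch for a single image x of shape (1, 3, H, W)."""
    target = torch.tensor([label], device=x.device)
    lab_orig = rgb_to_lab(x)
    x_adv = x.clone()
    for _ in range(steps):
        x_adv = x_adv.detach().requires_grad_(True)
        logits = model(x_adv)
        if logits.argmax(dim=1).item() == label:
            # Not yet adversarial: normalized gradient ascent on the
            # classification loss.
            loss = F.cross_entropy(logits, target)
            grad = torch.autograd.grad(loss, x_adv)[0]
            x_adv = x_adv + alpha_cls * grad / (grad.norm() + 1e-12)
        else:
            # Already adversarial: descend on the perceptual color
            # distance to the original image.
            dist = delta_e_cie76(rgb_to_lab(x_adv), lab_orig).sum()
            grad = torch.autograd.grad(dist, x_adv)[0]
            x_adv = x_adv - alpha_col * grad / (grad.norm() + 1e-12)
        x_adv = x_adv.clamp(0.0, 1.0)
    return x_adv.detach()
```

In the paper itself, the full CIEDE2000 formula (with lightness, chroma, and hue terms) would take the place of `delta_e_cie76`; the PerC-C&W variant instead folds the perceptual color distance into a single C&W-style joint objective rather than alternating between the two losses.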
