On the robustness of skeleton detection against adversarial attacks

Human perception of an object's skeletal structure is particularly robust to diverse perturbations of shape. This skeleton representation possesses substantial advantages for parts-based and invariant shape encoding, which is essential for object recognition. Multiple deep learning-based skeleton detection models have been proposed, while their robustness to adversarial attacks remains unclear. (1) This paper is the first work to study the robustness of deep learning-based skeleton detection against adversarial attacks, which are only slightly unlike the original data but still imperceptible to humans. We systematically analyze the robustness of skeleton detection models through exhaustive adversarial attacking experiments. (2) We propose a novel Frequency attack, which can directly exploit the regular and interpretable perturbations to sharply disrupt skeleton detection models. Frequency attack consists of an excitatory-inhibition waveform with high frequency attribution, which confuses edge-sensitive convolutional filters due to the sudden contrast between crests and troughs. Our comprehensive results verify that skeleton detection models are also vulnerable to adversarial attacks. The meaningful findings will inspire researchers to explore more potential robust models by involving explicit skeleton features.

[1]  Iasonas Kokkinos,et al.  Learning-Based Symmetry Detection in Natural Images , 2012, ECCV.

[2]  Wei Shen,et al.  Hi-Fi: Hierarchical Feature Integration for Skeleton Detection , 2018, IJCAI.

[3]  Jitendra Malik,et al.  A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[4]  Sven J. Dickinson,et al.  AMAT: Medial Axis Transform for Natural Images , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[5]  Samy Bengio,et al.  Adversarial Machine Learning at Scale , 2016, ICLR.

[6]  Irving Biederman,et al.  Cortical representation of medial axis structure. , 2013, Cerebral cortex.

[7]  Andrew Gordon Wilson,et al.  Simple Black-box Adversarial Attacks , 2019, ICML.

[8]  Roman Neruda,et al.  Vulnerability of classifiers to evolutionary generated adversarial examples , 2020, Neural Networks.

[9]  D. Marr,et al.  Representation and recognition of the spatial organization of three-dimensional shapes , 1978, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[10]  J. P. Jones,et al.  An evaluation of the two-dimensional Gabor filter model of simple receptive fields in cat striate cortex. , 1987, Journal of neurophysiology.

[11]  Eric T. Carlson,et al.  Medial Axis Shape Coding in Macaque Inferotemporal Cortex , 2012, Neuron.

[12]  Joshua B. Tenenbaum,et al.  Human-level concept learning through probabilistic program induction , 2015, Science.

[13]  Matthias Bethge,et al.  Approximating CNNs with Bag-of-local-Features models works surprisingly well on ImageNet , 2019, ICLR.

[14]  Samuel Ritter,et al.  Cognitive Psychology for Deep Neural Networks: A Shape Bias Case Study , 2017, ICML.

[15]  Jonathon Shlens,et al.  Explaining and Harnessing Adversarial Examples , 2014, ICLR.

[16]  David A. Wagner,et al.  Obfuscated Gradients Give a False Sense of Security: Circumventing Defenses to Adversarial Examples , 2018, ICML.

[17]  Yiming Hu,et al.  Convolutional relation network for skeleton-based action recognition , 2019, Neurocomputing.

[18]  Stella F. Lourenco,et al.  Skeletal representations of shape in human vision: Evidence for a pruned medial axis model , 2019, Journal of vision.

[19]  Iasonas Kokkinos,et al.  DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Thomas Brox,et al.  Universal Adversarial Perturbations Against Semantic Image Segmentation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[21]  Francis Brochu,et al.  Increasing Shape Bias in ImageNet-Trained Networks Using Transfer Learning and Domain-Adversarial Methods , 2019, ArXiv.

[22]  Xinlei Chen,et al.  Microsoft COCO Captions: Data Collection and Evaluation Server , 2015, ArXiv.

[23]  Max Mignotte,et al.  Symmetry detection based on multiscale pairwise texture boundary segment interactions , 2016, Pattern Recognit. Lett..

[24]  Minho Lee,et al.  Real-time human action classification using a dynamic neural model , 2015, Neural Networks.

[25]  Vladislav Ayzenberg,et al.  Skeletal descriptions of shape provide unique perceptual information for object recognition , 2019, bioRxiv.

[26]  Xiang Bai,et al.  DeepSkeleton: Learning Multi-Task Scale-Associated Deep Side Outputs for Object Skeleton Extraction in Natural Images , 2016, IEEE Transactions on Image Processing.

[27]  Qinkun Xiao,et al.  Skeleton-based Chinese sign language recognition and generation for bidirectional communication between deaf and hearing people , 2020, Neural Networks.

[28]  B. Julesz,et al.  Perceptual sensitivity maps within globally defined visual shapes , 1994, Nature.

[29]  Andreas Geiger,et al.  Computer Vision for Autonomous Vehicles: Problems, Datasets and State-of-the-Art , 2017, Found. Trends Comput. Graph. Vis..

[30]  Honghai Liu,et al.  HDS-SP: A novel descriptor for skeleton-based human action recognition , 2020, Neurocomputing.

[31]  Jihua Zhu,et al.  Skeleton Filter: A Self-Symmetric Filter for Skeletonization in Noisy Text Images , 2020, IEEE Transactions on Image Processing.

[32]  Anastasios Tefas,et al.  K-Anonymity inspired adversarial attack and multiple one-class classification defense , 2020, Neural Networks.

[33]  Samy Bengio,et al.  Adversarial examples in the physical world , 2016, ICLR.

[34]  Joan Bruna,et al.  Intriguing properties of neural networks , 2013, ICLR.

[35]  Zihao Hu,et al.  Multiple instance subspace learning via partial random projection tree for local reflection symmetry in natural images , 2016, Pattern Recognit..

[36]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[37]  D. Hofstadter Metamagical Themas: Questing for the Essence of Mind and Pattern , 1985 .

[38]  Javier Del Ser,et al.  Robust image classification against adversarial attacks using elastic similarity measures between edge count sequences , 2020, Neural Networks.

[39]  Shimon Ullman,et al.  Class-Specific, Top-Down Segmentation , 2002, ECCV.

[40]  D. Mumford,et al.  The role of the primary visual cortex in higher level vision , 1998, Vision Research.