Local Intrinsic Dimensionality Signals Adversarial Perturbations

The vulnerability of machine learning models to adversarial perturbations has motivated a significant body of research under the broad umbrella of adversarial machine learning. Sophisticated attacks can cause learning algorithms to learn decision functions with poor predictive performance, or to make incorrect decisions at test time. In this context, a growing body of literature uses local intrinsic dimensionality (LID), a local measure of the minimum number of latent variables needed to describe each data point, to detect adversarial samples and subsequently mitigate their effects. The research to date has tended to focus on using LID as a practical defence method, often without fully explaining why LID can detect adversarial samples. In this paper, we derive a lower bound and an upper bound for the LID value of a perturbed data point and show that the bounds, in particular the lower bound, have a positive correlation with the magnitude of the perturbation. Hence, data points that are perturbed by a large amount have large LID values compared to unperturbed samples, which justifies the use of LID in the prior literature. Empirical results on benchmark datasets further validate the bounds.
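To make the LID quantity concrete, the sketch below implements the standard maximum-likelihood estimator of LID computed from the k nearest-neighbour distances of a query point, LID_hat = -1 / mean_i log(r_i / r_k). This is only an illustrative sketch: the Python function name, its parameters, and the choice of Euclidean distance are assumptions on our part, not the estimator or code used in the paper.

    import numpy as np

    def lid_mle(x, reference_set, k=20):
        # Maximum-likelihood LID estimate of a query point x with respect to a
        # reference sample (illustrative helper; names and defaults are ours).
        # r_1 <= ... <= r_k are the k smallest Euclidean distances from x to the
        # reference set, and LID_hat = -1 / mean_i log(r_i / r_k).
        dists = np.sort(np.linalg.norm(reference_set - x, axis=1))[:k]
        dists = np.maximum(dists, 1e-12)           # guard against log(0)
        log_ratios = np.log(dists / dists[-1])     # all terms <= 0; the k-th is 0
        return -1.0 / (log_ratios.mean() - 1e-12)  # offset avoids division by zero

Under the behaviour the paper formalises, this estimate is expected to be noticeably larger for a strongly perturbed point than for its clean counterpart, which is what makes thresholding on LID a plausible detection signal.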
