Informative Outlier Matters: Robustifying Out-of-distribution Detection Using Outlier Mining

Detecting out-of-distribution (OOD) inputs is critical for safely deploying deep learning models in an open-world setting. However, existing OOD detection solutions can be brittle in the open world, failing when faced with various types of adversarial OOD inputs. While methods that leverage auxiliary OOD data have emerged, our analysis reveals a key insight: the majority of auxiliary OOD examples may not meaningfully improve the decision boundary of the OOD detector. In this paper, we propose a theoretically motivated method, Adversarial Training with informative Outlier Mining (ATOM), which improves the robustness of OOD detection. We show that, by mining informative auxiliary OOD data, one can significantly improve OOD detection performance and, somewhat surprisingly, generalize to unseen adversarial attacks. ATOM achieves state-of-the-art performance under a broad family of natural and perturbed OOD evaluation tasks. For example, on the CIFAR-10 in-distribution dataset, ATOM reduces the FPR95 by up to 57.99% under adversarial OOD inputs, surpassing the previous best baseline by a large margin.
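To make the mining step concrete, below is a minimal sketch of informative outlier mining. It assumes the detector exposes per-example OOD scores (higher meaning "more confidently OOD") for a large pool of auxiliary outliers; the function name `mine_informative_outliers` and the hyperparameter values shown are illustrative assumptions, not the paper's reported settings. The idea is to discard easy outliers that the detector already flags and train on the near-boundary ones instead.

```python
import numpy as np

def mine_informative_outliers(scores: np.ndarray, n: int, q: float = 0.1) -> np.ndarray:
    """Select indices of informative auxiliary OOD examples.

    scores: OOD scores for a pool of auxiliary outliers, where a HIGHER
            score means the detector already flags the input as OOD.
    n:      number of outliers to keep for the next training epoch.
    q:      fraction of the lowest-scoring (hardest, possibly ambiguous)
            examples to skip before selecting.
    """
    order = np.argsort(scores)        # ascending: hardest outliers first
    start = int(q * len(scores))      # skip the very hardest q-fraction
    return order[start:start + n]     # keep the n near-boundary examples

# Illustrative usage on a random pool of 50,000 auxiliary outliers.
rng = np.random.default_rng(0)
pool_scores = rng.uniform(size=50_000)  # stand-in for real detector scores
chosen = mine_informative_outliers(pool_scores, n=10_000, q=0.125)
```

In the full method, this selection is interleaved with adversarial training epoch by epoch, so the mined set tracks the detector's current decision boundary rather than a fixed ranking.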
