Paper information - Can multi-label classification networks know what they don't know?


Estimating out-of-distribution (OOD) uncertainty is a major challenge for safely deploying machine learning models in the open-world environment. Improved methods for OOD detection in multi-class classification have emerged, while OOD detection methods for multi-label classification remain underexplored and use rudimentary techniques. We propose JointEnergy, a simple and effective method, which estimates the OOD indicator scores by aggregating label-wise energy scores from multiple labels. We show that JointEnergy can be mathematically interpreted from a joint likelihood perspective. Our results show consistent improvement over previous methods that are based on the maximum-valued scores, which fail to capture joint information from multiple labels. We demonstrate the effectiveness of our method on three common multi-label classification benchmarks, including MSCOCO, PASCAL-VOC, and NUS-WIDE. We show that JointEnergy can reduce the FPR95 by up to 10.05% compared to the previous best baseline, establishing state-of-the-art performance.
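The aggregation described in the abstract can be illustrated with a minimal sketch. It assumes the label-wise energy score of each label is the softplus of its logit, log(1 + exp(f_i(x))), and that the joint score is the sum of these over all labels, with higher scores suggesting in-distribution input. The function names and the logit values are hypothetical and chosen only for illustration; this is not the authors' implementation.

```python
import math


def label_energy(logit):
    """Label-wise score, assumed here to be softplus: log(1 + exp(logit)).

    Written in a numerically stable form so large logits do not overflow.
    """
    return max(logit, 0.0) + math.log1p(math.exp(-abs(logit)))


def joint_energy(logits):
    """Aggregate the label-wise scores by summing over all labels."""
    return sum(label_energy(z) for z in logits)


def max_logit(logits):
    """Maximum-valued baseline: looks only at the single strongest label."""
    return max(logits)


# Hypothetical logits for a 3-label classifier (illustrative values only).
two_labels_active = [4.0, 3.5, -6.0]   # two labels confidently present
one_label_active = [4.0, -5.0, -6.0]   # only one label confidently present

for name, logits in [("two active", two_labels_active),
                     ("one active", one_label_active)]:
    print(name, "max:", max_logit(logits),
          "joint:", round(joint_energy(logits), 4))
```

Both inputs share the same maximum logit, so a max-based score cannot tell them apart, while the summed score is higher for the input with evidence from two labels. In practice the score would be compared against a threshold chosen on held-out in-distribution data.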

Yixuan Li | Haoran Wang | Weitang Liu | Alex Bocchieri
