Paper information - Finding the Unknown: Novelty Detection with Extreme Value Signatures of Deep Neural Activations

Achieving or even surpassing human-level accuracy has recently become possible in a variety of application scenarios thanks to the rise of convolutional neural networks (CNNs) trained on large datasets. However, solving supervised visual recognition tasks by discriminating among known categories is only one side of the coin. In contrast, novelty detection, where instances of as yet unknown categories need to be identified, is still an unsolved task. We therefore propose to leverage the powerful discriminative nature of CNNs for novelty detection by investigating class-specific activation patterns. More precisely, we assume that a semantic category can be described by its extreme value signature, which specifies which dimensions of the deep neural activations have the largest values. Following this intuition, we show that even a small number of high-valued dimensions suffices to separate known from unknown categories. Our approach is simple, intuitive, and can easily be put on top of CNNs trained for vanilla classification tasks. We empirically validate the benefits of our approach in terms of accuracy and speed by comparing it against established methods on a variety of novelty detection tasks derived from ImageNet. Finally, we show that visualizing extreme value signatures makes it possible to inspect class-specific patterns learned during training, which may ultimately help to better understand CNN models.
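The idea of an extreme value signature can be illustrated with a small sketch. The abstract only states that a signature records which activation dimensions have the largest values; it does not say how signatures are built per class or compared. The code below therefore rests on assumptions that are not taken from the paper: a class signature is the set of top-k indices of the class mean activation, and the novelty score is one minus the best overlap between a test sample's top-k indices and any class signature. All function names and the toy activation vectors are hypothetical.

```python
from typing import Dict, List, Sequence, Set


def top_k_indices(activations: Sequence[float], k: int) -> Set[int]:
    """Return the indices of the k largest activation values."""
    order = sorted(range(len(activations)), key=lambda i: activations[i], reverse=True)
    return set(order[:k])


def class_signatures(
    features: Sequence[Sequence[float]], labels: Sequence[int], k: int
) -> Dict[int, Set[int]]:
    """Build one signature per known class.

    Assumption of this sketch: the signature is the top-k index set of the
    class mean activation vector.
    """
    sums: Dict[int, List[float]] = {}
    counts: Dict[int, int] = {}
    for vec, label in zip(features, labels):
        acc = sums.setdefault(label, [0.0] * len(vec))
        for i, value in enumerate(vec):
            acc[i] += value
        counts[label] = counts.get(label, 0) + 1
    return {
        label: top_k_indices([s / counts[label] for s in acc], k)
        for label, acc in sums.items()
    }


def novelty_score(
    activations: Sequence[float], signatures: Dict[int, Set[int]], k: int
) -> float:
    """Score in [0, 1]; higher means less overlap with every known signature.

    Assumption of this sketch: overlap is the fraction of shared top-k indices.
    """
    test_signature = top_k_indices(activations, k)
    best_overlap = max(len(test_signature & sig) / k for sig in signatures.values())
    return 1.0 - best_overlap


# Toy 6-dimensional "activations" for two known classes (made-up values).
features = [
    [5.0, 4.0, 0.1, 0.2, 0.0, 0.1],
    [4.5, 5.5, 0.0, 0.3, 0.1, 0.0],
    [0.1, 0.0, 6.0, 5.0, 0.2, 0.1],
    [0.2, 0.1, 5.0, 6.5, 0.0, 0.3],
]
labels = [0, 0, 1, 1]
k = 2

signatures = class_signatures(features, labels, k)
known_sample = [4.0, 6.0, 0.2, 0.1, 0.0, 0.3]  # peaks in the class-0 dimensions
novel_sample = [0.1, 0.2, 0.0, 0.3, 5.0, 6.0]  # peaks in dimensions no class uses

print(novelty_score(known_sample, signatures, k))
print(novelty_score(novel_sample, signatures, k))
```

In practice the activation vectors would come from a layer of a CNN trained for classification, and a threshold on the score would separate known from unknown categories; both the layer choice and the threshold are left open here.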
