Paper information - Few-shot learning in deep networks through global prototyping

Few-shot learning in deep networks through global prototyping

Training a deep convolutional neural network (CNN) to succeed in visual object classification usually requires a large number of examples. Here, starting from such a pre-trained CNN, we study the task of extending the network to classify additional categories on the basis of only a few examples ("few-shot learning"). We find that a simple and fast prototype-based learning procedure in the global feature layers ("Global Prototype Learning", GPL) leads to remarkably good classification results for a large portion of the new classes. It requires at most ten examples for the new classes to reach a plateau in performance. To understand the few-shot learning performance resulting from GPL, as well as the performance of the original network, we use the t-SNE method (Maaten and Hinton, 2008) to visualize clusters of object category examples. This reveals a strong connection between classification performance and data distribution, and explains why some new categories need only a few examples for learning while others resist good classification even when trained with many more examples.
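As a rough illustration of what prototype-based classification on top of fixed features can look like, the sketch below builds one prototype per new class from a handful of feature vectors and assigns a query to the nearest prototype. The abstract does not spell out the GPL procedure, so the class-mean prototypes, the Euclidean distance, the function names, and the toy 3-dimensional "features" are all assumptions made for this example, not the authors' method. In practice the feature vectors would be activations from a late layer of the pre-trained CNN.

```python
from math import sqrt


def mean_vector(vectors):
    """Component-wise mean of equally sized feature vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def build_prototypes(examples_by_class):
    """Map each class label to the mean of its few example feature vectors.

    Assumption: one mean prototype per class; the paper's procedure may differ.
    """
    return {label: mean_vector(vectors)
            for label, vectors in examples_by_class.items()}


def euclidean(a, b):
    """Euclidean distance between two feature vectors of equal length."""
    return sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def classify(feature, prototypes):
    """Return the label whose prototype lies closest to the feature vector."""
    return min(prototypes,
               key=lambda label: euclidean(feature, prototypes[label]))


# Hypothetical toy data: three examples per new class, 3-dimensional features.
few_shot_examples = {
    "zebra": [[0.9, 0.1, 0.0], [0.8, 0.2, 0.1], [1.0, 0.0, 0.1]],
    "kettle": [[0.1, 0.9, 0.7], [0.0, 0.8, 0.9], [0.2, 1.0, 0.8]],
}

prototypes = build_prototypes(few_shot_examples)
prediction = classify([0.85, 0.15, 0.05], prototypes)
print(prediction)  # prints "zebra"
```

Because adding a class only requires averaging a few vectors, no retraining of the network weights is involved in this sketch. How well it works depends on how tightly the new class clusters in feature space, which is the kind of structure the t-SNE visualizations are meant to expose.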

Thomas Burwick | Sebastian Blaes

[1] Qiang Yang, et al. A Survey on Transfer Learning, 2010, IEEE Transactions on Knowledge and Data Engineering.

[2] Andrew Zisserman, et al. Return of the Devil in the Details: Delving Deep into Convolutional Nets, 2014, BMVC.

[3] Joshua B. Tenenbaum, et al. One-shot learning by inverting a compositional causal process, 2013, NIPS.

[4] Garrison W. Cottrell, et al. Bikers Are Like Tobacco Shops, Formal Dressers Are Like Suits: Recognizing Urban Tribes with Caffe, 2015, IEEE Winter Conference on Applications of Computer Vision.

[5] Atsushi Sato, et al. Generalized Learning Vector Quantization, 1995, NIPS.

[6] Martial Hebert, et al. Learning to Learn: Model Regression Networks for Easy Small Sample Learning, 2016, ECCV.

[7] James Philbin, et al. FaceNet: A unified embedding for face recognition and clustering, 2015, IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8] Bernd Fritzke, et al. A Growing Neural Gas Network Learns Topologies, 1994, NIPS.

[9] Alexander Gepperth, et al. Computational Advantages of Deep Prototype-Based Learning, 2016, ICANN.

[10] Christoph H. Lampert, et al. Learning to detect unseen object classes by between-class attribute transfer, 2009, IEEE Conference on Computer Vision and Pattern Recognition.

[11] Fei-Fei Li, et al. ImageNet: A large-scale hierarchical image database, 2009, IEEE Conference on Computer Vision and Pattern Recognition.

[12] Guigang Zhang, et al. Deep Learning, 2016, Int. J. Semantic Comput.

[13] Geoffrey E. Hinton, et al. Visualizing Data using t-SNE, 2008.

[14] Venkatesh Saligrama, et al. Zero-Shot Learning via Semantic Similarity Embedding, 2015, IEEE International Conference on Computer Vision (ICCV).

[15] Geoffrey E. Hinton, et al. ImageNet classification with deep convolutional neural networks, 2012, Commun. ACM.

[16] Yoshua Bengio, et al. Gradient-based learning applied to document recognition, 1998, Proc. IEEE.

[17] Aaron C. Courville, et al. Deep Learning Vector Quantization, 2016, ESANN.

[18] Andrea Vedaldi, et al. MatConvNet: Convolutional Neural Networks for MATLAB, 2014, ACM Multimedia.

[19] Michael S. Bernstein, et al. ImageNet Large Scale Visual Recognition Challenge, 2014, International Journal of Computer Vision.

[20] Trevor Darrell, et al. DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition, 2013, ICML.

[21] Pietro Perona, et al. One-shot learning of object categories, 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22] Philip H. S. Torr, et al. Prototypical Priors: From Improving Classification to Zero-Shot Learning, 2015, BMVC.

[23] Michael Fink, et al. Object Classification from a Single Example Utilizing Class Relevance Metrics, 2004, NIPS.

[24] Joshua B. Tenenbaum, et al. Human-level concept learning through probabilistic program induction, 2015, Science.

[25] George A. Miller, et al. WordNet: A Lexical Database for English, 1995, HLT.