Is DeCAF Good Enough for Accurate Image Classification?

In recent years, deep learning has attracted much interest for addressing complex AI tasks. However, most of the deep learning models need to be trained for a long time in order to obtain good results. To overcome this problem, the deep convolutional activation feature DeCAF was proposed, which is directly extracted from the activation of a well trained deep convolutional neural network. Nevertheless, the dimensionality of DeCAF is simply fixed to a constant number. In this case, one may ask whether DeCAF is good enough for image classification applications and whether we can further improve its performance? To answer these two questions, we propose a new model called RS-DeCAF based on "reducing" and "stretching" the dimensionality of DeCAF. In the implementation of RS-DeCAF, we reduce the dimensionality of DeCAF using dimensionality reduction methods, such as PCA, and meanwhile increase the dimensionality by stretching the weight matrix between successive layers. RS-DeCAF is aimed to discover the effective representations of data for classification tasks. As there is no back propagation is needed for network training, RS-DeCAF is very efficient and can be easily applied to large scale problems. Extensive experiments on image classification show that RS-DeCAF not only slightly improves DeCAF, but dramatically outperforms previous "stretching" and other state-of-the-art approaches. Hence, RS-DeCAF can be considered as an effective substitute for previous DeCAF and "stretching" approaches.

[1]  Bernhard Schölkopf,et al.  Nonlinear Component Analysis as a Kernel Eigenvalue Problem , 1998, Neural Computation.

[2]  G. Baudat,et al.  Generalized Discriminant Analysis Using a Kernel Approach , 2000, Neural Computation.

[3]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[4]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[5]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Ivor W. Tsang,et al.  Core Vector Machines: Fast SVM Training on Very Large Data Sets , 2005, J. Mach. Learn. Res..

[7]  Xiang Zhang,et al.  OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks , 2013, ICLR.

[8]  Ambedkar Dukkipati,et al.  Learning by Stretching Deep Networks , 2014, ICML.

[9]  Lawrence K. Saul,et al.  Large-Margin Classification in Infinite Neural Networks , 2010, Neural Computation.

[10]  Alexander J. Smola,et al.  Learning with Kernels: support vector machines, regularization, optimization, and beyond , 2001, Adaptive computation and machine learning series.

[11]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[12]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[13]  Junyu Dong,et al.  Visual Texture Perception with Feature Learning Models and Deep Architectures , 2014, CCPR.

[14]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[15]  R. Fisher THE USE OF MULTIPLE MEASUREMENTS IN TAXONOMIC PROBLEMS , 1936 .

[16]  Thomas Brox,et al.  Discriminative Unsupervised Feature Learning with Convolutional Neural Networks , 2014, NIPS.

[17]  Jasper Snoek,et al.  Multi-Task Bayesian Optimization , 2013, NIPS.

[18]  Dit-Yan Yeung,et al.  Ensemble-Based Tracking: Aggregating Crowdsourced Structured Time Series Data , 2014, ICML.

[19]  I. Jolliffe Principal Component Analysis , 2002 .

[20]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[21]  Trevor Darrell,et al.  DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition , 2013, ICML.

[22]  Rob Fergus,et al.  Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[23]  Honglak Lee,et al.  An Analysis of Single-Layer Networks in Unsupervised Feature Learning , 2011, AISTATS.

[24]  Pietro Perona,et al.  One-shot learning of object categories , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[26]  Xiaogang Wang,et al.  Deep Learning Face Representation by Joint Identification-Verification , 2014, NIPS.

[27]  Qiang Chen,et al.  Network In Network , 2013, ICLR.

[28]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[29]  Lawrence D. Jackel,et al.  Backpropagation Applied to Handwritten Zip Code Recognition , 1989, Neural Computation.

[30]  Nitish Srivastava,et al.  Improving neural networks by preventing co-adaptation of feature detectors , 2012, ArXiv.