论文信息 - From Class-Specific to Class-Mixture: Cascaded Feature Representations via Restricted Boltzmann Machine Learning

From Class-Specific to Class-Mixture: Cascaded Feature Representations via Restricted Boltzmann Machine Learning

In this paper, we propose two kinds of feature extracting frameworks that can extract cascaded class-specific and class-mixture features, respectively, by taking the restricted Boltzmann machine (RBM) as the basic building blocks; we further call them as a CS-RBM and CM-RBM feature extractor. The discriminations of features from both CS-RBM and CM-RBM are verified better than the class-independent (traditional) RBM (CI-RBM) feature extractor. As one mini-batch samples are randomly selected from all classes during the training phase of the traditional RBM, which can make that the above mini-batch data contain easy-confusing samples from different categories. Therefore, the features from CI-RBM are difficult to distinguish these samples from the confused categories. CS-RBM and CM-RBM can overcome the above sample confusing problem efficiently and effectively. To cope with the real-valued input samples, we further extend the binary RBM to Gaussian–Bernoulli RBM (GBRBM), leading to the CS-GBRBM (CM-GBRBM) feature extracting framework. Experiments on binary datasets, i.e., MNIST and USPS, scene image dataset (Scene-15), and object image dataset (Coil-100), well verify the above facts and show the competitive results.

[1] Chih-Jen Lin,et al. LIBSVM: A library for support vector machines , 2011, TIST.

[2] Hermann Ney,et al. Statistical Image Object Recognition using Mixture Densities , 2001, Journal of Mathematical Imaging and Vision.

[3] Geoffrey E. Hinton. A Practical Guide to Training Restricted Boltzmann Machines , 2012, Neural Networks: Tricks of the Trade.

[4] Geoffrey E. Hinton,et al. Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[5] Guo-Sen Xie,et al. Integrating supervised subspace criteria with restricted Boltzmann Machine for feature extraction , 2014, 2014 International Joint Conference on Neural Networks (IJCNN).

[6] Cordelia Schmid,et al. Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[7] Honglak Lee,et al. Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations , 2009, ICML '09.

[8] Stephen M. Smith,et al. Segmentation of brain MR images through a hidden Markov random field model and the expectation-maximization algorithm , 2001, IEEE Transactions on Medical Imaging.

[9] Ching Y. Suen,et al. A trainable feature extractor for handwritten digit recognition , 2007, Pattern Recognit..

[10] Geoffrey E. Hinton,et al. Deep Boltzmann Machines , 2009, AISTATS.

[11] Razvan Pascanu,et al. Learning Algorithms for the Classification Restricted Boltzmann Machine , 2012, J. Mach. Learn. Res..

[12] Geoffrey E. Hinton,et al. Binary coding of speech spectrograms using a deep auto-encoder , 2010, INTERSPEECH.

[13] Hermann Ney,et al. Learning of Variability for Invariant Statistical Pattern Recognition , 2001, ECML.

[14] Tapani Raiko,et al. Improved Learning of Gaussian-Bernoulli Restricted Boltzmann Machines , 2011, ICANN.

[15] Yi Ma,et al. Robust principal component analysis? , 2009, JACM.

[16] Ling Shao,et al. Discriminative Elastic-Net Regularized Linear Regression , 2017, IEEE Transactions on Image Processing.

[17] Yihong Gong,et al. Locality-constrained Linear Coding for image classification , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[18] Yann LeCun,et al. The mnist database of handwritten digits , 2005 .

[19] Michael E. Tipping. The Relevance Vector Machine , 1999, NIPS.

[20] Zhiwei Li,et al. Max-Margin Dictionary Learning for Multiclass Image Categorization , 2010, ECCV.

[21] Larry S. Davis,et al. Learning Structured Low-Rank Representations for Image Classification , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[22] Bernhard Schölkopf,et al. Training Invariant Support Vector Machines , 2002, Machine Learning.

[23] Geoffrey E. Hinton. Training Products of Experts by Minimizing Contrastive Divergence , 2002, Neural Computation.

[24] Ruimin Shen,et al. Learning Class-relevant Features and Class-irrelevant Features via a Hybrid third-order RBM , 2011, AISTATS.

[25] James L. McClelland,et al. James L. McClelland, David Rumelhart and the PDP Research Group, Parallel distributed processing: explorations in the microstructure of cognition . Vol. 1. Foundations . Vol. 2. Psychological and biological models . Cambridge MA: M.I.T. Press, 1987. , 1989, Journal of Child Language.

[26] Allen Y. Yang,et al. Robust Face Recognition via Sparse Representation , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27] Yee Whye Teh,et al. A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[28] Xue Li,et al. Face recognition using class specific dictionary learning for sparse representation and collaborative representation , 2016, Neurocomputing.

[29] Mohammed Bennamoun,et al. Linear Regression for Face Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30] Patrice Y. Simard,et al. Best practices for convolutional neural networks applied to visual document analysis , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[31] Shin'ichi Tamura,et al. Capabilities of a four-layered feedforward neural network: four layers versus three , 1997, IEEE Trans. Neural Networks.

[32] Ching Y. Suen,et al. A novel hybrid CNN-SVM classifier for recognizing handwritten digits , 2012, Pattern Recognit..

[33] Ian T. Jolliffe,et al. Principal Component Analysis , 2002, International Encyclopedia of Statistical Science.

[34] Lei Zhang,et al. Sparse representation or collaborative representation: Which helps face recognition? , 2011, 2011 International Conference on Computer Vision.

[35] Thomas Hofmann,et al. Greedy Layer-Wise Training of Deep Networks , 2007 .

[36] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[37] Larry S. Davis,et al. Learning a discriminative dictionary for sparse coding via label consistent K-SVD , 2011, CVPR 2011.

[38] Sameer A. Nene,et al. Columbia Object Image Library (COIL100) , 1996 .

[39] B. Kégl,et al. Fast boosting using adversarial bandits , 2010, ICML.

[40] Claudio Gentile,et al. A New Approximate Maximal Margin Classification Algorithm , 2002, J. Mach. Learn. Res..

[41] Jakub M. Tomczak,et al. Learning Invariant Features Using Subspace Restricted Boltzmann Machine , 2016, Neural Processing Letters.

[42] 刘宝弟. Class Specific Centralized Dictionary Learning for Face Recognition , 2016 .

[43] Feiping Nie,et al. Discriminative Least Squares Regression for Multiclass Classification and Feature Selection , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[44] Liang-Tien Chia,et al. Laplacian Sparse Coding, Hypergraph Laplacian Sparse Coding, and Applications , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.