论文信息 - Learning Class-relevant Features and Class-irrelevant Features via a Hybrid third-order RBM

Learning Class-relevant Features and Class-irrelevant Features via a Hybrid third-order RBM

Restricted Boltzmann Machines are commonly used in unsupervised learning to extract features from training data. Since these features are learned for regenerating training data a classifier based on them has to be trained. If only a few of the learned features are discriminative other non-discriminative features will distract the classifier during the training process and thus waste computing resources for testing. In this paper, we present a hybrid third-order Restricted Boltzmann Machine in which class-relevant features (for recognizing) and class-irrelevant features (for generating only) are learned simultaneously. As the classification task uses only the class-relevant features, the test itself becomes very fast. We show that classirrelevant features help class-relevant features to focus on the recognition task and introduce useful regularization effects to reduce the norms of class-relevant features. Thus there is no need to use weight-decay for the parameters of this model. Experiments on the MNIST, NORB and Caltech101 Silhouettes datasets show very promising results.

Ruimin Shen | Changyong Niu | Carsten Ullrich | Heng Luo

[1] Geoffrey E. Hinton,et al. Deep Boltzmann Machines , 2009, AISTATS.

[2] Yoshua Bengio,et al. Classification using discriminative restricted Boltzmann machines , 2008, ICML '08.

[3] Geoffrey E. Hinton,et al. 3D Object Recognition with Deep Belief Nets , 2009, NIPS.

[4] Yee Whye Teh,et al. A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[5] Geoffrey E. Hinton,et al. Implicit Mixtures of Restricted Boltzmann Machines , 2008, NIPS.

[6] Marc'Aurelio Ranzato,et al. A Unified Energy-Based Framework for Unsupervised Learning , 2007, AISTATS.

[7] B. Schölkopf,et al. Modeling Human Motion Using Binary Latent Variables , 2007 .

[8] David Haussler,et al. Unsupervised learning of distributions on binary vectors using two layer networks , 1991, NIPS 1991.

[9] Geoffrey E. Hinton,et al. Exponential Family Harmoniums with an Application to Information Retrieval , 2004, NIPS.

[10] Y. LeCun,et al. Learning methods for generic object recognition with invariance to pose and lighting , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[11] Geoffrey E. Hinton,et al. Learning a Nonlinear Embedding by Preserving Class Neighbourhood Structure , 2007, AISTATS.

[12] Nando de Freitas,et al. A tutorial on stochastic approximation algorithms for training Restricted Boltzmann Machines and Deep Belief Nets , 2010, 2010 Information Theory and Applications Workshop (ITA).

[13] Geoffrey E. Hinton,et al. Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[14] T. Sejnowski. Higher‐order Boltzmann machines , 1987 .

[15] Bernhard Schölkopf,et al. Training Invariant Support Vector Machines , 2002, Machine Learning.

[16] Geoffrey E. Hinton. Training Products of Experts by Minimizing Contrastive Divergence , 2002, Neural Computation.

[17] Geoffrey E. Hinton,et al. Learning and relearning in Boltzmann machines , 1986 .

[18] Nando de Freitas,et al. Inductive Principles for Restricted Boltzmann Machine Learning , 2010, AISTATS.

[19] Yoshua Bengio,et al. Scaling learning algorithms towards AI , 2007 .