Learning Parts-based Representations with Nonnegative Restricted Boltzmann Machine

The success of any machine learning system depends critically on effective representations of data. In many cases, especially in vision, it is desirable that a representation scheme uncovers the parts-based, additive nature of the data. Among current representation learning schemes, restricted Boltzmann machines (RBMs) have proved highly effective in unsupervised settings. However, when it comes to parts-based discovery, RBMs do not usually produce satisfactory results. We enhance this capacity of RBMs by introducing nonnegativity into the model weights, resulting in a variant called the nonnegative restricted Boltzmann machine (NRBM). The NRBM not only produces a controllable decomposition of data into interpretable parts but also offers a way to estimate the intrinsic nonlinear dimensionality of data. We demonstrate the capacity of our model on well-known datasets of handwritten digits, faces and documents. The decomposition quality on images is comparable with, or better than, that produced by nonnegative matrix factorisation (NMF), and the thematic features uncovered from text are qualitatively interpretable in a manner similar to that of latent Dirichlet allocation (LDA). However, the learnt features, when used for classification, …
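The abstract's core idea is a constraint, not a new architecture: keep the RBM's connection weights nonnegative so that hidden units can only add, never subtract, mass in a reconstruction, which pushes the learnt filters toward additive, parts-like features. As a minimal sketch of how such a constraint can be imposed (the paper's exact training scheme is not given here; the projection step below is an assumption, and all names and hyperparameters are hypothetical), the following trains a binary RBM with one step of contrastive divergence (CD-1) and clips the weights to the nonnegative orthant after each update:

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def cd1_nonneg_update(W, b, c, v0, lr=0.05):
    """One CD-1 step for a binary RBM, keeping the weights nonnegative.

    W : (n_visible, n_hidden) weight matrix
    b : (n_visible,) visible bias; c : (n_hidden,) hidden bias
    v0: (batch, n_visible) batch of binary data vectors
    """
    # Positive phase: hidden probabilities given the data.
    h0 = sigmoid(v0 @ W + c)
    # Negative phase: one Gibbs step (sample hiddens, reconstruct visibles).
    h_samp = (rng.random(h0.shape) < h0).astype(float)
    v1 = sigmoid(h_samp @ W.T + b)
    h1 = sigmoid(v1 @ W + c)
    # Contrastive-divergence gradient estimates (positive minus negative phase).
    n = v0.shape[0]
    W += lr * (v0.T @ h0 - v1.T @ h1) / n
    b += lr * (v0 - v1).mean(axis=0)
    c += lr * (h0 - h1).mean(axis=0)
    # Nonnegativity: project the weights onto W >= 0 (biases stay unconstrained).
    np.maximum(W, 0.0, out=W)
    return W, b, c
```

Because every weight is nonnegative, each active hidden unit can only contribute positively to a reconstruction, mirroring the additive, parts-based decomposition the abstract compares against NMF.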

[1] Yee Whye Teh, et al. Rate-coded Restricted Boltzmann Machines for Face Recognition, 2000, NIPS.

[2] D. K. Smith, et al. Numerical Optimization, 2001, J. Oper. Res. Soc.

[3] Geoffrey E. Hinton, et al. Reducing the Dimensionality of Data with Neural Networks, 2006, Science.

[4] Geoffrey E. Hinton. A Practical Guide to Training Restricted Boltzmann Machines, 2012, Neural Networks: Tricks of the Trade.

[5] Thomas L. Griffiths, et al. Infinite latent feature models and the Indian buffet process, 2005, NIPS.

[6] Pascal Vincent, et al. Representation Learning: A Review and New Perspectives, 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7] Victoria Stodden, et al. When Does Non-Negative Matrix Factorization Give a Correct Decomposition into Parts?, 2003, NIPS.

[8] Jochen J. Steil, et al. Online learning and generalization of parts-based image representations by non-negative sparse autoencoders, 2012, Neural Networks.

[9] Yann LeCun, et al. The MNIST database of handwritten digits, 2005.

[10] Geoffrey E. Hinton, et al. Generative models for discovering sparse distributed representations, 1997, Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences.

[11] Peter V. Gehler, et al. The rate adapting Poisson model for information retrieval and object recognition, 2006, ICML.

[12] Chih-Jen Lin, et al. LIBSVM: A library for support vector machines, 2011, TIST.

[13] H. Sebastian Seung, et al. Learning the parts of objects by non-negative matrix factorization, 1999, Nature.

[14] Geoffrey E. Hinton. Training Products of Experts by Minimizing Contrastive Divergence, 2002, Neural Computation.

[15] Paul Smolensky, et al. Information processing in dynamical systems: foundations of harmony theory, 1986.

[16] Jiawei Han, et al. Document clustering using locality preserving indexing, 2005, IEEE Transactions on Knowledge and Data Engineering.

[17] Geoffrey E. Hinton, et al. Visualizing Data using t-SNE, 2008.

[18] David Haussler, et al. Unsupervised learning of distributions on binary vectors using two layer networks, 1991, NIPS.

[19] Dan Roth, et al. Learning to detect objects in images via a sparse, part-based representation, 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20] Patrik O. Hoyer, et al. Non-negative Matrix Factorization with Sparseness Constraints, 2004, J. Mach. Learn. Res.

[21] Michael I. Jordan, et al. Latent Dirichlet Allocation, 2003, J. Mach. Learn. Res.