Improved Gaussian-Bernoulli restricted Boltzmann machine for learning discriminative representations

Abstract Restricted Boltzmann machines (RBMs) have received considerable research interest in recent years because of their capability to discover latent representations in an unsupervised manner. The standard RBM is only suitable for processing binary-valued data. To address this limitation, the Gaussian–Bernoulli RBM (GRBM) has been designed to model real-valued data, particularly images. A GRBM that seeks to map real-valued data nonlinearly into a latent representation space is typically trained by contrastive divergence learning. However, most existing GRBM-based models neglect the inherent interpoint affinity information from the original data that can be used to enhance the expression ability of the model. In this study, a novel interpoint-affinity-based GRBM (abGRBM) is proposed to learn discriminative representations in the hidden layer. By incorporating the interpoint affinity information into the training process of the GRBM, the proposed model can not only utilize the GRBM’s powerful latent representation learning capabilities for real-valued data, it can also transform the original data into another space with improved separability. We prove the availability of our model using several image datasets of Microsoft Research Asia Multimedia for unsupervised clustering and supervised classification tasks. The experimental results show the superior performance of the GRBM in discovering discriminative representations and demonstrate the effectiveness of the affinity information.

[1]  Anil K. Jain Data clustering: 50 years beyond K-means , 2010, Pattern Recognit. Lett..

[2]  Chunxia Zhang,et al.  A new regularized restricted Boltzmann machine based on class preserving , 2017, Knowl. Based Syst..

[3]  Min Chen,et al.  Affinity-based similarity measure for Web document clustering , 2004, Proceedings of the 2004 IEEE International Conference on Information Reuse and Integration, 2004. IRI 2004..

[4]  Tao Li,et al.  Constraint Co-Projections for Semi-Supervised Co-Clustering , 2016, IEEE Transactions on Cybernetics.

[5]  Geoffrey E. Hinton,et al.  Robust Boltzmann Machines for recognition and denoising , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Yoshua Bengio,et al.  Greedy Layer-Wise Training of Deep Networks , 2006, NIPS.

[7]  Masato Okada,et al.  Dynamical analysis of contrastive divergence learning: Restricted Boltzmann machines with Gaussian visible units , 2016, Neural Networks.

[8]  Xuelong Li,et al.  Spectral Embedded Adaptive Neighbors Clustering , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[9]  Shuicheng Yan,et al.  Graph Embedding and Extensions: A General Framework for Dimensionality Reduction , 2007 .

[10]  Zenglin Xu,et al.  Robust multi-view data clustering with multi-view capped-norm K-means , 2018, Neurocomputing.

[11]  Yan Liu,et al.  Discriminative deep belief networks for visual data classification , 2011, Pattern Recognit..

[12]  W. Bruce Croft,et al.  Inference Networks for Document Retrieval , 1989, SIGIR Forum.

[13]  Gang Chen,et al.  Deep Transductive Semi-supervised Maximum Margin Clustering , 2015, ArXiv.

[14]  Jing Liu,et al.  Self-learning Local Supervision Encoding Framework to Constrict and Disperse Feature Distribution for Clustering , 2018, ArXiv.

[15]  Pascal Vincent,et al.  Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Peng Jin,et al.  Restricted Boltzmann Machines With Gaussian Visible Units Guided by Pairwise Constraints , 2017, IEEE Transactions on Cybernetics.

[17]  Nicolas Le Roux,et al.  Representational Power of Restricted Boltzmann Machines and Deep Belief Networks , 2008, Neural Computation.

[18]  Martin Ester,et al.  Frequent term-based text clustering , 2002, KDD.

[19]  Lei Wang,et al.  3D shape recognition and retrieval based on multi-modality deep learning , 2017, Neurocomputing.

[20]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[21]  Geoffrey E. Hinton,et al.  Learning Multilevel Distributed Representations for High-Dimensional Sequences , 2007, AISTATS.

[22]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[23]  Hyunsoo Lee,et al.  Learning framework of multimodal Gaussian-Bernoulli RBM handling real-value input data , 2018, Neurocomputing.

[24]  Geoffrey E. Hinton,et al.  Replicated Softmax: an Undirected Topic Model , 2009, NIPS.

[25]  Tijmen Tieleman,et al.  Training restricted Boltzmann machines using approximations to the likelihood gradient , 2008, ICML '08.

[26]  Junwei Han,et al.  Duplex Metric Learning for Image Set Classification , 2018, IEEE Transactions on Image Processing.

[27]  Miguel Á. Carreira-Perpiñán,et al.  On Contrastive Divergence Learning , 2005, AISTATS.

[28]  Tara N. Sainath,et al.  Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups , 2012, IEEE Signal Processing Magazine.

[29]  Xinhua Zhang,et al.  Hyperparameter Learning for Graph Based Semi-supervised Learning Algorithms , 2006, NIPS.

[30]  Jiancheng Lv,et al.  Cross-correlation conditional restricted Boltzmann machines for modeling motion style , 2018, Knowl. Based Syst..

[31]  Stuart Harvey Rubin,et al.  FA-MCADF: Feature Affinity Based Multiple Correspondence Analysis and Decision Fusion Framework for Disaster Information Management , 2017, 2017 IEEE International Conference on Information Reuse and Integration (IRI).

[32]  Xiaojin Zhu,et al.  --1 CONTENTS , 2006 .

[33]  Xin Li,et al.  Conditional Restricted Boltzmann Machines for Multi-label Learning with Incomplete Labels , 2015, AISTATS.

[34]  Geoffrey E. Hinton,et al.  Application of Deep Belief Networks for Natural Language Understanding , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[35]  Hemant A. Patil,et al.  Novel Unsupervised Auditory Filterbank Learning Using Convolutional RBM for Speech Recognition , 2016, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[36]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[37]  Hayat Al-Dmour,et al.  A clustering fusion technique for MR brain tissue segmentation , 2018, Neurocomputing.

[38]  Geoffrey E. Hinton,et al.  The Recurrent Temporal Restricted Boltzmann Machine , 2008, NIPS.

[39]  Svetha Venkatesh,et al.  Graph-induced restricted Boltzmann machines for document modeling , 2016, Inf. Sci..

[40]  Erik Cambria,et al.  Recent Trends in Deep Learning Based Natural Language Processing , 2017, IEEE Comput. Intell. Mag..

[41]  Zhang Yi,et al.  Graph Regularized Restricted Boltzmann Machine , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[42]  Liang Wang,et al.  Unconstrained Multimodal Multi-Label Learning , 2015, IEEE Transactions on Multimedia.

[43]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[44]  Tapani Raiko,et al.  Gaussian-Bernoulli deep Boltzmann machine , 2013, The 2013 International Joint Conference on Neural Networks (IJCNN).

[45]  Radford M. Neal Connectionist Learning of Belief Networks , 1992, Artif. Intell..

[46]  Alessandro Laio,et al.  Clustering by fast search and find of density peaks , 2014, Science.

[47]  Georgios B. Giannakis,et al.  Sketched Subspace Clustering , 2017, IEEE Transactions on Signal Processing.

[48]  Geoffrey E. Hinton Training Products of Experts by Minimizing Contrastive Divergence , 2002, Neural Computation.

[49]  Tianrui Li,et al.  Nonnegative matrix factorization for clustering ensemble based on dark knowledge , 2019, Knowl. Based Syst..

[50]  Pengtao Xie,et al.  Diversifying Restricted Boltzmann Machine for Document Modeling , 2015, KDD.

[51]  Zhang Yi,et al.  Subspace clustering using a low-rank constrained autoencoder , 2018, Inf. Sci..

[52]  Yun Xie,et al.  Affinity based information diffusion model in social networks , 2014 .

[53]  Javier Hernando,et al.  Restricted Boltzmann machines for vector representation of speech in speaker recognition , 2018, Comput. Speech Lang..

[54]  Chun-Xia Zhang,et al.  Enhancing performance of restricted Boltzmann machines via log-sum regularization , 2014, Knowl. Based Syst..

[55]  Nitish Srivastava,et al.  Multimodal learning with deep Boltzmann machines , 2012, J. Mach. Learn. Res..

[56]  Matthieu Cord,et al.  Unsupervised and Supervised Visual Codes with Restricted Boltzmann Machines , 2012, ECCV.

[57]  Delbert Dueck,et al.  Clustering by Passing Messages Between Data Points , 2007, Science.