论文信息 - Training restricted Boltzmann machines using approximations to the likelihood gradient

Training restricted Boltzmann machines using approximations to the likelihood gradient

A new algorithm for training Restricted Boltzmann Machines is introduced. The algorithm, named Persistent Contrastive Divergence, is different from the standard Contrastive Divergence algorithms in that it aims to draw samples from almost exactly the model distribution. It is compared to some standard Contrastive Divergence and Pseudo-Likelihood algorithms on the tasks of modeling and classifying various types of data. The Persistent Contrastive Divergence algorithm outperforms the other algorithms, and is equally fast and simple.

Tijmen Tieleman | T. Tieleman

[1] J. Besag. On the Statistical Analysis of Dirty Pictures , 1986 .

[2] Paul Smolensky,et al. Information processing in dynamical systems: foundations of harmony theory , 1986 .

[3] Radford M. Neal. Connectionist Learning of Belief Networks , 1992, Artif. Intell..

[4] L. Younes. On the convergence of markovian stochastic algorithms with rapidly decreasing ergodicity rates , 1999 .

[5] Geoffrey E. Hinton. Training Products of Experts by Minimizing Contrastive Divergence , 2002, Neural Computation.

[6] Geoffrey E. Hinton,et al. A New Learning Algorithm for Mean Field Boltzmann Machines , 2002, ICANN.

[7] Geoffrey E. Hinton,et al. Exponential Family Harmoniums with an Application to Information Retrieval , 2004, NIPS.

[8] Shimon Ullman,et al. Combining Top-Down and Bottom-Up Segmentation , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[9] Alan L. Yuille,et al. The Convergence of Contrastive Divergences , 2004, NIPS.

[10] Yann LeCun,et al. The mnist database of handwritten digits , 2005 .

[11] Miguel Á. Carreira-Perpiñán,et al. On Contrastive Divergence Learning , 2005, AISTATS.