Evaluating probabilities under high-dimensional latent variable models

We present a simple new Monte Carlo algorithm for evaluating probabilities of observations in complex latent variable models, such as Deep Belief Networks. While the method is based on Markov chains, estimates based on short runs are formally unbiased. In expectation, the log probability of a test set will be underestimated, and this could form the basis of a probabilistic bound. The method is much cheaper than gold-standard annealing-based methods and only slightly more expensive than the cheapest Monte Carlo methods. We give examples of the new method substantially improving simple variational bounds at modest extra cost.

[1]  M. Tanner,et al.  Facilitating the Gibbs Sampler: The Gibbs Stopper and the Griddy-Gibbs Sampler , 1992 .

[2]  M. Newton Approximate Bayesian-inference With the Weighted Likelihood Bootstrap , 1994 .

[3]  S. Chib Marginal Likelihood from the Gibbs Output , 1995 .

[4]  Geoffrey E. Hinton,et al.  The EM algorithm for mixtures of factor analyzers , 1996 .

[5]  Aapo Hyvärinen,et al.  Fast and robust fixed-point algorithms for independent component analysis , 1999, IEEE Trans. Neural Networks.

[6]  Radford M. Neal Annealed importance sampling , 1998, Stat. Comput..

[7]  G. Nicholls,et al.  Bridge estimation of the probability density at a point , 2001 .

[8]  S. Chib,et al.  Marginal Likelihood From the Metropolis–Hastings Output , 2001 .

[9]  P. Moral,et al.  Sequential Monte Carlo samplers , 2002, cond-mat/0212648.

[10]  Thomas L. Griffiths,et al.  Integrating Topics and Syntax , 2004, NIPS.

[11]  A. Mira,et al.  Efficient Bayes factor estimation from the reversible jump output , 2006 .

[12]  Thomas P. Minka,et al.  Divergence measures and message passing , 2005 .

[13]  Hanna M. Wallach,et al.  Topic modeling: beyond bag-of-words , 2006, ICML.

[14]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[15]  Geoffrey E. Hinton,et al.  Modeling image patches with a directed hierarchy of Markov random fields , 2007, NIPS.

[16]  Vibhav Gogate,et al.  Studies in Lower Bounding Probabilities of Evidence using the Markov Inequality , 2007, UAI.

[17]  Ruslan Salakhutdinov,et al.  On the quantitative analysis of deep belief networks , 2008, ICML '08.