Products of Hidden Markov Models: It Takes N>1 to Tango

Products of Hidden Markov Models (PoHMMs) are an interesting class of generative models that has received little attention since its introduction. This may be due in part to their gradient-based learning algorithm, which is more computationally expensive than that of standard HMMs, and to the intractability of computing the log-likelihood of sequences under the model. In this paper, we demonstrate that the partition function can be estimated reliably via Annealed Importance Sampling. We perform experiments using contrastive divergence learning on rainfall data and on data captured from pairs of people dancing. Our results suggest that advances in learning and evaluation for undirected graphical models, together with recent increases in available computing power, make PoHMMs worth considering for complex time-series modeling tasks.
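As a minimal illustration of the partition-function estimator mentioned above, the sketch below applies Annealed Importance Sampling to a toy one-dimensional problem rather than a PoHMM: it estimates the normalizer of an unnormalized Gaussian target by annealing from a standard normal base along a geometric path. All function names and parameter settings here are illustrative assumptions, not the paper's actual setup.

```python
import numpy as np

def ais_log_z(sigma=3.0, n_chains=2000, n_steps=100, seed=0):
    """Toy AIS estimate of log Z for f1(x) = exp(-x^2 / (2*sigma^2)).

    The true answer is log(sigma * sqrt(2*pi)). We anneal from the
    standard normal base f0(x) = exp(-x^2 / 2), whose normalizer
    Z0 = sqrt(2*pi) is known, along the geometric path
    f_beta = f0^(1-beta) * f1^beta.
    """
    rng = np.random.default_rng(seed)
    betas = np.linspace(0.0, 1.0, n_steps + 1)

    def log_f(x, beta):
        # log of the unnormalized intermediate density at inverse temperature beta
        return (1 - beta) * (-0.5 * x**2) + beta * (-0.5 * x**2 / sigma**2)

    x = rng.standard_normal(n_chains)   # exact samples from the base distribution
    log_w = np.zeros(n_chains)          # running log importance weights
    for b_prev, b in zip(betas[:-1], betas[1:]):
        # accumulate the weight update for moving from beta_{k-1} to beta_k
        log_w += log_f(x, b) - log_f(x, b_prev)
        # one Metropolis step that leaves the intermediate distribution invariant
        prop = x + rng.standard_normal(n_chains)
        accept = np.log(rng.random(n_chains)) < log_f(prop, b) - log_f(x, b)
        x = np.where(accept, prop, x)

    # log(Z1 / Z0) ~= log of the mean importance weight; add the known log Z0
    log_z0 = 0.5 * np.log(2 * np.pi)
    return log_z0 + np.logaddexp.reduce(log_w) - np.log(n_chains)

est = ais_log_z()
true_log_z = np.log(3.0 * np.sqrt(2 * np.pi))
```

For a PoHMM the same recipe applies, but each intermediate distribution is a tempered product model and the transition operator must mix over hidden state sequences; the scalar example only shows the weight-accumulation mechanics.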