Product of Gaussians as a distributed representation for speech recognition

Distributed representations allow the effective number of Gaussian components in a mixture model, or state of an HMM, to be increased without dramatically increasing the number of model parameters. Various forms of distributed representation have previously been investigated. In this work it shown that the product of experts (PoE) framework may be viewed as a distributed representation when the individual experts are mixtures of Gaussians. However, in contrast to the standard PoE model, the individual experts are not required to be valid distributions, thus allowing additional flexibility in the component priors and variances. The performance of PoE models when used as a distributed representation on a large vocabulary speech recognition task, SwitchBoard, is evaluated.

[1]  Peder A. Olsen,et al.  Modeling inverse covariance matrices by basis expansion , 2002, IEEE Transactions on Speech and Audio Processing.

[2]  Mark J. F. Gales Transformation streams and the HMM error model , 2002, Comput. Speech Lang..

[3]  Christopher K. I. Williams,et al.  Products of Gaussians , 2001, NIPS.

[4]  Thomas Hain,et al.  THE CU-HTK MARCH 2000 HUB5E TRANSCRIPTION SYSTEM , 2000 .

[5]  Geoffrey E. Hinton,et al.  Products of Hidden Markov Models , 2001, AISTATS.

[6]  Mark J. F. Gales,et al.  Product of Gaussians and multiple stream systems , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[7]  Michael I. Jordan,et al.  Factorial Hidden Markov Models , 1995, Machine Learning.