论文信息 - Rao-Blackwellised Gibbs sampling for switching linear dynamical systems

Rao-Blackwellised Gibbs sampling for switching linear dynamical systems

This paper describes the application of Rao-Blackwellised Gibbs sampling (RBGS) to speech recognition using switching linear dynamical systems (SLDS). The SLDS is a hybrid of standard hidden Markov models (HMM) and linear dynamical systems. It is an extension of the stochastic segment model as it relaxes the assumption of independent segments. SLDS explicitly take into account the strong co-articulation present in speech. Unfortunately, inference in SLDS is intractable unless the discrete state sequence is known. RBGS is one approach that may be applied for both improved training and decoding for this form of intractable model. The theory of SLDS and RBGS is described, along with an efficient proposal mechanism. The performance of the SLDS using RBGS for training and inference is evaluated on the ARPA Resource Management task.

Mark J. F. Gales | Antti-Veikko I. Rosti | M. Gales

[1] Mark J. F. Gales,et al. Factor analysed hidden Markov models for speech recognition , 2004, Comput. Speech Lang..

[2] Christophe Andrieu,et al. Iterative algorithms for state estimation of jump Markov linear systems , 2001, IEEE Trans. Signal Process..

[3] Christian P. Robert,et al. Monte Carlo Statistical Methods , 2005, Springer Texts in Statistics.

[4] Li Deng,et al. Efficient decoding strategy for conversational speech recognition using state-space models for vocal-tract-resonance dynamics , 2001, INTERSPEECH.

[5] G. C. Wei,et al. A Monte Carlo Implementation of the EM Algorithm and the Poor Man's Data Augmentation Algorithms , 1990 .

[6] Mark J. F. Gales,et al. Switching linear dynamical systems for speech recognition , 2003 .

[7] Mari Ostendorf,et al. From HMM's to segment models: a unified view of stochastic modeling for speech recognition , 1996, IEEE Trans. Speech Audio Process..