Online adaptive learning of continuous-density hidden Markov models based on multiple-stream prior evolution and posterior pooling

We introduce a new adaptive Bayesian learning framework, called multiple-stream prior evolution and posterior pooling, for online adaptation of the continuous density hidden Markov model (CDHMM) parameters. Among three architectures we proposed for this framework, we study in detail a specific two stream system where linear transformations are applied to the mean vectors of the CDHMMs to control the evolution of their prior distribution. This new stream of prior distribution can be combined with another stream of prior distribution evolved without any constraints applied. In a series of speaker adaptation experiments on the task of continuous Mandarin speech recognition, we show that the new adaptation algorithm achieves a similar fast-adaptation performance as that of the incremental maximum likelihood linear regression (MLLR) in the case of small amount of adaptation data, while maintains the good asymptotic convergence property as that of our previously proposed quasi-Bayes adaptation algorithms.

[1]  Chin-Hui Lee,et al.  Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains , 1994, IEEE Trans. Speech Audio Process..

[2]  Vassilios Digalakis,et al.  Speaker adaptation using combined transformation and Bayesian methods , 1996, IEEE Trans. Speech Audio Process..

[3]  Keikichi Hirose,et al.  Robust speech recognition based on a Bayesian prediction approach , 1999, IEEE Trans. Speech Audio Process..

[4]  Yunxin Zhao Self-learning speaker and channel adaptation based on spectral variation source decomposition , 1996, Speech Commun..

[5]  L. Deng Ieee Transactions on Speech and Audio Processing, Speech Trajectory Discrimination Using the Minimum Classiication Error Learning , 1997 .

[6]  Jun-ichi Takahashi,et al.  Vector-field-smoothed Bayesian learning for fast and incremental speaker/telephone-channel adaptation , 1997, Comput. Speech Lang..

[7]  Jen-Tzung Chien,et al.  Online hierarchical transformation of hidden Markov models for speech recognition , 1999, IEEE Trans. Speech Audio Process..

[8]  Keikichi Hirose,et al.  Improving Viterbi Bayesian predictive classification via sequential bayesian learning in robust speech recognition , 1999, Speech Commun..

[9]  Tetsuo Kosaka,et al.  Speaker adaptation based on transfer vector field smoothing using maximum a posteriori probability estimation , 1996, Comput. Speech Lang..

[10]  Philip C. Woodland,et al.  Flexible speaker adaptation for large vocabulary speech recognition , 1995, EUROSPEECH.

[11]  Philip C. Woodland,et al.  Combined Bayesian and predictive techniques for rapid speaker adaptation of continuous density hidden Markov models , 1997, Comput. Speech Lang..

[12]  Vassilios Digalakis,et al.  A comparative study of speaker adaptation techniques , 1995, EUROSPEECH.

[13]  Chin-Hui Lee,et al.  On-line adaptive learning of the correlated continuous density hidden Markov models for speech recognition , 1998, IEEE Trans. Speech Audio Process..

[14]  Yunxin Zhao,et al.  On-Line Bayesian Tree-Structured Transformation of Hidden Markov Models for Speaker Adaptation , 1999 .

[15]  Chin-Hui Lee,et al.  Bayesian adaptive learning of the parameters of hidden Markov model for speech recognition , 1995, IEEE Trans. Speech Audio Process..

[16]  Robert M. Gray,et al.  An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[17]  Chin-Hui Lee,et al.  Joint maximum a posteriori estimation of transformation and hidden Markov model parameters , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[18]  Stephen Cox,et al.  Predictive speaker adaptation in speech recognition , 1995, Comput. Speech Lang..

[19]  Philip C. Woodland,et al.  Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models , 1995, Comput. Speech Lang..

[20]  Biing-Hwang Juang,et al.  A study on speaker adaptation of the parameters of continuous density hidden Markov models , 1991, IEEE Trans. Signal Process..

[21]  Vassilios Digalakis,et al.  Online adaptation of hidden Markov models using incremental estimation algorithms , 1997, IEEE Trans. Speech Audio Process..

[22]  Vassilios Digalakis,et al.  Speaker adaptation using constrained estimation of Gaussian mixtures , 1995, IEEE Trans. Speech Audio Process..

[23]  Chin-Hui Lee,et al.  On-line adaptation of the SCHMM parameters based on the segmental quasi-Bayes learning for speech recognition , 1996, IEEE Trans. Speech Audio Process..

[24]  Chin-Hui Lee,et al.  A maximum-likelihood approach to stochastic matching for robust speech recognition , 1996, IEEE Trans. Speech Audio Process..

[25]  Mark J. F. Gales,et al.  Mean and variance adaptation within the MLLR framework , 1996, Comput. Speech Lang..

[26]  Yiqing Zu Sentence design for speech synthesis and speech recognition database by phonetic rules , 1997, EUROSPEECH.

[27]  Chin-Hui Lee,et al.  A Bayesian predictive classification approach to robust speech recognition , 2000, IEEE Trans. Speech Audio Process..

[28]  Chin-Hui Lee,et al.  Unsupervised adaptation using structural Bayes approach , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[29]  Mark J. F. Gales,et al.  Maximum likelihood linear transformations for HMM-based speech recognition , 1998, Comput. Speech Lang..

[30]  Jen-Tzung Chien,et al.  A hybrid algorithm for speaker adaptation using MAP transformation and adaptation , 1997 .

[31]  Chin-Hui Lee,et al.  On-line adaptive learning of the continuous density hidden Markov model based on approximate recursive Bayes estimate , 1997, IEEE Trans. Speech Audio Process..

[32]  George Zavaliagkos,et al.  Batch, incremental and instantaneous adaptation techniques for speech recognition , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[33]  S. Kullback,et al.  Information Theory and Statistics , 1959 .