Description of acoustic variations by hidden Markov models with tree structure

This research was sponsored in part by U S WEST and in part by the Defense Advanced Research Projects Agency (DOD), and monitored by the Space and Naval Warfare Systems Command under Contract N0003985-C-0163, ARPA Order No. 5167. The views and conclusions contained in this document are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of U S WEST, DARPA or the US government. K e y w o r d s : HMM(Hidden Markov Model), Binary-Tree Vector Quantization, Decision Tree Clustering, CART, Speaker Clustering, Smoothing.

[1]  Shigeki Sagayama,et al.  Phoneme environment clustering for speech recognition , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[2]  Kazuyo Tanaka,et al.  A large vocabulary word recognition system using rule-based network representation of acoustic characteristic variations , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[3]  Hsiao-Wuen Hon,et al.  Allophone clustering for continuous speech recognition , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[4]  Hsiao-Wuen Hon,et al.  On vocabulary-independent speech modeling , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[5]  John Makhoul,et al.  Context-dependent modeling for acoustic-phonetic recognition of continuous speech , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6]  Robert M. Gray,et al.  An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[7]  Robert M. Gray,et al.  Vector Quantizers and Predictive Quantizers for Gauss-Markov Sources , 1982, IEEE Trans. Commun..

[8]  Mei-Yuh Hwang,et al.  The SPHINX speech recognition system , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[9]  Michael Picheny,et al.  Large vocabulary natural language continuous speech recognition , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[10]  Hsiao-Wuen Hon,et al.  Towards Speech Recognition Without Vocabulary-Specific Training , 1989, HLT.

[11]  Frederick Jelinek,et al.  Interpolated estimation of Markov source parameters from sparse data , 1980 .

[12]  C. Zheng,et al.  ; 0 ; , 1951 .

[13]  Lalit R. Bahl,et al.  A tree-based statistical language model for natural language speech recognition , 1989, IEEE Trans. Acoust. Speech Signal Process..

[14]  Biing-Hwang Juang,et al.  HMM clustering for connected word recognition , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[15]  V. Rich Personal communication , 1989, Nature.