Paper information - Maximum likelihood clustering of Gaussians for speech recognition

Describes a method for clustering multivariate Gaussian distributions using a maximum likelihood criterion. The authors point out possible applications of model clustering, and then use the approach to determine classes of shared covariances for context modeling in speech recognition, achieving an order of magnitude reduction in the number of covariance parameters, with no loss in recognition performance.
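The abstract does not spell out the clustering procedure, so the following is only a minimal sketch of one standard way to cluster Gaussians into shared-covariance classes under a maximum likelihood criterion, not the authors' algorithm. It assumes diagonal covariances (a simplification), greedy bottom-up merging, and that each Gaussian keeps its own mean. The function names `merge_cost` and `cluster_covariances` are hypothetical. The merge cost is the log-likelihood lost when two classes with counts n1, n2 and ML variances are forced to share the pooled ML variance.

```python
import math

def merge_cost(n1, var1, n2, var2):
    """Log-likelihood lost when two Gaussians share one diagonal covariance.

    Each Gaussian keeps its own mean; only the variances are pooled.
    n1, n2 are sample counts; var1, var2 are per-dimension ML variances.
    """
    n = n1 + n2
    cost = 0.0
    for v1, v2 in zip(var1, var2):
        pooled = (n1 * v1 + n2 * v2) / n  # ML estimate of the shared variance
        cost += 0.5 * (n * math.log(pooled) - n1 * math.log(v1) - n2 * math.log(v2))
    return cost

def cluster_covariances(counts, variances, num_classes):
    """Greedy bottom-up clustering: repeatedly merge the cheapest pair.

    Assumption: greedy pairwise merging is used here for illustration only.
    Returns a list of (member indices, total count, shared variances).
    """
    classes = [([i], n, list(v)) for i, (n, v) in enumerate(zip(counts, variances))]
    while len(classes) > num_classes:
        best = None
        for a in range(len(classes)):
            for b in range(a + 1, len(classes)):
                c = merge_cost(classes[a][1], classes[a][2],
                               classes[b][1], classes[b][2])
                if best is None or c < best[0]:
                    best = (c, a, b)
        _, a, b = best
        members_a, na, va = classes[a]
        members_b, nb, vb = classes[b]
        pooled = [(na * x + nb * y) / (na + nb) for x, y in zip(va, vb)]
        merged = (members_a + members_b, na + nb, pooled)
        classes = [c for i, c in enumerate(classes) if i not in (a, b)] + [merged]
    return classes

if __name__ == "__main__":
    # Three toy Gaussians: the first two have similar variances.
    result = cluster_covariances(
        counts=[100, 100, 100],
        variances=[[1.0, 1.0], [1.1, 0.9], [10.0, 10.0]],
        num_classes=2,
    )
    for members, n, var in result:
        print(sorted(members), n, var)
```

With full covariance matrices the same cost would use log-determinants in place of the per-dimension log-variances. The cost is never negative, so every merge trades some likelihood for fewer covariance parameters.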
