Speaker adaptation for demi-syllable based continuous density HMM

A novel speaker adaptation method is proposed for speech recognition systems that use continuous density hidden Markov models (HMMs). It is a supervised adaptation method in which the HMM parameters are modified for new speakers. It is effective not only for recognition units that have training samples, but also for units that have none, because the parameters of units without training samples are estimated by an interpolation technique of the kind often used in unsupervised adaptation. The effectiveness of the proposed method was evaluated by large vocabulary word recognition experiments, which were carried out under a demi-syllable-based speaker-dependent speech recognition system. The proposed method is also shown to be effective when applied to a speaker-independent system, for which recognition accuracy improved by an average of 2.9% with 50 words of training data.
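The interpolation idea summarized in the abstract can be pictured with a minimal sketch. The abstract does not give the formulation, so everything specific here is an assumption made for illustration: the function name `interpolate_shifts`, the representation of each recognition unit by a single mean vector, the Gaussian distance weighting, and the `smoothing` parameter. It is not the authors' algorithm. Units with adaptation samples keep their re-estimated means; units without samples receive a shift that is a distance-weighted average of the shifts observed for the units that were seen.

```python
import math


def interpolate_shifts(si_means, adapted_means, smoothing=1.0):
    """Estimate adapted means for every unit (hypothetical helper).

    si_means:      dict unit -> list of floats, means of the initial model.
    adapted_means: dict unit -> list of floats, means re-estimated from the
                   new speaker's data (only for units that had samples).
    smoothing:     assumed kernel width controlling how far a shift spreads.
    """
    # Shift vector (adapted mean minus initial mean) for each seen unit.
    shifts = {
        u: [a - b for a, b in zip(adapted_means[u], si_means[u])]
        for u in adapted_means
    }

    result = {}
    for unit, mean in si_means.items():
        if unit in adapted_means:
            # Seen unit: use the re-estimated mean directly.
            result[unit] = list(adapted_means[unit])
            continue

        # Unseen unit: weight each seen unit by closeness in the
        # initial model's parameter space (assumed Gaussian kernel).
        weights = {}
        for u in shifts:
            d2 = sum((x - y) ** 2 for x, y in zip(mean, si_means[u]))
            weights[u] = math.exp(-d2 / smoothing)
        total = sum(weights.values())

        if total == 0.0:
            # No usable neighbour: leave the mean unchanged.
            result[unit] = list(mean)
            continue

        shift = [
            sum(weights[u] * shifts[u][k] for u in shifts) / total
            for k in range(len(mean))
        ]
        result[unit] = [m + s for m, s in zip(mean, shift)]
    return result
```

With two seen units equally distant from an unseen one, the unseen unit simply receives the average of the two observed shifts; a smaller `smoothing` value makes the nearest seen unit dominate.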
