Fast speaker adaptation for speech recognition systems

Different speaker adaptation methods for speech recognition systems adapting automatically to new and unknown speakers in a short training phase are discussed. The adaptation techniques aim at transformations of feature vectors, optimized with respect to some constraints. Two different adaptation strategies are discussed. The first one is based on least mean-squared-error optimization. The second method is a codebook-driven feature transformation. Both adaptation techniques are incorporated into two different recognition systems: dynamic time warping (DTW) and hidden Markov modeling (HMM). The results show that in both systems speaker-adaptive error rates are close to speaker-dependent error rates. In the best case the mean error rate of four test speakers decreases by a factor of six compared to the interspeaker error rate without adaptation. A hardware realization of the speaker-adaptive HMM-recognizer is described.<<ETX>>

[1]  Kiyohiro Shikano,et al.  Speaker adaptation through vector quantization , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[2]  Yves Grenier,et al.  Spectral transformations through canonical correlation analysis for speaker adptation in ASR , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  Jean-Luc Gauvain,et al.  Vector quantization for speaker adaptation , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4]  Masafumi Nishimura,et al.  Speaker adaptation method for HMM-based speech recognition , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[5]  J. Makhoul,et al.  Iterative normalization for speaker-adaptive training in continuous speech recognition , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[6]  Fritz Class,et al.  Speaker adaptation for recognition systems with a large vocabulary , 1989, Proceedings. Electrotechnical Conference Integrating Research, Industry and Education in Energy and Communication Engineering',.