论文信息 - A novel speaker adaptation algorithm and its implementation on a RISC microprocessor

A novel speaker adaptation algorithm and its implementation on a RISC microprocessor

We have developed speech recognition middleware on a RISC microprocessor. The speech recognition function is required in many applications of RISC microprocessors, such as ear navigation systems and handheld PCs. The speech recognition middleware provides a fundamental library for developers to make those applications. Speaker adaptation is one of the most important functions to realize robust recognition performance. As part of the speech recognition middleware, we have developed a new speaker adaptation algorithm, in which the relationships among HMM (hidden Markov model) transfer vectors are provided as a set of pre-trained interpolation coefficients. Experimental evaluations showed promising results that 28% of recognition errors are reduced using 10 words for adaptation and 52% are reduced using 50 words.

Yasunari Obuchi | Nobuo Hataoka | Akio Amano

[1] Sadaoki Furui,et al. A training procedure for isolated word recognition systems , 1980 .

[2] Koichi Shinoda,et al. Speaker adaptation with autonomous control using tree structure , 1995, EUROSPEECH.

[3] Koichi Shinoda,et al. Speaker adaptation for demi-syllable based continuous density HMM , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[4] Tetsuo Kosaka,et al. Tree-structured speaker clustering for fast speaker adaptation , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[5] Biing-Hwang Juang,et al. A study on speaker adaptation of the parameters of continuous density hidden Markov models , 1991, IEEE Trans. Signal Process..

[6] Akito Monden,et al. Speaker adaptation fitting training data size and contents , 1995, EUROSPEECH.

[7] Tetsuo Kosaka,et al. Speaker adaptation based on transfer vector field smoothing using maximum a posteriori probability estimation , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[8] Shigeki Sagayama,et al. Speaker adaptation based on transfer vector field smoothing with continuous mixture density HMMs , 1992, ICSLP.