Improved speaker adaptation using text dependent spectral mappings

A novel text-dependent probabilistic spectral mapping method is presented for rapid speaker adaptation. The algorithm has been tested on the DARPA 1000-word resource management database with a grammar perplexity of 60. It performs significantly better than previous algorithms and, using two minutes of adaptation speech, achieves a word error rate less than twice that of speaker-dependent training.
