A real-time Chinese speech recognition system with unlimited vocabulary

A Chinese speech recognizer with unlimited vocabulary is described. The system has two major components: the acoustic recognition component, which includes an HMM (hidden Markov model)-based phone recognizer, a NN (neural network)-based initial refiner, and a NN-based tone classifier; and the lexical and homonym processor, which is based on a knowledge database extracted from large amounts of texts. This real-time recognizer is implemented on a PC-386 enhanced by only one digital signal processing board on which a TMS-320c25 chip operates as the CPU. On average, it takes only 0.19 s to recognize a one-syllable word. The recognition accuracy for syllables, tones, and words is 92.5%, 99.6%, and 97.5%, respectively.<<ETX>>

[1]  Biing-Hwang Juang,et al.  Mixture autoregressive hidden Markov models for speech signals , 1985, IEEE Trans. Acoust. Speech Signal Process..

[2]  Yuqing Gao,et al.  A model block-training method for HMM-based speech recognition systems , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[3]  Yu-qing Gao,et al.  The speech recognition technique for the whole Chinese vocabulary , 1988, [1988 Proceedings] 9th International Conference on Pattern Recognition.