论文信息 - The SSI large-vocabulary speaker-independent continuous speech recognition system

The SSI large-vocabulary speaker-independent continuous speech recognition system

The Speech Systems Incorporated (SSI) commercial, large-vocabulary, speaker-independent, continuous speech recognition system is described. The system utilizes a novel approach to speech representation: a two-stage encoding of speech, with an intervening compression of acoustic frames (segmentation) between the encoding stages, and a linguistic decoding process suitable for large, variable-duration segments. Binary decision trees trained using the maximum mutual information (MMI) criterion serve as encoders. The features used in encoding are listed, and their ability to discriminate the phonetic content of the speech is analyzed. Recognition results are given for a speaker-independent continuous speech, grammar-constrained radiology reporting product, and for an isolated-word grammar of high perplexity.<<ETX>>

[1] William S. Meisel,et al. Experiments with Tree-Structured MMI Encoders on the RM Task , 1990, HLT.

[2] William S. Meisel,et al. An Algorithm for Constructing Optimal Binary Decision Trees , 1977, IEEE Transactions on Computers.

[3] Kai-Fu Lee,et al. Automatic Speech Recognition , 1989 .

[4] William H. Press,et al. Numerical recipes , 1990 .

[5] William S. Meisel,et al. A Partitioning Algorithm with Application in Pattern Classification and the Optimization of Decision Trees , 1973, IEEE Transactions on Computers.

[6] Hsiao-Wuen Hon,et al. Speaker-independent phone recognition using hidden Markov models , 1989, IEEE Trans. Acoust. Speech Signal Process..