Paper information - Estimation of probabilities from sparse data for the language model component of a speech recognizer


The description of a novel type of m-gram language model is given. The model offers, via a nonlinear recursive procedure, a computation and space efficient solution to the problem of estimating probabilities from sparse data. This solution compares favorably to other proposed methods. While the method has been developed for and successfully implemented in the IBM Real Time Speech Recognizers, its generality makes it applicable in other areas where the problem of estimating probabilities from sparse data arises.

Slava M. Katz

[1] I. Good. The Population Frequencies of Species and the Estimation of Population Parameters, 1953.

[2] H. Robbins. An Empirical Bayes Approach to Statistics, 1956.

[3] Frederick Jelinek, et al. Interpolated estimation of Markov source parameters from sparse data, 1980.

[4] A. Nadas, et al. Estimation of probabilities in the language model of the IBM speech recognition system, 1984.

[5] Arthur Nádas, et al. On Turing's formula for word probabilities, 1985, IEEE Trans. Acoust. Speech Signal Process.

[6] Frederick Jelinek, et al. A real-time, isolated-word, speech recognition system for dictation transcription, 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[7] Amir Averbuch, et al. An IBM PC based large-vocabulary isolated-utterance speech recognizer, 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.