Three different probabilistic language models: comparison and combination

The authors outline the different problems that arise when using a statistical language model for speech recognition, especially for inflected languages such as French, Italian or German. After a brief review of two classical models (TriPOS and Trigram), the authors present a refinement of the morphological language model (Trilemma). They give the different methods used to evaluate performances. They discuss combination experiments between two of these three building blocks and present a model which takes advantage of all three models through a backing-off strategy. Assuming the same vocabulary (20000 forms), experiments show equivalent results using either a classical trigram language model or a trilemma model. The second model can be extended to a full dictionary containing all the inflected forms of each lemma, whereas the first needs a large amount of data to perform such a task.<<ETX>>

[1]  Vishwa Gupta,et al.  Three probabilistic language models for a large-vocabulary speech recognizer , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[2]  Marco Ferretti,et al.  Language model and acoustic model information in probabilistic speech recognition , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[3]  J. Mariani,et al.  Recent advances in speech processing , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[4]  Bernard Mérialdo,et al.  Natural Language Modeling for Phoneme-to-Text Transcription , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  A.-M. Derouault,et al.  A morphological model for large vocabulary speech recognition , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[6]  Slava M. Katz,et al.  Estimation of probabilities from sparse data for the language model component of a speech recognizer , 1987, IEEE Trans. Acoust. Speech Signal Process..

[7]  Renato De Mori,et al.  A Cache-Based Natural Language Model for Speech Recognition , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  B. Merialdo Speech recognition with very large size dictionary , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.