Language modeling is a key component in speech and handwriting recognition. N-gram language modeling is used as the formalism of choice for a wide range of domains. Although a high order N can reduce perplexity greatly, it is unrealistic in many practical cases to get statistically reliable N -grams. We propose an interpolated model by introducing signal words and clue words into the baseline N -gram model. The initial word in a word pair with high mutual information is chosen as a signal word. In the same way, we de ne such words that have high mutual information with a certain morphological form as clue words. In a given context, we select a signal word with the highest score to compute the probability of the current word, and a clue word with the highest score to estimate the probability of the form of the current word. We discuss the basic requirements of designing an interpolating language model and see how our models satisfy the requirements. We got considerable reduction in perplexity, compared to the baseline model. Because both signal words and clue words are easy to collect and handle, the proposed method is practical.
[1]
Jerome R. Bellegarda,et al.
Exploiting both local and global constraints for multi-span statistical language modeling
,
1998,
Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).
[2]
Roberto Pieraccini,et al.
Non-deterministic stochastic language models for speech recognition
,
1995,
1995 International Conference on Acoustics, Speech, and Signal Processing.
[3]
Isabelle Guyon,et al.
Design of a linguistic postprocessor using variable memory length Markov models
,
1995,
Proceedings of 3rd International Conference on Document Analysis and Recognition.
[4]
Pascale Fung,et al.
A monolingual semantic decoder based on word sense disambiguation for mixed language understanding
,
1999,
EUROSPEECH.
[5]
Robert L. Mercer,et al.
Class-Based n-gram Models of Natural Language
,
1992,
CL.
[6]
Kamel Smaïli,et al.
A new based distance language model for a dictation machine: application to MAUD
,
1999,
EUROSPEECH.