论文信息 - Adaptive Language Modeling Using the Maximum Entropy Principle

Adaptive Language Modeling Using the Maximum Entropy Principle

We describe our ongoing efforts at adaptive statistical language modeling. Central to our approach is the Maximum Entropy (ME) Principle, allowing us to combine evidence from multiple sources, such as long-distance triggers and conventional short-distance trigrams. Given consistent statistical evidence, a unique ME solution is guaranteed to exist, and an iterative algorithm exists which is guaranteed to converge to it. Among the advantages of this approach are its simplicity, its generality, and its incremental nature. Among its disadvantages are its computational requirements. We describe a succession of ME models, culminating in our current Maximum Likelihood/Maximum Entropy (ML/ME) model. Preliminary results with the latter show a 27% perplexity reduction as compared to a conventional trigram model.

[1] Ronald Rosenfeld,et al. Adaptive Statistical Language Modeling; A Maximum Entropy Approach , 1994 .

[2] Ronald Rosenfeld,et al. Trigger-based language models: a maximum entropy approach , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3] Bernard Mérialdo,et al. A Dynamic Language Model for Speech Recognition , 1991, HLT.

[4] Ronald Rosenfeld,et al. Improvements in Stochastic Language Modeling , 1992, HLT.

[5] E. Jaynes. Information Theory and Statistical Mechanics , 1957 .

[6] I. Good. THE POPULATION FREQUENCIES OF SPECIES AND THE ESTIMATION OF POPULATION PARAMETERS , 1953 .

[7] I. Good. Maximum Entropy for Hypothesis Formulation, Especially for Multidimensional Contingency Tables , 1963 .

[8] S. Kullback,et al. Information Theory and Statistics , 1959 .

[9] Robert L. Mercer,et al. Adaptive Language Modeling Using Minimum Discriminant Estimation , 1992, HLT.

[10] J. Darroch,et al. Generalized Iterative Scaling for Log-Linear Models , 1972 .