论文信息 - Augmenting words with linguistic information for n-gram language models

Augmenting words with linguistic information for n-gram language models

The main goal of the present work is to explore the use of rich lexical information in language modelling. We reformulated the task of a language model from predicting the next word given its history to predicting simultaneously both the word and a tag encoding various types of lexical information. Using part-of-speech tags and syntactic/semantic feature tags obtained with a set of NLP tools developed at Microsoft Research, we obtained a reduction in perplexity compared to the baseline phrase trigram model in a set of preliminary tests performed on part of the WSJ corpus.

Eric K. Ringger | Lucian Galescu

[1] Ronald Rosenfeld,et al. Statistical language modeling using the CMU-cambridge toolkit , 1997, EUROSPEECH.

[2] Thomas Niesler,et al. Combination of word-based and category-based language models , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[3] Frederick Jelinek,et al. Interpolated estimation of Markov source parameters from sparse data , 1980 .

[4] Eric Brill,et al. A Simple Rule-Based Part of Speech Tagger , 1992, HLT.

[5] J. Cleary,et al. \self-organized Language Modeling for Speech Recognition". In , 1997 .

[6] F. Jelinek,et al. Perplexity—a measure of the difficulty of speech recognition tasks , 1977 .

[7] Julian M. Kupiec,et al. Robust part-of-speech tagging using a hidden Markov model , 1992 .

[8] Lawrence R. Rabiner,et al. A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[9] F ChenStanley,et al. An Empirical Study of Smoothing Techniques for Language Modeling , 1996, ACL.

[10] Robert L. Mercer,et al. Class-Based n-gram Models of Natural Language , 1992, CL.

[11] Peter A. Heeman,et al. POS Tagging versus Classes in Language Modeling , 1998, VLC@COLING/ACL.

[12] Slava M. Katz,et al. Estimation of probabilities from sparse data for the language model component of a speech recognizer , 1987, IEEE Trans. Acoust. Speech Signal Process..

[13] Géraldine Damnati,et al. Deriving phrase-based language models , 1997, 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings.

[14] Sunil Issar. Estimation of language models for new spoken language applications , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.