Coping with Ambiguity and Unknown Words through Probabilistic Models

From spring 1990 through fall 1991, we performed a battery of small experiments to test the effectiveness of supplementing knowledge-based techniques with probabilistic models. This paper reports our experiments in predicting parts of speech of highly ambiguous words, predicting the intended interpretation of an utterance when more than one interpretation satisfies all known syntactic and semantic constraints, and learning caseframe informationfor verbsfrom example uses.From these experiments, we are convinced that probabilistic models based on annotated corpora can effectively reduce the ambiguity in processing text and can be used to acquire lexical informationfrom a corpus, by supplementing knowledge-based techniques.Based on the results of those experiments, we have constructed a new natural language system (PLUM) for extracting data from text, e.g., newswire text.

[1]  L. Baum,et al.  An inequality with applications to statistical estimation for probabilistic functions of Markov processes and to a model for ecology , 1967 .

[2]  Frederick Jelinek,et al.  Self-organizing language modeling for speech recognition , 1990 .

[3]  Ralph Grishman,et al.  Discovery Procedures for Sublanguage Selectional Patterns: Initial Experiments , 1986, Comput. Linguistics.

[4]  Ralph M. Weischedel,et al.  An Environment for Acquiring Semantic Information , 1987, ACL.

[5]  Slava M. Katz,et al.  Estimation of probabilities from sparse data for the language model component of a speech recognizer , 1987, IEEE Trans. Acoust. Speech Signal Process..

[6]  Steven J. DeRose,et al.  Grammatical Category Disambiguation by Statistical Optimization , 1988, CL.

[7]  Julian Kupiec,et al.  Augmenting a Hidden Markov Model for Phrase-Dependent Word Tagging , 1989, HLT.

[8]  Kenneth Ward Church A Stochastic Parts Program and Noun Phrase Parser for Unrestricted Text , 1988, ANLP.

[9]  David Stallard Unification-Based Semantic Interpretation in the BBN Spoken Language System , 1989, HLT.

[10]  Ralph Grishman,et al.  Preference Semantics for Message Understanding , 1989, HLT.

[11]  Madeleine Bates Rapid Porting of the Parlancetm Natural Language Interface , 1989, HLT.

[12]  Ralph M. Weischedel,et al.  Portability in the Janus Natural Language Interface , 1989, HLT.

[13]  Carl de Marcken,et al.  Parsing the LOB Corpus , 1990, ACL.

[14]  Steven Abney Rapid Incremental Parsing with Repair , 1990 .

[15]  Beatrice Santorini,et al.  First Steps Towards an Annotated Database of American English , 1990 .

[16]  Ralph Grishman,et al.  Statistical Parsing of Messages , 1990, HLT.

[17]  Renato De Mori,et al.  A Cache-Based Natural Language Model for Speech Recognition , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  Beth Sundheim,et al.  Overview of the Third Message Understanding Evaluation and Conference , 1991, MUC.

[19]  Mats Rooth,et al.  Structural Ambiguity and Lexical Relations , 1991, ACL.