Probabilistic GLR Parsing

This chapter presents a new formalization of probabilistic GLR language modeling for statistical parsing. Our model inherits its essential features from Briscoe and Carroll’s generalized probabilistic LR model (Briscoe and Carroll 1993), which takes context of parse derivation into account by assigning a probability to each LR parsing action according to its left and right context. Briscoe and Carroll’s model, however, has a drawback in that it is not formalized in any probabilistically well-founded way, which may degrade its parsing performance. Our formulation overcomes this drawback with a few significant refinements, while maintaining all the advantages of Briscoe and Carroll’s modeling. We discuss the formal and qualitative aspects of our model, illustrating the qualitative differences between Briscoe and Carroll’s model and our model, and their expected impact on parsing performance.

[1]  Michael Collins,et al.  Three Generative, Lexicalised Models for Statistical Parsing , 1997, ACL.

[2]  Ted Briscoe,et al.  Generalized Probabilistic LR Parsing of Natural Language (Corpora) with Unification-Based Grammars , 1993, CL.

[3]  Masakazu Fujio,et al.  Japanese Dependency Structure Analysis based on Lexicalized Statistics , 1998, EMNLP.

[4]  Nigel P. Chapman,et al.  LR Parsing: Theory and Practice , 1988 .

[5]  Ted Briscoe,et al.  Probabilistic Normalisation and Unpacking of Packed Parse Forests for Unification-based Grammars , 1992 .

[6]  Ted Briscoe,et al.  Can Subcategorisation Probabilities Help a Statistical Parser , 1998, VLC@COLING/ACL.

[7]  Kenji Kita,et al.  Spoken Sentence Recognition Based on HMM-LR with Hybrid Language Modeling (Special Issue on Natural Language Processing and Understanding) , 1994 .

[8]  Mitchell P. Marcus,et al.  Pearl: A Probabilistic Chart Parser , 1991, EACL.

[9]  Yves Schabes,et al.  Stochastic Lexicalized Tree-adjoining Grammars , 1992, COLING.

[10]  Keh-Yih Su,et al.  GLR Parsing with Scoring , 1991 .

[11]  Eugene Charniak,et al.  Statistical Parsing with a Context-Free Grammar and Word Statistics , 1997, AAAI/IAAI.

[12]  Hang Li A Probabilistic Disambiguation Method Based on Psycholinguistic Principles , 1996, VLC@COLING.

[13]  M. Tomita Generalized LR Parsing , 1991, Springer US.

[14]  E. N. Wrigley,et al.  GLR Parsing With Probability , 1991 .

[15]  Kentaro Inui,et al.  Empirical Support for New Probabilistic Generalized LR Parsing , 1999 .

[16]  John D. Lafferty,et al.  Towards History-based Grammars: Using Richer Models for Probabilistic Parsing , 1993, ACL.

[17]  Ralph Grishman,et al.  A Corpus-based Probabilistic Grammar with Only Two Non-terminals , 1995, IWPT.

[18]  Masaru Tomita,et al.  Efficient parsing for natural language , 1985 .

[19]  Michael Collins,et al.  A New Statistical Parser Based on Bigram Lexical Dependencies , 1996, ACL.

[20]  Carl Vogel,et al.  Proceedings of the 16th International Conference on Computational Linguistics , 1996, COLING 1996.