Wide Coverage Incremental Parsing by Learning Attachment Preferences

This paper presents a novel method for wide coverage parsing using an incremental strategy, which is psycholinguistically motivated. A recursive neural network is trained on treebank data to learn first pass attachments, and is employed as a heuristic for guidingpa rsingde cision. The parser is lexically blind and uses beam search to explore the space of plausible partial parses and returns the full analysis havinghi ghest probability. Results are based on preliminary tests on the WSJ section of the Penn treebank and suggest that our incremental strategy is a computationally viable approach to parsing.

[1]  Lyn Frazier,et al.  Syntactic processing: Evidence from dutch , 1987 .

[2]  Brian Roark,et al.  Efficient probabilistic top-down and left-corner parsing , 1999, ACL.

[3]  Raymond J. Mooney,et al.  Learning Parse and Translation Decisions from Examples with Rich Context , 1997, ACL.

[4]  Julie C. Sedivy,et al.  Eye movements as a window into real-time spoken language comprehension in natural contexts , 1995, Journal of psycholinguistic research.

[5]  Adwait Ratnaparkhi,et al.  A Linear Observed Time Statistical Parser Based on Maximum Entropy Models , 1997, EMNLP.

[6]  David Milward,et al.  Dynamic dependency grammar , 1994 .

[7]  Yuki Kamide,et al.  Incremental Pre-Head Attachment in Japanese Parsing. , 1999 .

[8]  Michael Collins,et al.  A New Statistical Parser Based on Bigram Lexical Dependencies , 1996, ACL.

[9]  Alessandro Sperduti,et al.  A general framework for adaptive processing of data structures , 1998, IEEE Trans. Neural Networks.

[10]  Eugene Charniak,et al.  Statistical Parsing with a Context-Free Grammar and Word Statistics , 1997, AAAI/IAAI.

[11]  James Henderson,et al.  Incremental Syntactic Parsing of Natural Language Corpora with Simple Synchrony Networks , 2001, IEEE Trans. Knowl. Data Eng..

[12]  WILLIAM MARSLEN-WILSON,et al.  Linguistic Structure and Speech Shadowing at Very Short Latencies , 1973, Nature.

[13]  Vincenzo Lombardo,et al.  Incrementality and Lexicalism: A Treebank Study , 2002 .

[14]  Mark Steedman,et al.  The nite connectivity of linguistic structure , 1999 .

[15]  Mark Steedman,et al.  Grammar, interpretation, and processing from the lexicon , 1989 .

[16]  Patrick Sturt,et al.  Monotonic Syntactic Processing : A Cross-linguistic Study of Attachment and Reanalysis , 1996 .

[17]  Marc Brysbaert,et al.  Exposure-based models of human parsing: Evidence for the use of coarse-grained (nonlexical) statistical records , 1995 .