At Last Parsing Is Now Operational

Natural language analysis systems which combine knowledge-based and corpus- based methods are now becoming accurate enough to be used in various applications. We de- scribe one such parsing system for Dutch, known as Alpino, and we show how corpus-based methods are essential to obtain accurate knowledge-based parsers. In particular we show a variety of cases where large amounts of parser output are used to improve the parser. Mots-clefs :

[1]  Miles Osborne,et al.  Estimation of Stochastic Attribute-Value Grammars using an Informative Sample , 2000, COLING.

[2]  L. J. van der Beek,et al.  Argument Order Alternations in Dutch , 2004 .

[3]  Tsujii Jun'ichi,et al.  Maximum entropy estimation for feature forests , 2002 .

[4]  María Begoña Villada Moirón,et al.  University of Groningen Data-driven identification of fixed expressions and their modifiability , 2005 .

[5]  Gosse Bouma Reasoning over Dependency Relations for QA , 2005 .

[6]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[7]  Robbert Prins,et al.  Finite-state pre-processing for natural language analysis , 2005 .

[8]  Valentin Jijkoun,et al.  Information Extraction for Question Answering: Improving Recall Through Syntactic Patterns , 2004, COLING.

[9]  Antonio Cisternino,et al.  PiQASso: Pisa Question Answering System , 2001, TREC.

[10]  Werkgroep Frequentie-onderzoek van het Nederlands,et al.  Woordfrequenties in geschreven en gesproken Nederlands , 1975 .

[11]  Mark Johnson,et al.  Dynamic programming for parsing and estimation of stochastic unification-based grammars , 2002, ACL.

[12]  Gertjan van Noord Robust Parsing of Word Graphs , 2001 .

[13]  Mark-Jan Nederhof,et al.  Squibs and Discussions: Weighted Deductive Parsing and Knuth’s Algorithm , 2003, CL.

[14]  Diego Mollá Aliod,et al.  Answerfinder: Question Answering by Combining Lexical, Syntactic and Semantic Information , 2004, ALTA.

[15]  Gosse Bouma,et al.  Querying Dependency Treebanks in XML , 2002, LREC.

[16]  Gosse Bouma Third Workshop on Treebanks and Linguistic Theories , 2004 .

[17]  Gertjan van Noord,et al.  The Alpino Dependency Treebank , 2001, CLIN.

[18]  Jean-Claude Junqua,et al.  Robustness in Language and Speech Technology , 2001, Text, Speech and Language Technology.

[19]  Gertjan van Noord Error Mining for Wide-Coverage Grammar Engineering , 2004, ACL.

[20]  Ted Briscoe,et al.  Relational evaluation schemes , 2002 .

[21]  Gosse Bouma,et al.  Treebank evidence for the analysis of PP-fronting , 2004 .

[22]  Lonneke van der Plas,et al.  Syntactic Contexts for Finding Semantically Related Words , 2004, CLIN.

[23]  Jörg Tiedemann,et al.  Question Answering for Dutch using Dependency Relations , 2005, CLEF.

[24]  Frederick Jelinek,et al.  Statistical methods for speech recognition , 1997 .

[25]  Hans-Ulrich Krieger,et al.  A Bag of Useful Techniques for Efficient and Robust Parsing , 1999, ACL.

[26]  Jan Daciuk Finite State Tools for Natural Language Processing , 2000, COLING 2000.

[27]  Ted Briscoe,et al.  Apportioning Development Effort in a Probabilistic LR Parsing System Through Evaluation , 1996, EMNLP.

[28]  Glenn Carroll,et al.  Taggers for Parsers , 1996, Artif. Intell..

[29]  Jimmy J. Lin,et al.  Selectively Using Relations to Improve Precision in Question Answering , 2003 .

[30]  Lonneke van der Plas,et al.  Automatic Acquisition of Lexico-semantic Knowledge for QA , 2005, IJCNLP.

[31]  Mark Johnson,et al.  Estimators for Stochastic “Unification-Based” Grammars , 1999, ACL.

[32]  Munchausen. English,et al.  Baron Munchausen's narrative of his marvelous travels and campaigns in Russia , 1928 .

[33]  Kenneth C. Litkowski,et al.  Use of Metadata for Question Answering and Novelty Tasks , 2003, TREC.

[34]  Steven P. Abney Stochastic Attribute-Value Grammars , 1996, CL.

[35]  Robert Malouf,et al.  Wide Coverage Parsing with Stochastic Attribute Value Grammars , 2004 .

[36]  Günther Görz,et al.  Towards understanding spontaneous speech: word accuracy vs. concept accuracy , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[37]  Nelleke Oostdijk,et al.  The Spoken Dutch Corpus. Overview and First Evaluation , 2000, LREC.

[38]  Atro Voutilainen Does tagging help parsing? A case study on finite state parsing , 1998 .

[39]  Oliver Wauschkuhn,et al.  The Influence of Tagging on the Results of Partial Parsing in German Corpora , 1995, IWPT.

[40]  Mark Johnson,et al.  Parsing the Wall Street Journal using a Lexical-Functional Grammar and Discriminative Estimation Techniques , 2002, ACL.

[41]  Gertjan van Noord An Efficient Implementation of the Head-Corner Parser , 1997, CL.