Customizable Modular Lexicalized Parsing

Different NLP applications have different efficiency constraints (i.e. quality of the results and throughput) that reflect on each core linguistic component. Syntactic processors are basic modules in some NLP application. A customization that permits the performance control of these components enables their reuse in different application scenarios. Throughput has been commonly improved using partial syntactic processors. On the other hand, specialized lexicons are generally employed to improve the quality of the syntactic material produced by specific parsing (sub)process (e.g. verb argument detection or PP attachment disambiguation) . Building upon the idea of grammar stratification, in this paper a method to push modularity and lexical sensitivity, in parsing, in view of customizable syntactic analysers is presented. A framework for modular parser design is proposed and its main properties are discussed. Parsers (i.e. different parsing module chains) are then presented and their performances are analyzed in an application-driven scenarios.

[1]  Aravind K. Joshi,et al.  Tree-adjoining grammars and lexicalized grammars , 1992, Tree Automata and Languages.

[2]  Roberto Basili,et al.  A Shallow Syntactic Analyser to Extract Word Associations from Corpora , 1992 .

[3]  Roberto Basili,et al.  Corpus-Driven Unsupervised Learning of Verb Subcategorization Frames , 1997, AI*IA.

[4]  John T. Maxwell,et al.  Formal issues in lexical-functional grammar , 1998 .

[5]  Maria Teresa Pazienza,et al.  Information Extraction A Multidisciplinary Approach to an Emerging Information Technology , 1997, Lecture Notes in Computer Science.

[6]  Lucien Tesnière Éléments de syntaxe structurale , 1959 .

[7]  Aravind K. Joshi,et al.  Tree Adjunct Grammars , 1975, J. Comput. Syst. Sci..

[8]  Dekang Lin,et al.  A dependency-based method for evaluating broad-coverage parsers , 1995, Natural Language Engineering.

[9]  David J. Weir,et al.  D-Tree Grammars , 1995, ACL.

[10]  James Pustejovsky,et al.  Corpus processing for lexical acquisition , 1996 .

[11]  Beth Ann Hockey,et al.  XTAG System - A Wide Coverage Grammar for English , 1994, COLING.

[12]  Roberto Basili,et al.  Engineering of IE Systems: An Object-Oriented Approach , 1999, SCIE.

[13]  John D. Lafferty,et al.  A Robust Parsing Algorithm for Link Grammars , 1995, IWPT.

[14]  Norbert Bröker,et al.  A Projection Architecture for Dependency Grammar and How it Compares to LFG , 1998, ArXiv.

[15]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[16]  Maria Teresa Pazienza Information Extraction: Towards Scalable, Adaptable Systems , 1999 .

[17]  Roberto Basili,et al.  Lexicalizing a shallow parser , 1999 .

[18]  Jean-Pierre Chanod,et al.  Incremental Finite-State Parsing , 1997, ANLP.

[19]  Steven Abney,et al.  Part-of-Speech Tagging and Partial Parsing , 1997 .