Lexicalizing a shallow parser

Current NL parsers are expected to run at a throughput that satisfies the "time constraints" of real applications. The aim of the present work is twofold: on the one hand, to investigate the effects of lexical information in a shallow parsing environment; on the other hand, to study the limits of a bootstrapping architecture that, by automatically learning the lexical information in an unsupervised fashion, guarantees the reliability and portability of the parser across different domains. The parser under investigation is Chaos (Chunk analysis oriented system), a robust parser based on stratification and lexicalization. A large-scale evaluation over a standard treebank is discussed.
