Supertagging: An Approach to Almost Parsing

In this paper, we have proposed novel methods for robust parsing that integrate the flexibility of linguistically motivated lexical descriptions with the robustness of statistical techniques. Our thesis is that the computation of linguistic structure can be localized if lexical items are associated with rich descriptions (supertags) that impose complex constraints in a local context. The supertags are designed such that only those elements on which the lexical item imposes constraints appear within a given supertag. Further, each lexical item is associated with as many supertags as the number of different syntactic contexts in which the lexical item can appear. This makes the number of different descriptions for each lexical item much larger than when the descriptions are less complex, thus increasing the local ambiguity for a parser. But this local ambiguity can be resolved by using statistical distributions of supertag co-occurrences collected from a corpus of parses. We have explored these ideas in the context of the Lexicalized Tree-Adjoining Grammar (LTAG) framework. The supertags in LTAG combine both phrase structure information and dependency information in a single representation. Supertag disambiguation results in a representation that is effectively a parse (an almost parse), and the parser need "only" combine the individual supertags. This method of parsing can also be used to parse sentence fragments such as in spoken utterances where the disambiguated supertag sequence may not combine into a single structure.

[1]  John D. Lafferty,et al.  Decision Tree Parsing using a Hidden Derivation Model , 1994, HLT.

[2]  Thomas Niesler,et al.  A variable-length category-based n-gram language model , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[3]  Hiyan Alshawi,et al.  Training and Scaling Preference Functions for Disambiguation , 1994, Comput. Linguistics.

[4]  Beth Ann Hockey,et al.  XTAG System - A Wide Coverage Grammar for English , 1994, COLING.

[5]  Raman Chandrasekar,et al.  Motivations and Methods for Text Simplification , 1996, COLING.

[6]  Gregory Grefenstette,et al.  Regular expressions for language engineering , 1996, Natural Language Engineering.

[7]  Aravind K. Joshi,et al.  Tree-Adjoining Grammars , 1997, Handbook of Formal Languages.

[8]  Yves Schabes,et al.  Parsing the Wall Street Journal with the Inside-Outside Algorithm , 1993, EACL.

[9]  Aravind K. Joshi,et al.  Unification-Based Tree Adjoining Grammars , 1991 .

[10]  Raman Chandrasekar,et al.  Automatic induction of rules for text simplification , 1997, Knowl. Based Syst..

[11]  Patrick Henry Winston,et al.  The psychology of computer vision , 1976, Pattern Recognit..

[12]  Aravind K. Joshi,et al.  Parsing with Lexicalized Tree Adjoining Grammar , 1991 .

[13]  Maurice Gross,et al.  Lexicon-Grammar and the Syntactic Analysis of French , 1984, ACL.

[14]  SchwartzRichard,et al.  Coping with ambiguity and unknown words through probabilistic models , 1993 .

[15]  Srinivas Bangalore,et al.  Performance Evaluation of Supertagging for Partial Parsing , 2000 .

[16]  Kenneth Ward Church A Stochastic Parts Program and Noun Phrase Parser for Unrestricted Text , 1988, ANLP.

[17]  Srinivas Bangalore,et al.  Complexity of lexical descriptions and its relevance to partial parsing , 1997 .

[18]  Eric Brill,et al.  Automatic Grammar Induction and Parsing Free Text: A Transformation-Based Approach , 1993, ACL.

[19]  Steven Abney Rapid Incremental Parsing with Repair , 1990 .

[20]  Douglas E. Appelt,et al.  FASTUS: A Cascaded Finite-State Transducer for Extracting Information from Natural-Language Text , 1997, ArXiv.

[21]  Daniel Dominic Sleator,et al.  Parsing English with a Link Grammar , 1995, IWPT.

[22]  Mark Steedman,et al.  Combinatory grammars and parasitic gaps , 1987 .

[23]  R. F. Brown,et al.  PERFORMANCE EVALUATION , 2019, ISO 22301:2019 and business continuity management – Understand how to plan, implement and enhance a business continuity management system (BCMS).

[24]  B. Srinivas "Almost parsing" technique for language modeling , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[25]  John Cocke,et al.  Probabilistic Parsing Method for Sentence Disambiguation , 1989, IWPT.

[26]  Raman Chandrasekar,et al.  Gleaning Information from the Web: Using Syntax to Filter Out Irrelevant Information , 1996 .

[27]  I. Good THE POPULATION FREQUENCIES OF SPECIES AND THE ESTIMATION OF POPULATION PARAMETERS , 1953 .

[28]  Michael Collins,et al.  A New Statistical Parser Based on Bigram Lexical Dependencies , 1996, ACL.

[29]  Hermann Ney,et al.  On the Estimation of 'Small' Probabilities by Leaving-One-Out , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[30]  Eugene Charniak,et al.  Statistical Parsing with a Context-Free Grammar and Word Statistics , 1997, AAAI/IAAI.

[31]  Ivan A. Sag,et al.  Information-based syntax and semantics , 1987 .

[32]  Breck Baldwin,et al.  Mother of PERL: A Multi-tier Pattern Description Language , 1996 .

[33]  Emmanuel Roche Analyse syntaxique transformationnelle du francais par transducteurs et lexique-grammaire , 1993 .

[34]  Aravind K. Joshi,et al.  An Introduction to Tree Adjoining Grammar , 1987 .

[35]  Aravind K. Joshi Role of Constrained Computational Systems in Natural Language Processing , 1998, Artif. Intell..

[36]  John D. Lafferty,et al.  Towards History-based Grammars: Using Richer Models for Probabilistic Parsing , 1993, ACL.

[37]  R. Bod,et al.  A Probabilistic Corpus-Driven Model for Lexical-Functional Analysis , COLING.

[38]  Srinivas Bangalore,et al.  The Institute For Research In Cognitive Science Disambiguation of Super Parts of Speech ( or Supertags ) : Almost Parsing by Aravind , 1995 .

[39]  Srinivas Bangalore Using Supertags in Document Filtering: the Eeect of Increased Context on Information Retrieval Eeectiveness , 1997 .

[40]  Aravind K. Joshi,et al.  Mathematical and computational aspects of lexicalized grammars , 1990 .

[41]  R. L. Bradshaw,et al.  RESULTS AND ANALYSIS. , 1971 .

[42]  André Kempe Probabilistic Tagging With Feature Structures , 1994, COLING.

[43]  XTAG Research Group,et al.  A Lexicalized Tree Adjoining Grammar for English , 1998, ArXiv.

[44]  Geoffrey K. Pullum,et al.  Generalized Phrase Structure Grammar , 1985 .

[45]  Raman Chandrasekar,et al.  Using Syntactic Information in Document Filtering: A Comparative Study of Part-of-speech Tagging and Supertagging , 1997, RIAO.

[46]  Rens Bod,et al.  A Probabilistic Corpus-Driven Model for Lexical-Functional Analysis , 1998, ACL.

[47]  Douglas E. Appelt,et al.  FASTUS: A Finite-state Processor for Information Extraction from Real-world Text , 1993, IJCAI.

[48]  David L. Waltz,et al.  Understanding Line drawings of Scenes with Shadows , 1975 .

[49]  Aravind K. Joshi,et al.  Parsing Strategies with ‘Lexicalized’ Grammars: Application to Tree Adjoining Grammars , 1988, COLING.

[50]  Anthony S. Kroch,et al.  The Linguistic Relevance of Tree Adjoining Grammar , 1985 .

[51]  Anne Abeillé,et al.  A Lexicalized Tree Adjoining Grammar for English , 1990 .

[52]  David M. Magerman Statistical Decision-Tree Models for Parsing , 1995, ACL.

[53]  Seth Kulick,et al.  Heuristics and Parse Ranking , 1995, IWPT.

[54]  Douglas E. Appelt,et al.  SRI International FASTUS SystemMUC-6 Test Results and Analysis , 1995, MUC.

[55]  Aravind K. Joshi,et al.  A parser from antiquity , 1996, Nat. Lang. Eng..

[56]  Bernd Kiefer,et al.  Compilation of HPSG to TAG , 1995, ACL.

[57]  Slava M. Katz,et al.  Estimation of probabilities from sparse data for the language model component of a speech recognizer , 1987, IEEE Trans. Acoust. Speech Signal Process..

[58]  Richard M. Schwartz,et al.  Coping with Ambiguity and Unknown Words through Probabilistic Models , 1993, CL.

[59]  Aravind K. Joshi,et al.  Natural language parsing: Tree adjoining grammars: How much context-sensitivity is required to provide reasonable structural descriptions? , 1985 .

[60]  Timo Järvinen,et al.  Syntactic Analysis Of Natural Language Using Linguistic Rules And Corpus-Based Patterns , 1994, COLING.

[61]  cationR. Chandrasekar Automatic Induction of Rules for Text Simpli , 1997 .

[62]  Aravind K. Joshi,et al.  A study of tree adjoining grammars , 1987 .

[63]  Aravind K. Joshi,et al.  Tree Adjunct Grammars , 1975, J. Comput. Syst. Sci..

[64]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[65]  Atro Voutilainen,et al.  A language-independent system for parsing unrestricted text , 1995 .