A Lexicalized Tree Adjoining Grammar for English

This document describes a sizable grammar of English written in the TAG formalism and implemented for use with the XTAG system. This report and the grammar described herein supersedes the TAG grammar described in [XTAG-Group, 1995]. The English grammar described in this report is based on the TAG formalism developed in [Joshi et al., 1975], which has been extended to include lexicalization ([Schabes et al., 1988]), and unification-based feature structures ([Vijay-Shanker and Joshi, 1991]). The range of syntactic phenomena that can be handled is large and includes auxiliaries (including inversion), copula, raising and small clause constructions, topicalization, relative clauses, infinitives, gerunds, passives, adjuncts, it-clefts, wh-clefts, PRO constructions, noun-noun modifications, extraposition, determiner sequences, genitives, negation, noun-verb contractions, sentential adjuncts and imperatives. This technical report corresponds to the XTAG Release 8/31/98. The XTAG grammar is continuously updated with the addition of new analyses and modification of old ones, and an online version of this report can be found at the XTAG web page: http://www.cis.upenn.edu/~xtag/. Acknowledgements We are immensely grateful to Aravind Joshi for supporting this project. The following people have contributed to the development of grammars in the project: Anne Abeille, Jason Baldridge, Rajesh Bhatt, Kathleen Bishop, Raman Chandrasekar, Sharon Cote, Beatrice Daille, Christine Doran, Dania Egedi, Tim Farrington, Jason Frank, Caroline Heycock, Beth Ann Hockey, Roumyana Izvorski, Karin Kipper, Daniel Karp, Seth Kulick, Young-Suk Lee, Heather Matayek, Patrick Martin, Megan Moser, Sabine Petillon, Rashmi Prasad, Laura Siegel, Yves Schabes, Victoria Tredinnick and Raffaella Zanuttini. The XTAG system has been developed by: Tilman Becker, Richard Billington, Andrew Chalnick, Dania Egedi, Devtosh Khare, Albert Lee, David Magerman, Alex Mallet, Patrick Paroubek, Rich Pito, Gilles Prigent, Carlos Prolo, Anoop Sarkar, Yves Schabes, William Schuler, B. Srinivas, Fei Xia, Yuji Yoshiie and Martin Zaidel. We would also like to thank Michael Hegarty, Lauri Karttunen, Anthony Kroch, Mitchell Marcus, Martha Palmer, Owen Rambow, Philip Resnik, Beatrice Santorini and Mark Steedman. In addition, Jeff Aaronson, Douglas DeCarlo, Mark-Jason Dominus, Mark Foster, Gaylord Holder, David Magerman, Ken Noble, Steven Shapiro and Ira Winston have provided technical support. Adminstrative support was provided by Susan Deysher, Carolyn Elken, Jodi Kerper, Christine Sandy and Trisha Yannuzzi. This work was partially supported by NSF Grant SBR8920230 and ARO Grant DAAH040494-G-0426. Part I General Information

[1]  Srinivas Bangalore,et al.  Lexicalization and Grammar Development , 1994, ArXiv.

[2]  R. Larson On the double object construction , 1988 .

[3]  Aravind K. Joshi,et al.  Tree Adjunct Grammars , 1975, J. Comput. Syst. Sci..

[4]  Aravind K. Joshi,et al.  Coordination in Tree Adjoining Grammars: Formalization and Implementation , 1996, COLING.

[5]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[6]  John Knowles,et al.  The cleft sentence A base-generated perspective☆ , 1986 .

[7]  Howard Lasnik,et al.  On the nature of proper government , 1990 .

[8]  Rosanna Sornicola,et al.  It-Clefts and Wh-clefts: two awkward sentence types , 1988, Journal of Linguistics.

[9]  Ronald M. Kaplan,et al.  Lexical Functional Grammar A Formal System for Grammatical Representation , 2004 .

[10]  Verzekeren Naar Sparen,et al.  Cambridge , 1969, Humphrey Burton: In My Own Time.

[11]  Juan Uriagereka,et al.  A Course in GB Syntax: Lectures on Binding and Empty Categories , 1988 .

[12]  H. Grosser Chicago , 1906 .

[13]  Lorna Balkan,et al.  TSNLP - Test Suites for Natural Language Processing , 1996, COLING.

[14]  Hans-Georg Obenauer ON THE IDENTIFICATION OF EMPTY CATEGORIES , 1985 .

[15]  R. Jackendoff On Larson's treatment of the double object construction , 1990 .

[16]  James Rogers,et al.  OBTAINING TREES FROM THEIR DESCRIPTIONS: AN APPLICATION TO TREE‐ADJOINING GRAMMARS , 1994, Comput. Intell..

[17]  B. Partee,et al.  Mathematical Methods in Linguistics , 1990 .

[18]  Aravind K. Joshi,et al.  Parsing Strategies with ‘Lexicalized’ Grammars: Application to Tree Adjoining Grammars , 1988, COLING.

[19]  Patrick Paroubek,et al.  XTAG - A Graphical Workbench for Developing Tree-Adjoining Grammars , 1992, ANLP.

[20]  R. Larson Double Objects Revisited: Reply to Jackendoff , 1990 .

[21]  Ivan A. Sag,et al.  Book Reviews: Head-driven Phrase Structure Grammar and German in Head-driven Phrase-structure Grammar , 1996, CL.

[22]  Steven G. Lapointe,et al.  A Lexical Analysis of the English Auxiliary Verb System , 1980 .