Classification of "Inheritance" Relations: a Semi-automatic Approach

This study describes a semi-automatic approach to the classification of "inheritance" relations between morphologically related predicates. Predicates such as verbs and nouns that subcategorise for a subclause are automatically extracted from text corpora and classified according to their subcategorisation properties. For this purpose, we develop a semi-automatic, knowledge-rich extraction and classification architecture. We also aim to compare the subcategorisation properties of morphologically related predicates, i.e. verbs and deverbal nouns. In this work, we concentrate exclusively on predicates with sentential complements in German, namely dass-, ob- and w-clauses (that-, if- and wh-clauses), although our methods can be applied to other complement types as well.
