Which Are the Best Features for Automatic Verb Classification

In this work, we develop and evaluate a wide range of feature spaces for deriving Levin-style verb classifications (Levin, 1993). We perform the classification experiments using Bayesian Multinomial Regression (an efficient log-linear modeling framework that we found to outperform SVMs for this task) with the proposed feature spaces. Our experiments suggest that subcategorization frames are not the most effective features for automatic verb classification; a mixture of syntactic and lexical information works best for this task.
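As a rough illustration of the setup described above, the following sketch trains a multinomial logistic regression classifier on verb feature vectors that mix syntactic and lexical information. It is a minimal sketch under stated assumptions, not the authors' implementation: the Gaussian prior (fitted by MAP, which amounts to an L2 penalty), the plain gradient-descent optimizer, the feature names, and the toy values are all hypothetical choices made for illustration.

```python
import math

# Hypothetical feature space: two subcategorization-frame (syntactic) features
# and two lexical co-occurrence features, each a relative frequency per verb.
FEATURES = ["SCF:NP", "SCF:NP-PP", "LEX:obj=door", "LEX:obj=money"]

def softmax(scores):
    """Numerically stable softmax over a list of class scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def class_scores(W, x):
    """Linear score of feature vector x under each class's weight row."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]

def train_map_softmax(X, y, n_classes, prior_var=1.0, lr=0.5, epochs=500):
    """MAP weights for multinomial logistic regression under a zero-mean
    Gaussian prior (an assumption here), found by batch gradient descent."""
    n_feats = len(X[0])
    W = [[0.0] * n_feats for _ in range(n_classes)]
    for _ in range(epochs):
        # Gradient of the negative log-prior: w / prior_var.
        grad = [[w / prior_var for w in row] for row in W]
        # Gradient of the negative log-likelihood: (p_k - 1[k == label]) * x.
        for x, label in zip(X, y):
            probs = softmax(class_scores(W, x))
            for k in range(n_classes):
                err = probs[k] - (1.0 if k == label else 0.0)
                for j, v in enumerate(x):
                    grad[k][j] += err * v
        for k in range(n_classes):
            for j in range(n_feats):
                W[k][j] -= lr * grad[k][j] / len(X)
    return W

def predict(W, x):
    """Index of the highest-scoring class for feature vector x."""
    scores = class_scores(W, x)
    return max(range(len(W)), key=lambda k: scores[k])

# Toy data (invented): four verbs, two hypothetical verb classes.
X = [
    [0.7, 0.3, 0.6, 0.0],
    [0.6, 0.4, 0.5, 0.1],
    [0.3, 0.7, 0.0, 0.6],
    [0.4, 0.6, 0.1, 0.5],
]
y = [0, 0, 1, 1]

W = train_map_softmax(X, y, n_classes=2)
print([predict(W, x) for x in X])
```

Concatenating syntactic and lexical features into one vector, as done here, is only one way to form a mixed feature space; the prior variance controls how strongly the weights are shrunk toward zero.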

[1]  Sabine Schulte im Walde.  Clustering Verbs Semantically According to their Alternation Behaviour , 2000, COLING.

[2]  Curt Burgess,et al.  Modelling Parsing Constraints with High-dimensional Context Space , 1997 .

[3]  Chih-Jen Lin,et al.  A Practical Guide to Support Vector Classification , 2008 .

[4]  David Madigan,et al.  Large-Scale Bayesian Logistic Regression for Text Categorization , 2007, Technometrics.

[5]  Dmitriy Fradkin,et al.  Bayesian Multinomial Logistic Regression for Author Identification , 2005, AIP Conference Proceedings.

[6]  James R. Curran,et al.  Formalism-Independent Parser Evaluation with CCG and DepBank , 2007, ACL.

[7]  Min-Yen Kan,et al.  Role of Verbs in Document Analysis , 1998, ACL.

[8]  Daniel Gildea,et al.  The Proposition Bank: An Annotated Corpus of Semantic Roles , 2005, CL.

[9]  Suzanne Stevenson,et al.  A General Feature Space for Automatic Verb Classification , 2003, EACL.

[10]  Nizar Habash,et al.  Hybrid Natural Language Generation from Lexical Conceptual Structures , 2003, Machine Translation.

[11]  Mats Rooth,et al.  Valence Induction with a Head-Lexicalized PCFG , 1998, EMNLP.

[12]  Ted Briscoe,et al.  Extended Lexical-Semantic Classification of English Verbs , 2004, HLT-NAACL 2004.

[13]  David R. Dowty.  Thematic proto-roles and argument selection , 1991 .

[14]  Ted Briscoe,et al.  Automatic Extraction of Subcategorization from Corpora , 1997, ANLP.

[15]  Yuval Krymolowski,et al.  Clustering Polysemic Subcategorization Frame Distributions Semantically , 2003, ACL.

[16]  Douglas L. T. Rohde.  An Improved Method for Deriving Word Meaning from Lexical Co-Occurrence , 2004 .

[17]  Ted Briscoe,et al.  The Derivation of a Grammatically Indexed Lexicon from the Longman Dictionary of Contemporary English , 1987, ACL.

[18]  Beth Levin,et al.  English Verb Classes and Alternations: A Preliminary Investigation , 1993 .

[19]  Mary D. Swift,et al.  Towards Automatic Verb Acquisition from VerbNet for Spoken Dialog Processing , 2005 .

[20]  Sabine Schulte im Walde.  Experiments on the Automatic Induction of German Semantic Verb Classes , 2006, CL.

[21]  Christopher D. Manning,et al.  Probabilistic Syntax , 2002 .

[22]  Steven Abney.  Parsing By Chunks , 1991 .

[23]  Suzanne Stevenson,et al.  A Multilingual Paradigm for Automatic Verb Classification , 2002, ACL.

[24]  Sabine Schulte im Walde.  Experiments on the Choice of Features for Learning Verb Classes , 2003, EACL.

[25]  P. Tichý.  Constructions , 1986, Philosophy of Science.

[26]  Dekang Lin,et al.  Automatic Retrieval and Clustering of Similar Words , 1998, ACL.

[27]  Nigel Collier,et al.  Automatic Classification of Verbs in Biomedical Texts , 2006, ACL.

[28]  Suzanne Stevenson,et al.  Crosslinguistic Transfer in Automatic Verb Classification , 2002, COLING.

[29]  Martha Palmer,et al.  Investigating Regular Sense Extensions Based on Intersective Levin Classes , 1998, COLING-ACL.

[30]  Suzanne Stevenson,et al.  Unsupervised Semantic Role Labelling , 2004, EMNLP.

[31]  Paola Merlo,et al.  Automatic verb classification based on statistical distributions of argument structure , 2001 .

[32]  Lei Shi,et al.  Putting Pieces Together: Combining FrameNet, VerbNet and WordNet for Robust Semantic Parsing , 2005, CICLing.

[33]  John A. Carroll,et al.  Applied morphological processing of English , 2001, Natural Language Engineering.

[34]  J. Lowe,et al.  A Frame-Semantic Approach to Semantic Annotation , 1997 .

[35]  Josef Ruppenhofer,et al.  FrameNet's Frames vs. Levin's Verb Classes , 2002 .

[36]  S. Pinker Learnability and Cognition: The Acquisition of Argument Structure , 1989 .

[37]  John B. Lowe,et al.  The Berkeley FrameNet Project , 1998, ACL.

[38]  Chris Brew,et al.  Inducing German Semantic Verb Classes from Purely Syntactic Subcategorisation Information , 2002, ACL.

[39]  Yuji Matsumoto,et al.  Detecting the Organization of Semantic Subclasses of Japanese Verbs , 1997 .

[40]  Martha Palmer,et al.  Class-Based Construction of a Verb Lexicon , 2000, AAAI/IAAI.

[41]  Eric Joanis,et al.  Automatic Verb Classification Using a General Feature Space , 2002 .

[42]  Ralph Grishman,et al.  Comlex Syntax: Building a Computational Lexicon , 1994, COLING.

[43]  Neville Ryant,et al.  A Large-Scale Extension of VerbNet with Novel Verb Classes , 2006 .

[44]  Mirella Lapata,et al.  Dependency-Based Construction of Semantic Space Models , 2007, CL.

[45]  Helmut Schmid,et al.  LoPar: Design and Implementation , 2000 .

[46]  Michael R. Brent,et al.  From Grammar to Lexicon: Unsupervised Learning of Lexical Syntax , 1993, Comput. Linguistics.

[47]  Jianguo Li.  Disambiguating Levin Verbs Using Untagged Data , 2007 .

[48]  George A. Miller,et al.  Using Corpus Statistics and WordNet Relations for Sense Identification , 1998, CL.

[49]  Suzanne Stevenson,et al.  Automatic Verb Classification Based on Statistical Distributions of Argument Structure , 2001, CL.

[50]  Bonnie J. Dorr,et al.  Large-Scale Dictionary Construction for Foreign Language Tutoring and Interlingual Machine Translation , 1998, Machine Translation.

[51]  Zeno Vendler,et al.  Verbs and Times , 1957, The Language of Time - A Reader.

[52]  James Pustejovsky,et al.  The Generative Lexicon , 1995, CL.

[53]  Georgia M. Green,et al.  Semantics and Syntactic Regularity , 1974 .

[54]  Mirella Lapata,et al.  Verb Class Disambiguation Using Informative Priors , 2004, CL.

[55]  Daniel Jurafsky,et al.  Automatic Labeling of Semantic Roles , 2002, CL.

[56]  Chris Brew,et al.  Spectral Clustering for German Verbs , 2002, EMNLP.