Making Tree Kernels Practical for Natural Language Learning

In recent years tree kernels have been proposed for the automatic learning of natural language applications. Unfortunately, they show (a) an inherent super linear complexity and (b) a lower accuracy than traditional attribute/value methods. In this paper, we show that tree kernels are very helpful in the processing of natural language as (a) we provide a simple algorithm to compute tree kernels in linear average running time and (b) our study on the classification properties of diverse tree kernels show that kernel combinations always improve the traditional methods. Experiments with Support Vector Machines on the predicate argument classification task provide empirical support to our thesis.

[1]  Dell Zhang,et al.  Question classification using support vector machines , 2003, SIGIR.

[2]  Dan Roth,et al.  On Kernel Methods for Relational Learning , 2003, ICML.

[3]  Daniel Jurafsky,et al.  Automatic Labeling of Semantic Roles , 2002, CL.

[4]  Nello Cristianini,et al.  Classification using String Kernels , 2000 .

[5]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[6]  Miriam R. L. Petruck FRAME SEMANTICS , 1996 .

[7]  Michael Collins,et al.  Three Generative, Lexicalised Models for Statistical Parsing , 1997, ACL.

[8]  Michael Collins,et al.  New Ranking Algorithms for Parsing and Tagging: Kernels over Discrete Structures, and the Voted Perceptron , 2002, ACL.

[9]  Aravind K. Joshi,et al.  Using LTAG Based Features in Parse Reranking , 2003, EMNLP.

[10]  Ben Taskar,et al.  Max-Margin Parsing , 2004, EMNLP.

[11]  Alessandro Moschitti,et al.  A Study on Convolution Kernels for Shallow Statistic Parsing , 2004, ACL.

[12]  Aron Culotta,et al.  Dependency Tree Kernels for Relation Extraction , 2004, ACL.

[13]  Thorsten Joachims,et al.  Making large scale SVM learning practical , 1998 .

[14]  Daniel Jurafsky,et al.  Support Vector Learning for Semantic Argument Classification , 2005, Machine Learning.

[15]  Daniel Gildea,et al.  The Necessity of Parsing for Predicate Argument Recognition , 2002, ACL.

[16]  Kentaro Torisawa,et al.  Speeding up Training with Tree Kernels for Node Relation Labeling , 2005, HLT.

[17]  Dmitry Zelenko,et al.  Kernel Methods for Relation Extraction , 2002, J. Mach. Learn. Res..

[18]  Martha Palmer,et al.  From TreeBank to PropBank , 2002, LREC.