Efficient Linearization of Tree Kernel Functions

The combination of Support Vector Machines with very high dimensional kernels, such as string or tree kernels, suffers from two major drawbacks: first, the implicit representation of the feature space does not allow us to understand which features actually triggered the generalization; second, the resulting computational burden may in some cases make it infeasible to train on large data sets. We propose an approach based on feature space reverse engineering to tackle both problems. Our experiments with Tree Kernels on a Semantic Role Labeling data set show that the proposed approach can drastically reduce the computational footprint while leaving accuracy almost unaffected.
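The core idea of linearization can be sketched as follows: instead of evaluating a tree kernel implicitly over pairs of trees, map each tree to an explicit sparse vector indexed by tree fragments, so the kernel value becomes an ordinary dot product that a linear classifier can use directly. The sketch below is illustrative only: it uses the plain subtree kernel (full subtrees rooted at each node) rather than the paper's mined-fragment feature space, and all names are hypothetical.

```python
# Hedged sketch: explicit (linearized) feature map for the plain
# subtree kernel. Trees are nested tuples, e.g. ("NP", "John");
# leaves are plain strings.
from collections import Counter

def phi(tree):
    """Explicit feature map: count every full subtree of `tree`."""
    feats = Counter()

    def encode(t):
        if isinstance(t, str):          # leaf node
            feats[t] += 1
            return t
        # bracketed string encoding of the full subtree rooted here
        enc = "(" + t[0] + " " + " ".join(encode(c) for c in t[1:]) + ")"
        feats[enc] += 1                 # one feature per internal node
        return enc

    encode(tree)
    return feats

def linear_kernel(t1, t2):
    """Dot product in the explicit fragment space."""
    f1, f2 = phi(t1), phi(t2)
    return sum(v * f2[k] for k, v in f1.items())

t1 = ("S", ("NP", "John"), ("VP", "runs"))
t2 = ("S", ("NP", "Mary"), ("VP", "runs"))
print(linear_kernel(t1, t2))  # -> 2: shared fragments "runs" and "(VP runs)"
```

Once the fragment space is made explicit like this, irrelevant fragments can be pruned and the classifier trained as a fast linear model, which is the computational saving the abstract refers to.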
