Validating Meronymy Hypotheses with Support Vector Machines and Graph Kernels

There is a substantial body of work on the extraction of relations from texts, most of which is based on pattern matching or on applying tree kernel functions to syntactic structures. Whereas pattern application is usually more efficient, tree kernels can be superior when assessed by the F-measure. In this paper, we introduce a hybrid approach to extracting meronymy relations, which is based on both patterns and kernel functions. In a first step, meronymy relation hypotheses are extracted from a text corpus by applying patterns. In a second step these relation hypotheses are validated by using several shallow features and a graph kernel approach. In contrast to other meronymy extraction and validation methods which are based on surface or syntactic representations we use a purely semantic approach based on semantic networks. This involves analyzing each sentence of the Wikipedia corpus by a deep syntactico-semantic parser and converting it into a semantic network. Meronymy relation hypotheses are extracted from the semantic networks by means of an automated theorem prover, which employs a set of logical axioms and patterns in the form of semantic networks. The meronymy candidates are then validated by means of a graph kernel approach based on common walks. The evaluation shows that this method achieves considerably higher accuracy, recall, and F-measure than a method using purely shallow validation.

[1]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[2]  Aron Culotta,et al.  Dependency Tree Kernels for Relation Extraction , 2004, ACL.

[3]  Thomas Gärtner,et al.  On Graph Kernels: Hardness Results and Efficient Alternatives , 2003, COLT.

[4]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[5]  Christiane Fellbaum,et al.  Combining Local Context and Wordnet Similarity for Word Sense Identification , 1998 .

[6]  Dan I. Moldovan,et al.  Automatic Discovery of Part-Whole Relations , 2006, CL.

[7]  Christiane Fellbaum,et al.  Lexical Chains as Representations of Context for the Detection and Correction of Malapropisms , 1998 .

[8]  Razvan C. Bunescu,et al.  A Shortest Path Dependency Kernel for Relation Extraction , 2005, HLT.

[9]  Ralph Grishman,et al.  Extracting Relations with Integrated Information Using Kernel Methods , 2005, ACL.

[10]  Jean-Philippe Tarel,et al.  Non-Mercer Kernels for SVM Object Recognition , 2004, BMVC.

[11]  David Haussler,et al.  Convolution kernels on discrete structures , 1999 .

[12]  Fintan J. Costello UCD-FC: Deducing semantic relations using WordNet senses that occur frequently in a database of noun-noun compounds , 2007, SemEval@ACL.

[13]  Sven Hartrumpf,et al.  Hybrid disambiguation in natural language analysis , 2003 .

[14]  Graeme Hirst,et al.  Semantic distance in WordNet: An experimental, application-oriented evaluation of five measures , 2004 .

[15]  Patrick Pantel,et al.  Espresso: Leveraging Generic Patterns for Automatically Harvesting Semantic Relations , 2006, ACL.

[16]  Karsten M. Borgwardt,et al.  Fast subtree kernels on graphs , 2009, NIPS.

[17]  Dekang Lin,et al.  An Information-Theoretic Definition of Similarity , 1998, ICML.

[18]  Hans-Peter Kriegel,et al.  Shortest-path kernels on graphs , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[19]  David W. Conrath,et al.  Semantic Similarity Based on Corpus Statistics and Lexical Taxonomy , 1997, ROCLING/IJCLCLP.

[20]  Hermann Helbig,et al.  Knowledge Representation and the Semantics of Natural Language , 2005, Cognitive Technologies.

[21]  Bernard Haasdonk,et al.  Feature space interpretation of SVMs with indefinite kernels , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Tim vor der Brück,et al.  Hypernymy Extraction Using a Semantic Network Representation , 2010, Int. J. Comput. Linguistics Appl..

[23]  Gerhard Paass,et al.  Dependency Tree Kernels for Relation Extraction from Natural Language Text , 2009, ECML/PKDD.