Efficient Graph Kernels for Textual Entailment Recognition

One of the most important research area in Natural Language Processing concerns the modeling of semantics expressed in text. Since foundational work in Natural Language Understanding has shown that a deep semantic approach is still not feasible, current research is focused on shallow methods combining linguistic models and machine learning techniques. The latter aim at learning semantic models, like those that can detect the entailment between the meaning of two text fragments, by means of training examples described by specific features. These are rather difficult to design since there is no linguistic model that can effectively encode the lexico-syntactic level of a sentence and its corresponding semantic models. Thus, the adopted solution consists in exhaustively describing training examples by means of all possible combinations of sentence words and syntactic information. The latter, typically expressed as parse trees of text fragments, is often encoded in the learning process using graph algorithms. In this paper, we propose a class of graphs, the tripartite directed acyclic graphs (tDAGs), which can be efficiently used to design algorithms for graph kernels for semantic natural language tasks involving sentence pairs. These model the matching between two pairs of syntactic trees in terms of all possible graph fragments. Interestingly, since tDAGs encode the association between identical or similar words (i.e. variables), it can be used to represent and learn first-order rules, i.e. rules describable by first-order logic. We prove that our matching function is a valid kernel and we empirically show that, although its evaluation is still exponential in the worst case, it is extremely efficient and more accurate than the previously proposed kernels.

[1]  Jie Wang,et al.  Average-case computational complexity theory , 1998 .

[2]  John A. Carroll,et al.  Applied morphological processing of English , 2001, Natural Language Engineering.

[3]  Alessandro Moschitti,et al.  Automatic Learning of Textual Entailments with Cross-Pair Similarities , 2006, ACL.

[4]  Hoa Trang Dang,et al.  Overview of DUC 2005 , 2005 .

[5]  Eugene Charniak,et al.  A Maximum-Entropy-Inspired Parser , 2000, ANLP.

[6]  Alessandro Moschitti,et al.  Fast and effective kernels for relational learning from texts , 2007, ICML '07.

[7]  F. A B I O M A S S I M O Z A N Z O T T O,et al.  A machine learning approach to textual entailment recognition , 2009 .

[8]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[9]  Alessandro Moschitti,et al.  Experimenting a "General Purpose" Textual Entailment Learner in AVE , 2006, CLEF.

[10]  M VoorheesEllen The TREC question answering track , 2001 .

[11]  Sanda M. Harabagiu,et al.  Methods for Using Textual Entailment in Open-Domain Question Answering , 2006, ACL.

[12]  Sebastian Riedel,et al.  The CoNLL 2007 Shared Task on Dependency Parsing , 2007, EMNLP.

[13]  David W. Conrath,et al.  Semantic Similarity Based on Corpus Statistics and Lexical Taxonomy , 1997, ROCLING/IJCLCLP.

[14]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[15]  Beatrice Santorini,et al.  Building a Large Annotated Corpus of English: The Penn Treebank , 1993, CL.

[16]  Jason Eisner,et al.  Learning Non-Isomorphic Tree Mappings for Machine Translation , 2003, ACL.

[17]  Peter Clark,et al.  The Seventh PASCAL Recognizing Textual Entailment Challenge , 2011, TAC.

[18]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[19]  M. Felisa Verdejo,et al.  Overview of the Answer Validation Exercise 2007 , 2007, CLEF.

[20]  Christopher D. Manning,et al.  Learning to distinguish valid textual entailments , 2006 .

[21]  Marc Moens,et al.  Seventh Message Understanding Conference (MUC-7) , 1998 .

[22]  Gennaro Chierchia,et al.  Meaning and grammar (2nd ed.): an introduction to semantics , 2000 .

[23]  Michael Collins,et al.  New Ranking Algorithms for Parsing and Tagging: Kernels over Discrete Structures, and the Voted Perceptron , 2002, ACL.

[24]  Lucien Tesnière Éléments de syntaxe structurale , 1959 .

[25]  Thomas Gärtner,et al.  A survey of kernels for structured data , 2003, SKDD.

[26]  Ido Dagan,et al.  PROBABILISTIC TEXTUAL ENTAILMENT: GENERIC APPLIED MODELING OF LANGUAGE VARIABILITY , 2004 .

[27]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[28]  Roy Bar-Haim,et al.  The Second PASCAL Recognising Textual Entailment Challenge , 2006 .

[29]  M. Aizerman,et al.  Theoretical Foundations of the Potential Function Method in Pattern Recognition Learning , 1964 .

[30]  Sanda M. Harabagiu,et al.  Satisfying information needs with multi-document summaries , 2007, Inf. Process. Manag..

[31]  Jan Ramon,et al.  Expressivity versus efficiency of graph kernels , 2003 .

[32]  Jun Suzuki,et al.  Hierarchical Directed Acyclic Graph Kernel: Methods for Structured Natural Language Data , 2003, ACL.

[33]  Dekang Lin,et al.  DIRT – Discovery of Inference Rules from Text , 2001 .

[34]  Christopher D. Manning,et al.  Robust Textual Inference using Diverse Knowledge Sources , 2005 .

[35]  Rada Mihalcea,et al.  Measuring the Semantic Similarity of Texts , 2005, EMSEE@ACL.

[36]  John D. Lafferty,et al.  A Robust Parsing Algorithm for Link Grammars , 1995, IWPT.

[37]  Michael Collins,et al.  Head-Driven Statistical Models for Natural Language Parsing , 2003, CL.

[38]  Andrew Hickl,et al.  Recognizing Textual Entailment with LCC’s G ROUNDHOG System , 2005 .

[39]  Fabio Massimo Zanzotto,et al.  Discovering Asymmetric Entailment Relations between Verbs Using Selectional Preferences , 2006, ACL.

[40]  Patrick Pantel,et al.  Espresso: Leveraging Generic Patterns for Automatically Harvesting Semantic Relations , 2006, ACL.

[41]  Daniel Gildea,et al.  Automatic Labeling of Semantic Roles , 2000, ACL.

[42]  Andrew Y. Ng,et al.  Robust Textual Inference via Graph Matching , 2005, HLT.

[43]  Bob Carpenter,et al.  The logic of typed feature structures , 1992 .

[44]  Patrick Pantel,et al.  DIRT @SBT@discovery of inference rules from text , 2001, KDD '01.

[45]  Ido Dagan,et al.  The Third PASCAL Recognizing Textual Entailment Challenge , 2007, ACL-PASCAL@ACL.

[46]  Günter Neumann,et al.  Recognizing Textual Entailment Using a Subsequence Kernel Method , 2007, AAAI.

[47]  J. Köbler,et al.  The Graph Isomorphism Problem: Its Structural Complexity , 1993 .

[48]  Roberto Basili,et al.  Tree Kernels for Semantic Role Labeling , 2008, CL.

[49]  Alessandro Moschitti,et al.  A Study on Convolution Kernels for Shallow Statistic Parsing , 2004, ACL.

[50]  Ted Pedersen,et al.  WordNet::Similarity - Measuring the Relatedness of Concepts , 2004, NAACL.

[51]  Gennaro Chierchia,et al.  Meaning and Grammar: An Introduction to Semantics , 1990 .