Textual Entailment with Structured Attentions and Composition

Deep learning techniques are increasingly popular in the textual entailment task, overcoming the fragility of traditional discrete models with hard alignments and logic. In particular, the recently proposed attention models (Rocktäschel et al., 2015; Wang and Jiang, 2015) achieve state-of-the-art accuracy by computing soft word alignments between the premise and hypothesis sentences. However, a major limitation remains: this line of work completely ignores syntax and recursion, which have proven helpful in many traditional approaches. We show that it is beneficial to extend the attention model to tree nodes between the premise and the hypothesis. More importantly, this subtree-level attention reveals information about entailment relations. We study the recursive composition of these subtree-level entailment relations, which can be viewed as a soft version of the Natural Logic framework (MacCartney and Manning, 2009). Experiments show that our structured attention and entailment composition model can correctly identify and infer entailment relations from the bottom up, and bring significant improvements in accuracy.
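To make the high-level idea concrete, below is a minimal sketch of subtree-level soft attention followed by bottom-up composition of entailment-relation vectors. This is not the authors' implementation: the dot-product attention, the tanh relation/composition layers, and all names (attend, node_relation, compose, W_r, W_c) are illustrative assumptions, and the example uses random toy vectors in place of learned tree representations.

```python
# Illustrative sketch only (hypothetical names, not the paper's architecture):
# each hypothesis tree node soft-aligns against all premise tree nodes, a
# relation vector is computed per node, and relations are composed bottom-up.
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def attend(h_node, premise_nodes):
    """Soft-align one hypothesis node against all premise (sub)tree nodes."""
    scores = premise_nodes @ h_node          # dot-product attention scores
    alpha = softmax(scores)                  # attention weights over premise nodes
    return alpha @ premise_nodes             # attended premise representation

def node_relation(h_node, context, W_r):
    """Map (hypothesis node, attended premise context) to a relation vector."""
    return np.tanh(W_r @ np.concatenate([h_node, context]))

def compose(left_rel, right_rel, W_c):
    """Compose the children's relation vectors at an internal node
    (a soft analogue of joining Natural Logic relations)."""
    return np.tanh(W_c @ np.concatenate([left_rel, right_rel]))

# Toy example: premise with 3 subtree nodes, hypothesis with 2 leaves + 1 root.
rng = np.random.default_rng(0)
d = 4
premise_nodes = rng.normal(size=(3, d))
hyp_left, hyp_right = rng.normal(size=d), rng.normal(size=d)
W_r = rng.normal(size=(d, 2 * d))
W_c = rng.normal(size=(d, 2 * d))

rel_left = node_relation(hyp_left, attend(hyp_left, premise_nodes), W_r)
rel_right = node_relation(hyp_right, attend(hyp_right, premise_nodes), W_r)
rel_root = compose(rel_left, rel_right, W_c)  # bottom-up entailment composition
print(rel_root)
```

In an actual model the node vectors would come from tree-structured encoders (e.g., Tree-LSTMs) and the root relation vector would feed a classifier over entailment labels; the sketch only shows the attention-then-compose data flow.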

[1] Wojciech Zaremba, et al. Recurrent Neural Network Regularization, 2014, arXiv.

[2] Jeffrey Pennington, et al. GloVe: Global Vectors for Word Representation, 2014, EMNLP.

[3] Shuohang Wang, et al. Learning Natural Language Inference with LSTM, 2015, NAACL.

[4] Christopher Potts, et al. Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank, 2013, EMNLP.

[5] Yoshua Bengio, et al. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention, 2015, ICML.

[6] Phil Blunsom, et al. Reasoning about Entailment with Neural Attention, 2015, ICLR.

[7] Christopher D. Manning, et al. Probabilistic Tree-Edit Models with Structured Latent Variables for Textual Entailment and Question Answering, 2010, COLING.

[8] Christopher D. Manning, et al. An extended model of natural logic, 2009, IWCS.

[9] Christopher Potts, et al. A large annotated corpus for learning natural language inference, 2015, EMNLP.

[10] Brendan J. Frey, et al. Learning Wake-Sleep Recurrent Attention Models, 2015, NIPS.

[11] Gholamreza Haffari, et al. Incorporating Structural Alignment Biases into an Attentional Neural Translation Model, 2016, NAACL.

[12] Ido Dagan, et al. The Third PASCAL Recognizing Textual Entailment Challenge, 2007, ACL-PASCAL@ACL.

[13] Yotaro Watanabe, et al. A Latent Discriminative Model for Compositional Entailment Relation Recognition using Natural Logic, 2012, COLING.

[14] M. Pennacchiotti, et al. A machine learning approach to textual entailment recognition, 2009, Natural Language Engineering.

[15] Yusuke Miyao, et al. Logical Inference on Dependency-based Compositional Semantics, 2014, ACL.

[16] Jürgen Schmidhuber, et al. Long Short-Term Memory, 1997, Neural Computation.

[17] J. Quinonero Candela, et al. Machine Learning Challenges. Evaluating Predictive Uncertainty, Visual Object Classification, and Recognising Tectual Entailment, 2006, Lecture Notes in Computer Science.

[18] Stephen Pulman, et al. Using the Framework, 1996.

[19] Jimmy Ba, et al. Adam: A Method for Stochastic Optimization, 2014, ICLR.

[20] Alex Graves, et al. Recurrent Models of Visual Attention, 2014, NIPS.

[21] Yoshua Bengio, et al. Neural Machine Translation by Jointly Learning to Align and Translate, 2014, ICLR.

[22] Mirella Lapata, et al. Long Short-Term Memory-Networks for Machine Reading, 2016, EMNLP.

[23] Christopher D. Manning, et al. Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks, 2015, ACL.

[24] Alessandro Moschitti, et al. Structural Representations for Learning Relations between Pairs of Texts, 2015, ACL.