Can recursive neural tensor networks learn logical reasoning?

Recursive neural network models and their accompanying vector representations for words have seen success in an array of increasingly semantically sophisticated tasks, but almost nothing is known about their ability to accurately capture the aspects of linguistic meaning that are necessary for interpretation or reasoning. To evaluate this, I train a recursive model on a new corpus of constructed examples of logical reasoning over short sentences, such as inferring "some animal walks" from "some dog walks" or "some cat walks," given that dogs and cats are animals. The model learns representations that generalize well to new reasoning patterns in all but a few cases, a promising result for the ability of learned representation models to capture logical reasoning.
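To make the setup concrete, the sketch below shows one minimal way a recursive neural tensor network (RNTN) can compose word vectors along a binary parse tree and score the logical relation between a premise and a hypothesis. This is an illustrative reconstruction, not the paper's released code: the vector dimensionality, the tanh nonlinearity, the three-way label set, and the simple concatenate-and-softmax comparison step are all assumptions for the example.

```python
# Hypothetical sketch of an RNTN sentence encoder plus a relation classifier.
# All hyperparameters and the label set are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(0)
d = 16  # word/phrase vector dimensionality (assumed)

# Tensor, matrix, and bias parameters for the composition layer.
V = rng.normal(scale=0.1, size=(d, 2 * d, 2 * d))  # bilinear (tensor) term
W = rng.normal(scale=0.1, size=(d, 2 * d))         # linear term
b = np.zeros(d)

def compose(left, right):
    """RNTN composition: combine two child vectors into one parent vector."""
    c = np.concatenate([left, right])                 # (2d,)
    tensor_term = np.einsum("ijk,j,k->i", V, c, c)    # bilinear interaction
    return np.tanh(tensor_term + W @ c + b)

def encode(tree, embeddings):
    """Recursively encode a binary parse tree of word strings into a vector."""
    if isinstance(tree, str):
        return embeddings[tree]
    left, right = tree
    return compose(encode(left, embeddings), encode(right, embeddings))

# Softmax classifier over concatenated premise/hypothesis vectors
# (one simple way to compare two sentence representations; assumed here).
labels = ["entailment", "reverse entailment", "neutral"]
U = rng.normal(scale=0.1, size=(len(labels), 2 * d))

def classify(premise_tree, hypothesis_tree, embeddings):
    pair = np.concatenate([encode(premise_tree, embeddings),
                           encode(hypothesis_tree, embeddings)])
    scores = U @ pair
    probs = np.exp(scores - scores.max())
    return dict(zip(labels, probs / probs.sum()))

# Toy vocabulary and the example pair from the abstract.
vocab = ["some", "dog", "cat", "animal", "walks"]
embeddings = {w: rng.normal(scale=0.1, size=d) for w in vocab}
print(classify(("some", ("dog", "walks")),
               ("some", ("animal", "walks")), embeddings))
```

With random parameters the predicted distribution is meaningless; the point is only the data flow: word vectors are composed bottom-up into sentence vectors, and a small classifier on top is trained to recover the logical relation between the two sentences.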
