论文信息 - Survey of Reasoning using Neural networks

Survey of Reasoning using Neural networks

Reason and inference require process as well as memory skills by humans. Neural networks are able to process tasks like image recognition (better than humans) but in memory aspects are still limited (by attention mechanism, size). Recurrent Neural Network (RNN) and it's modified version LSTM are able to solve small memory contexts, but as context becomes larger than a threshold, it is difficult to use them. The Solution is to use large external memory. Still, it poses many challenges like, how to train neural networks for discrete memory representation, how to describe long term dependencies in sequential data etc. Most prominent neural architectures for such tasks are Memory networks: inference components combined with long term memory and Neural Turing Machines: neural networks using external memory resources. Also, additional techniques like attention mechanism, end to end gradient descent on discrete memory representation are needed to support these solutions. Preliminary results of above neural architectures on simple algorithms (sorting, copying) and Question Answering (based on story, dialogs) application are comparable with the state of the art. In this paper, I explain these architectures (in general), the additional techniques used and the results of their application.

Amit Sahu

[1] Jason Weston,et al. Towards AI-Complete Question Answering: A Set of Prerequisite Toy Tasks , 2015, ICLR.

[2] Robert F. Hadley. The Problem of Rapid Variable Creation , 2009, Neural Computation.

[3] Jason Weston,et al. Memory Networks , 2014, ICLR.

[4] Alex Graves,et al. Neural Turing Machines , 2014, ArXiv.

[5] Oren Etzioni,et al. Paraphrase-Driven Learning for Open Question Answering , 2013, ACL.

[6] Jürgen Schmidhuber,et al. Long Short-Term Memory , 1997, Neural Computation.

[7] Wojciech Zaremba,et al. Learning to Execute , 2014, ArXiv.

[8] Hava T. Siegelmann,et al. On the Computational Power of Neural Nets , 1995, J. Comput. Syst. Sci..

[9] H. Sebastian Seung,et al. Continuous attractors and oculomotor control , 1998, Neural Networks.

[10] Jason Weston,et al. End-To-End Memory Networks , 2015, NIPS.

[11] Lukás Burget,et al. Recurrent neural network based language model , 2010, INTERSPEECH.

[12] Jason Weston,et al. Towards Understanding Situated Natural Language , 2010, AISTATS.

[13] Jason Weston,et al. Open Question Answering with Weakly Supervised Embedding Models , 2014, ECML/PKDD.

[14] Alex Graves,et al. Supervised Sequence Labelling with Recurrent Neural Networks , 2012, Studies in Computational Intelligence.

[15] John J. Hopfield,et al. Neural networks and physical systems with emergent collective computational abilities , 1999 .