Neural Random Access Machines

Abstract: In this paper, we propose and investigate a new neural network architecture called the Neural Random Access Machine. It can manipulate and dereference pointers to an external, variable-size random-access memory. The model is trained purely from input-output examples using backpropagation. We evaluate the new model on a number of simple algorithmic tasks whose solutions require pointer manipulation and dereferencing. Our results show that the proposed model can learn to solve such algorithmic tasks and is capable of operating on simple data structures like linked lists and binary trees. For the easier tasks, the learned solutions generalize to sequences of arbitrary length. Moreover, under certain assumptions, memory access during inference can be done in constant time.
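
Since the abstract's central mechanism is dereferencing pointers into an external memory in a way that backpropagation can train, the sketch below illustrates one common way to make such access differentiable: treating a "pointer" as a probability distribution over addresses, so reads and writes become smooth expectations. This is a minimal illustration under that assumption, not the paper's implementation; the helper names `soft_read` and `soft_write` are hypothetical.

```python
import numpy as np

def soft_read(pointer, memory):
    """Differentiable dereference: return the expected memory content
    under the pointer distribution."""
    # pointer: shape (n,), non-negative and summing to 1
    # memory:  shape (n, d), one row per addressable cell
    return pointer @ memory

def soft_write(pointer, value, memory):
    """Differentiable store: blend each cell between its old content and
    the new value, weighted by the pointer mass falling on that cell."""
    p = pointer[:, None]                         # shape (n, 1) for broadcasting
    return (1.0 - p) * memory + p * value[None, :]

# Toy usage: 4 memory cells, each storing a distribution over 4 integer values.
memory = np.eye(4)                               # cell i holds the value i (one-hot)
ptr = np.array([0.0, 1.0, 0.0, 0.0])             # a sharp pointer to cell 1
print(soft_read(ptr, memory))                    # -> the value stored at cell 1
memory = soft_write(ptr, np.eye(4)[3], memory)   # overwrite cell 1 with the value 3
```

With sharp (one-hot) pointers this reduces to ordinary array indexing, which is why a learned solution that converges to sharp pointers can execute memory accesses in constant time at inference.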
