Recurrent Independent Mechanisms

Learning modular structures that reflect the dynamics of the environment can lead to better generalization and to robustness against changes that affect only a few of the underlying causes. We propose Recurrent Independent Mechanisms (RIMs), a new recurrent architecture in which multiple groups of recurrent cells operate with nearly independent transition dynamics, communicate only sparingly through the bottleneck of attention, and are updated only at the time steps where they are most relevant. We show that this leads to specialization among the RIMs, which in turn allows for dramatically improved generalization on tasks where some factors of variation differ systematically between training and evaluation.
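To make the architecture concrete, here is a minimal sketch of one RIMs step in PyTorch. It is not the authors' implementation: the class name `RIMsCell`, the parameters `n_rims` and `k_active`, and the use of a zero vector as the "null" input are assumptions made for illustration, and details such as multi-head attention and the exact gating are simplified.

```python
# A minimal, illustrative RIMs-style recurrent cell (not the reference
# implementation). Each mechanism keeps its own GRU dynamics; an input
# attention step selects which mechanisms are active at this time step,
# and a communication attention step lets active mechanisms read from
# the others.
import torch
import torch.nn as nn


class RIMsCell(nn.Module):
    def __init__(self, input_size, hidden_size, n_rims=6, k_active=4):
        super().__init__()
        self.n_rims = n_rims
        self.k_active = k_active
        self.hidden_size = hidden_size
        # Each mechanism has its own, independent transition dynamics.
        self.cells = nn.ModuleList(
            [nn.GRUCell(input_size, hidden_size) for _ in range(n_rims)]
        )
        # Input attention: each RIM's state queries the (input, null) pair.
        self.q_inp = nn.Linear(hidden_size, hidden_size)
        self.k_inp = nn.Linear(input_size, hidden_size)
        # Communication attention between RIM states.
        self.q_com = nn.Linear(hidden_size, hidden_size)
        self.k_com = nn.Linear(hidden_size, hidden_size)
        self.v_com = nn.Linear(hidden_size, hidden_size)

    def forward(self, x, h):
        # x: (batch, input_size); h: (batch, n_rims, hidden_size)
        batch = x.size(0)
        # Candidates: the real input and a zero "null" input (an assumption
        # here; it stands in for "this RIM ignores the input at this step").
        null = torch.zeros_like(x)
        cand = torch.stack([x, null], dim=1)               # (batch, 2, input)
        keys = self.k_inp(cand)                            # (batch, 2, hidden)
        queries = self.q_inp(h)                            # (batch, n_rims, hidden)
        scores = queries @ keys.transpose(1, 2) / self.hidden_size ** 0.5
        attn = scores.softmax(dim=-1)                      # (batch, n_rims, 2)
        # RIMs attending most to the real input (least to null) are activated.
        _, top = attn[:, :, 0].topk(self.k_active, dim=1)  # (batch, k_active)
        mask = torch.zeros(batch, self.n_rims, device=x.device)
        mask.scatter_(1, top, 1.0)                         # 1.0 for active RIMs

        # Independent transitions: every cell computes a candidate update,
        # but only active RIMs adopt it; the rest keep their previous state.
        h_new = torch.stack(
            [self.cells[i](x, h[:, i]) for i in range(self.n_rims)], dim=1
        )
        h = mask.unsqueeze(-1) * h_new + (1 - mask.unsqueeze(-1)) * h

        # Sparse communication: active RIMs read from all RIM states via
        # attention; inactive RIMs are left untouched.
        q, k, v = self.q_com(h), self.k_com(h), self.v_com(h)
        com = (q @ k.transpose(1, 2) / self.hidden_size ** 0.5).softmax(-1) @ v
        return h + mask.unsqueeze(-1) * com


# Usage: unroll over a short sequence (shapes are illustrative).
cell = RIMsCell(input_size=10, hidden_size=32)
h = torch.zeros(8, cell.n_rims, 32)            # batch of 8, one state per RIM
for x in torch.randn(5, 8, 10):                # 5 time steps
    h = cell(x, h)
```

The hard top-k selection above is the simplest way to realize "updated only at the most relevant time steps"; a faithful implementation would also need care around how gradients flow through the selection.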
