论文信息 - Deep reinforcement learning with relational inductive biases

Deep reinforcement learning with relational inductive biases

We introduce an approach for augmenting model-free deep reinforcement learning agents with a mechanism for relational reasoning over structured representations, which improves performance, learning efficiency, generalization, and interpretability. Our architecture encodes an image as a set of vectors, and applies an iterative message-passing procedure to discover and reason about relevant entities and relations in a scene. In six of seven StarCraft II Learning Environment mini-games, our agent achieved state-of-the-art performance, and surpassed human grandmasterlevel on four. In a novel navigation and planning task, our agent’s performance and learning efficiency far exceeded non-relational baselines, it was able to generalize to more complex scenes than it had experienced during training. Moreover, when we examined its learned internal representations, they reflected important structure about the problem and the agent’s intentions. The main contribution of this work is to introduce techniques for representing and reasoning about states in model-free deep reinforcement learning agents via relational inductive biases. Our experiments show this approach can offer advantages in efficiency, generalization, and interpretability, and can scale up to meet some of the most challenging test environments in modern artificial intelligence.

[1] Pushmeet Kohli,et al. Learning Continuous Semantic Representations of Symbolic Expressions , 2016, ICML.

[2] Quoc V. Le,et al. Neural Programmer: Inducing Latent Programs with Gradient Descent , 2015, ICLR.

[3] David L. Dill,et al. Learning a SAT Solver from Single-Bit Supervision , 2018, ICLR.

[4] Jian Sun,et al. Identity Mappings in Deep Residual Networks , 2016, ECCV.

[5] Razvan Pascanu,et al. Imagination-Augmented Agents for Deep Reinforcement Learning , 2017, NIPS.

[6] Xinlei Chen,et al. Iterative Visual Reasoning Beyond Convolutions , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[7] Raia Hadsell,et al. Graph networks as learnable physics engines for inference and control , 2018, ICML.

[8] Yedid Hoshen,et al. VAIN: Attentional Multi-agent Predictive Modeling , 2017, NIPS.

[9] Richard Evans,et al. Can Neural Networks Understand Logical Entailment? , 2018, ICLR.

[10] Joshua B. Tenenbaum,et al. Building machines that learn and think like people , 2016, Behavioral and Brain Sciences.

[11] Le Song,et al. 2 Common Formulation for Greedy Algorithms on Graphs , 2018 .

[12] Abhinav Gupta,et al. Non-local Neural Networks , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[13] Lihong Li,et al. Neuro-Symbolic Program Synthesis , 2016, ICLR.

[14] Max Jaderberg,et al. Population Based Training of Neural Networks , 2017, ArXiv.

[15] Chen Liang,et al. Neural Symbolic Machines: Learning Semantic Parsers on Freebase with Weak Supervision , 2016, ACL.

[16] Razvan Pascanu,et al. Discovering objects and their relations from entangled scene representations , 2017, ICLR.

[17] Danfei Xu,et al. Scene Graph Generation by Iterative Message Passing , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18] Razvan Pascanu,et al. Metacontrol for Adaptive Imagination-Based Optimization , 2017, ICLR.

[19] Razvan Pascanu,et al. Learning model-based planning from scratch , 2017, ArXiv.

[20] David Budden,et al. Distributed Prioritized Experience Replay , 2018, ICLR.

[21] Kurt Driessens,et al. Relational Reinforcement Learning , 1998, Machine-mediated learning.

[22] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.

[23] Andre Cohen,et al. An object-oriented representation for efficient reinforcement learning , 2008, ICML '08.

[24] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.

[25] Samy Bengio,et al. A Study on Overfitting in Deep Reinforcement Learning , 2018, ArXiv.

[26] Silvio Savarese,et al. Neural Task Graphs: Generalizing to Unseen Tasks From a Single Video Demonstration , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[27] R. Zemel,et al. Neural Relational Inference for Interacting Systems , 2018, ICML.

[28] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.

[29] Carl Doersch,et al. Learning Visual Question Answering by Bootstrapping Hard Attention , 2018, ECCV.