Decoding multitask DQN in the world of Minecraft

Multitask networks that can play multiple Atari games at expert level have been successfully trained using supervised learning from several single task Deep Q Networks (DQN). However, such networks are not be able to exploit the high level similarity between games or learn common representations of game states. In fact, learned representations were shown to be separable given the game. In our work, we show that with sufficient similarity between tasks, we can train a multitask extension of DQN (MDQN) which shares representations across tasks without loss of performance. To this end, we construct a novel set of tasks with shared characteristics in Minecraft, a complex 3D world, and are able to demonstrate meaningful representation sharing between different related tasks. Sharing representations for similar tasks has paramount importance for transfer learning and lifelong learning. We envision results of this work as a stepping stone to novel lifelong learning approaches.

[1]  Geoffrey E. Hinton,et al.  On the importance of initialization and momentum in deep learning , 2013, ICML.

[2]  Marc G. Bellemare,et al.  The Arcade Learning Environment: An Evaluation Platform for General Agents , 2012, J. Artif. Intell. Res..

[3]  Razvan Pascanu,et al.  Policy Distillation , 2015, ICLR.

[4]  Honglak Lee,et al.  Control of Memory, Active Perception, and Action in Minecraft , 2016, ICML.

[5]  Shie Mannor,et al.  A Deep Hierarchical Approach to Lifelong Learning in Minecraft , 2016, AAAI.

[6]  Katja Hofmann,et al.  The Malmo Platform for Artificial Intelligence Experimentation , 2016, IJCAI.

[7]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[8]  Shie Mannor,et al.  Graying the black box: Understanding DQNs , 2016, ICML.

[9]  Ruslan Salakhutdinov,et al.  Actor-Mimic: Deep Multitask and Transfer Reinforcement Learning , 2015, ICLR.

[10]  Tom Schaul,et al.  Dueling Network Architectures for Deep Reinforcement Learning , 2015, ICML.

[11]  Peter Stone,et al.  Transfer Learning for Reinforcement Learning Domains: A Survey , 2009, J. Mach. Learn. Res..

[12]  Hod Lipson,et al.  Convergent Learning: Do different neural networks learn the same representations? , 2015, FE@NIPS.

[13]  Andrew Zisserman,et al.  Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps , 2013, ICLR.

[14]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[15]  Kenta Oono,et al.  Chainer : a Next-Generation Open Source Framework for Deep Learning , 2015 .