An Initial Attempt of Combining Visual Selective Attention with Deep Reinforcement Learning

Visual attention serves as a feature selection mechanism in the perceptual system. Motivated by Broadbent's leaky filter model of selective attention, we evaluate how such a mechanism could be implemented and how it affects the learning process of deep reinforcement learning. We visualize and analyze the feature maps of DQN on the toy problem Catch, and propose an approach to combining visual selective attention with deep reinforcement learning, sketched below. We experiment with optical flow-based attention and A2C on Atari games. Experimental results show that visual selective attention can improve sample efficiency on the tested games. We also observe an intriguing relation between attention and batch normalization.
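
As a rough illustration of how optical flow-based attention could serve as a preprocessing step for the agent's observations, the sketch below uses OpenCV's Farnebäck dense optical flow to turn per-pixel motion magnitude into a soft attention mask, then attenuates unattended regions in the spirit of Broadbent's leaky filter. The mask-construction details (blurring, normalization, the `leak` parameter) are illustrative assumptions, not the paper's exact recipe.

```python
import cv2
import numpy as np

def flow_attention_mask(prev_frame, curr_frame, blur_ksize=15, eps=1e-8):
    """Build a soft attention mask from dense optical flow between two frames.

    `prev_frame` and `curr_frame` are HxW uint8 grayscale images. Returns an
    HxW float32 mask in [0, 1] that is large where motion is large.
    """
    # Farneback two-frame dense optical flow (polynomial expansion).
    flow = cv2.calcOpticalFlowFarneback(
        prev_frame, curr_frame, None,
        pyr_scale=0.5, levels=3, winsize=15,
        iterations=3, poly_n=5, poly_sigma=1.2, flags=0)
    magnitude = np.linalg.norm(flow, axis=2)  # per-pixel motion magnitude
    # Smooth so the mask covers objects rather than isolated moving pixels
    # (an assumption about the preprocessing, not a detail from the paper).
    magnitude = cv2.GaussianBlur(magnitude, (blur_ksize, blur_ksize), 0)
    return (magnitude / (magnitude.max() + eps)).astype(np.float32)

def apply_leaky_attention(frame, mask, leak=0.25):
    """Attenuate rather than zero out unattended pixels, echoing Broadbent's
    leaky filter: attended regions pass at full strength, the rest is scaled
    down by `leak`. `leak` is a hypothetical hyperparameter."""
    weights = leak + (1.0 - leak) * mask
    return (frame.astype(np.float32) * weights).astype(np.uint8)
```

Under this setup, the masked frames would replace the raw observations in the usual frame stack fed to the DQN or A2C convolutional encoder, so the network architecture itself is left unchanged.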
