论文信息 - Spatio-Temporal Attention Deep Recurrent Q-Network for POMDPs

Spatio-Temporal Attention Deep Recurrent Q-Network for POMDPs

One of the long-standing challenges for reinforcement learning agents is to deal with noisy environments. Although progress has been made in producing an agent capable of optimizing its environment in fully observable conditions, partial observability still remains a difficult task. In this paper, a novel model is proposed which inspired by human perception, utilizes two fundamental machine learning concepts, attention and memory, to better confront a noisy environment.

David Mulvaney | Pawel Ladosz | Mariano Etchart

[1] Pascal Poupart,et al. On Improving Deep Reinforcement Learning for POMDPs , 2017, ArXiv.

[2] Wenjun Zeng,et al. Spatio-Temporal Attention-Based LSTM Networks for 3D Action Recognition and Detection , 2018, IEEE Transactions on Image Processing.

[3] Mikhail Pavlov,et al. Deep Attention Recurrent Q-Network , 2015, ArXiv.

[4] Shimon Whiteson,et al. Deep Variational Reinforcement Learning for POMDPs , 2018, ICML.

[5] Joelle Pineau,et al. Online Planning Algorithms for POMDPs , 2008, J. Artif. Intell. Res..

[6] J. Juola,et al. Are spatial and temporal attention independent? , 2007, Perception & psychophysics.

[7] Heng Tao Shen,et al. Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning , 2017, IJCAI.

[8] Yoshua Bengio,et al. Show, Attend and Tell: Neural Image Caption Generation with Visual Attention , 2015, ICML.

[9] Yunchao Wei,et al. STA: Spatial-Temporal Attention for Large-Scale Video-based Person Re-Identification , 2018, AAAI.

[10] Anna C. Nobre,et al. Synergistic Effect of Combined Temporal and Spatial Expectations on Visual Attention , 2005, The Journal of Neuroscience.

[11] Christopher D. Manning,et al. Effective Approaches to Attention-based Neural Machine Translation , 2015, EMNLP.