论文信息 - Distributed Deep Reinforcement Learning using TensorFlow

Distributed Deep Reinforcement Learning using TensorFlow

Deep Reinforcement Learning is the combination of Reinforcement Learning algorithms with Deep neural network, which had recent success in learning complicated unknown environments. The trained model is a Convolutional Neural Network trained using Q-Learning Loss value. The agent takes in observation, i.e. raw pixel image and reward from the environment for each step as input. The deep Q-learning algorithm gives out the optimal action for every observation and reward pair. The hyperparameters of Deep Q-Network remain unchanged for any environment. TensorFIow, an open source machine learning and numerical computation library is used to implement the deep Q-Learning algorithm on GPU. The distributed TensorFIow architecture is used to maximize the hardware resource utilization and reduce the training time. The usage of Graphics Processing Unit (GPU) in the distributed environment accelerated the training of deep Q-network. On implementing the deep Q-learning algorithm for many environments from OpenAI Gym, the agent outperforms a decent human reference player with few days of training.

T Praveena | P Ajay Rao | B Navaneesh Kumar | Siddharth Cadabam

[1] Yuan Yu,et al. TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[2] Santosha K. Dwivedy,et al. Reinforcement Learning via Recurrent Convolutional Neural Networks , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[3] Bram Bakker,et al. Reinforcement Learning with Long Short-Term Memory , 2001, NIPS.

[4] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.

[5] David Silver,et al. Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.

[6] David Silver,et al. Deep Reinforcement Learning from Self-Play in Imperfect-Information Games , 2016, ArXiv.

[7] Marc'Aurelio Ranzato,et al. Large Scale Distributed Deep Networks , 2012, NIPS.

[8] Wojciech Zaremba,et al. OpenAI Gym , 2016, ArXiv.

[9] Alex Graves,et al. Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[10] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[11] Chrisantha Fernando,et al. PathNet: Evolution Channels Gradient Descent in Super Neural Networks , 2017, ArXiv.

[12] Marc G. Bellemare,et al. The Arcade Learning Environment: An Evaluation Platform for General Agents , 2012, J. Artif. Intell. Res..