Towards Playing a 3D First-Person Shooter Game Using a Classification Deep Neural Network Architecture

In this work, we present a single network architecture that solves both a supervised learning problem, the classification of handwritten images, and a reinforcement learning problem, playing a complex 3D first-person shooter game. We used a Deep Neural Network model to solve both problems. For classification, the network was trained with softmax regression and a cross-entropy loss. To play the game, the autonomous agent was trained with a Q-Learning adaptation for Deep Learning. In both cases, the only input was the raw pixels of an image. We show that this single network architecture is suitable for the classification task and is also capable of playing the 3D game. This result offers insight into the possibility of a general network architecture capable of solving any kind of problem, regardless of the learning paradigm.
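As a minimal sketch of the idea (assuming a PyTorch implementation with illustrative layer sizes and hyperparameters, none of which are specified in the abstract), the code below shows how one pixel-to-output network can be trained under both paradigms: with a softmax/cross-entropy loss when the outputs are class logits, and with a one-step Q-learning temporal-difference loss (using a separate target network, as is common in DQN-style training) when the outputs are per-action Q-values.

```python
# Minimal sketch; PyTorch, the layer sizes, and the 84x84 input assumption
# are illustrative choices, not the authors' reported configuration.
import torch
import torch.nn as nn
import torch.nn.functional as F


class PixelNet(nn.Module):
    """A single CNN mapping raw image pixels to n_outputs scores.

    The same architecture serves both tasks: the outputs are class logits
    (softmax + cross-entropy) for classification, or per-action Q-values
    (TD loss) for the game-playing agent.
    """

    def __init__(self, in_channels: int, n_outputs: int):
        super().__init__()
        self.conv1 = nn.Conv2d(in_channels, 32, kernel_size=8, stride=4)
        self.conv2 = nn.Conv2d(32, 64, kernel_size=4, stride=2)
        self.fc1 = nn.Linear(64 * 9 * 9, 256)  # assumes 84x84 input frames
        self.fc2 = nn.Linear(256, n_outputs)

    def forward(self, x):
        x = F.relu(self.conv1(x))
        x = F.relu(self.conv2(x))
        x = F.relu(self.fc1(x.flatten(start_dim=1)))
        return self.fc2(x)


# Supervised use: softmax regression via cross-entropy on the class logits.
def classification_loss(net, images, labels):
    logits = net(images)                    # (batch, n_classes)
    return F.cross_entropy(logits, labels)  # log-softmax + negative log-likelihood


# Reinforcement-learning use: one-step Q-learning target on the same outputs.
def q_learning_loss(net, target_net, s, a, r, s_next, done, gamma=0.99):
    q_sa = net(s).gather(1, a.unsqueeze(1)).squeeze(1)  # Q(s, a) for taken actions
    with torch.no_grad():
        q_next = target_net(s_next).max(dim=1).values   # max_a' Q(s', a')
        target = r + gamma * (1.0 - done) * q_next      # bootstrap unless terminal
    return F.mse_loss(q_sa, target)
```

In this sketch, only the training signal changes between the two problems; the network that consumes the pixels stays identical, which is the point the abstract makes about a shared architecture across learning paradigms.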
