论文信息 - The PlayStation Reinforcement Learning Environment (PSXLE)

The PlayStation Reinforcement Learning Environment (PSXLE)

We propose a new benchmark environment for evaluating Reinforcement Learning (RL) algorithms: the PlayStation Learning Environment (PSXLE), a PlayStation emulator modified to expose a simple control API that enables rich game-state representations. We argue that the PlayStation serves as a suitable progression for agent evaluation and propose a framework for such an evaluation. We build an action-driven abstraction for a PlayStation game with support for the OpenAI Gym interface and demonstrate its use by running OpenAI Baselines.

Petar Velickovic | Catalina Cangea | Carlos Purves

[1] David Silver,et al. Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.

[2] Tom Schaul,et al. Prioritized Experience Replay , 2015, ICLR.

[3] Marc G. Bellemare,et al. The Arcade Learning Environment: An Evaluation Platform for General Agents , 2012, J. Artif. Intell. Res..

[4] Zhihong Zeng,et al. Audio–Visual Affective Expression Recognition Through Multistream Fused HMM , 2008, IEEE Transactions on Multimedia.

[5] Fabien Moutarde,et al. Deep Reinforcement Learning for autonomous driving , 2019 .

[6] Weinan Zhang,et al. Real-Time Bidding with Multi-Agent Reinforcement Learning in Display Advertising , 2018, CIKM.

[7] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.

[8] Sergey Levine,et al. End-to-End Training of Deep Visuomotor Policies , 2015, J. Mach. Learn. Res..

[9] Tara N. Sainath,et al. Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups , 2012, IEEE Signal Processing Magazine.

[10] T. Urbanik,et al. Reinforcement learning-based multi-agent system for network traffic signal control , 2010 .

[11] Mohan S. Kankanhalli,et al. Unsupervised classification of music genre using hidden Markov model , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[12] Tom Schaul,et al. Dueling Network Architectures for Deep Reinforcement Learning , 2015, ICML.

[13] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.

[14] Alec Radford,et al. Proximal Policy Optimization Algorithms , 2017, ArXiv.