暂无分享,去创建一个
[1] David Silver,et al. Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.
[2] Tom Schaul,et al. Prioritized Experience Replay , 2015, ICLR.
[3] Marc G. Bellemare,et al. The Arcade Learning Environment: An Evaluation Platform for General Agents , 2012, J. Artif. Intell. Res..
[4] Zhihong Zeng,et al. Audio–Visual Affective Expression Recognition Through Multistream Fused HMM , 2008, IEEE Transactions on Multimedia.
[5] Fabien Moutarde,et al. Deep Reinforcement Learning for autonomous driving , 2019 .
[6] Weinan Zhang,et al. Real-Time Bidding with Multi-Agent Reinforcement Learning in Display Advertising , 2018, CIKM.
[7] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.
[8] Sergey Levine,et al. End-to-End Training of Deep Visuomotor Policies , 2015, J. Mach. Learn. Res..
[9] Tara N. Sainath,et al. Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups , 2012, IEEE Signal Processing Magazine.
[10] T. Urbanik,et al. Reinforcement learning-based multi-agent system for network traffic signal control , 2010 .
[11] Mohan S. Kankanhalli,et al. Unsupervised classification of music genre using hidden Markov model , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).
[12] Tom Schaul,et al. Dueling Network Architectures for Deep Reinforcement Learning , 2015, ICML.
[13] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[14] Alec Radford,et al. Proximal Policy Optimization Algorithms , 2017, ArXiv.