[1] Hado van Hasselt, et al. Double Q-learning, 2010, NIPS.
[2] Nahum Shimkin, et al. Averaged-DQN: Variance Reduction and Stabilization for Deep Reinforcement Learning, 2016, ICML.
[3] Zhuoran Yang, et al. A Theoretical Analysis of Deep Q-Learning, 2019, L4DC.
[4] Richard S. Sutton, et al. Reinforcement Learning: An Introduction, 1998, IEEE Trans. Neural Networks.
[5] Nitish Srivastava, et al. Improving neural networks by preventing co-adaptation of feature detectors, 2012, ArXiv.
[6] Shane Legg, et al. Human-level control through deep reinforcement learning, 2015, Nature.
[7] Zoubin Ghahramani, et al. Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning, 2015, ICML.
[8] Yishay Mansour, et al. Policy Gradient Methods for Reinforcement Learning with Function Approximation, 1999, NIPS.
[9] Alex Kendall, et al. Concrete Dropout, 2017, NIPS.
[10] Geoffrey E. Hinton, et al. ImageNet classification with deep convolutional neural networks, 2012, Commun. ACM.
[11] Yang Liu, et al. Stochastic Variance Reduction for Deep Q-learning, 2019, AAMAS.
[12] Sebastian Thrun, et al. Issues in Using Function Approximation for Reinforcement Learning, 1999.
[13] Christopher D. Manning, et al. Fast dropout training, 2013, ICML.
[14] Tom Schaul, et al. Prioritized Experience Replay, 2015, ICLR.
[15] Yang Liu, et al. Learning to Play in a Day: Faster Deep Reinforcement Learning by Optimality Tightening, 2016, ICLR.
[16] R. Bellman. A Markovian Decision Process, 1957.
[17] John N. Tsitsiklis, et al. Analysis of temporal-difference learning with function approximation, 1996, NIPS.
[18] Tom Schaul, et al. Unifying Count-Based Exploration and Intrinsic Motivation, 2016, NIPS.
[19] Tom Schaul, et al. Dueling Network Architectures for Deep Reinforcement Learning, 2015, ICML.
[20] Graham W. Taylor, et al. Improved Regularization of Convolutional Neural Networks with Cutout, 2017, ArXiv.
[21] Jimmy Ba, et al. Adam: A Method for Stochastic Optimization, 2014, ICLR.
[22] Mahesan Niranjan, et al. On-line Q-learning using connectionist systems, 1994.
[23] David Silver, et al. Deep Reinforcement Learning with Double Q-Learning, 2015, AAAI.
[24] Xiaodong Gu, et al. Towards dropout training for convolutional neural networks, 2015, Neural Networks.