暂无分享,去创建一个
Matteo Hessel | Fabio Viola | Hado van Hasselt | Aidan Clark | John Quan | Thomas Keck | Manuel Kroiss | Iurii Kemaev | M. Kroiss | Matteo Hessel | H. V. Hasselt | John Quan | Fabio Viola | Aidan Clark | T. Keck | Iurii Kemaev
[1] Demis Hassabis,et al. Mastering Atari, Go, chess and shogi by planning with a learned model , 2019, Nature.
[2] B. Speelpenning. Compiling Fast Partial Derivatives of Functions Given by Algorithms , 1980 .
[3] Marc'Aurelio Ranzato,et al. Large Scale Distributed Deep Networks , 2012, NIPS.
[4] K. Jarrod Millman,et al. Array programming with NumPy , 2020, Nat..
[5] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.
[6] Junhyuk Oh,et al. A Self-Tuning Actor-Critic Algorithm , 2020, NeurIPS.
[7] Junhyuk Oh,et al. Meta-Gradient Reinforcement Learning with an Objective Discovered Online , 2020, NeurIPS.
[8] Tom Schaul,et al. Rainbow: Combining Improvements in Deep Reinforcement Learning , 2017, AAAI.
[9] David Budden,et al. Distributed Prioritized Experience Replay , 2018, ICLR.
[10] Shane Legg,et al. Massively Parallel Methods for Deep Reinforcement Learning , 2015, ArXiv.
[11] Piotr Stanczyk,et al. SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference , 2020, ICLR.
[12] R. E. Wengert,et al. A simple automatic derivative evaluation program , 1964, Commun. ACM.
[13] Michael J. Flynn,et al. Some Computer Organizations and Their Effectiveness , 1972, IEEE Transactions on Computers.
[14] Martin Ward. Proving program refinements and transformations , 1986 .
[15] Demis Hassabis,et al. Mastering the game of Go with deep neural networks and tree search , 2016, Nature.
[16] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.
[17] Ludovít Molnár,et al. Analytical differentiation on a digital computer , 1970, Kybernetika.
[18] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[19] Kevin P. Murphy,et al. Machine learning - a probabilistic perspective , 2012, Adaptive computation and machine learning series.
[20] Marvin Minsky,et al. Steps toward Artificial Intelligence , 1995, Proceedings of the IRE.
[21] Junhyuk Oh,et al. Discovering Reinforcement Learning Algorithms , 2020, NeurIPS.
[22] John McCarthy,et al. Recursive functions of symbolic expressions and their computation by machine, Part I , 1960, Commun. ACM.
[23] Christopher M. Bishop,et al. Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .
[24] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.
[25] Robert M. Glorioso,et al. A micro controlled peripheral processor , 1973, MICRO 6.
[26] Victor Uc Cetina,et al. Reinforcement learning in continuous state and action spaces , 2009 .
[27] David A. Patterson,et al. In-datacenter performance analysis of a tensor processing unit , 2017, 2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA).
[28] Demis Hassabis,et al. Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm , 2017, ArXiv.
[29] Shane Legg,et al. IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures , 2018, ICML.
[30] Michael I. Jordan,et al. RLlib: Abstractions for Distributed Reinforcement Learning , 2017, ICML.
[31] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.