[1] Aleksandra Faust,et al. Learning Navigation Behaviors End-to-End With AutoRL , 2018, IEEE Robotics and Automation Letters.
[2] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[3] Jasper Snoek,et al. Practical Bayesian Optimization of Machine Learning Algorithms , 2012, NIPS.
[4] Dario Floreano,et al. Neuroevolution: from architectures to learning , 2008, Evol. Intell.
[5] Philip Bachman,et al. Deep Reinforcement Learning that Matters , 2017, AAAI.
[6] Tom Schaul,et al. Prioritized Experience Replay , 2015, ICLR.
[7] Sae-Young Chung,et al. Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update , 2018, NeurIPS.
[8] Kagan Tumer,et al. Collaborative Evolutionary Reinforcement Learning , 2019, ICML.
[9] Pietro Liò,et al. Proximal Distilled Evolutionary Reinforcement Learning , 2019, AAAI.
[10] Peter Henderson,et al. Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous Control , 2017, ArXiv.
[11] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[12] Yuval Tassa,et al. MuJoCo: A physics engine for model-based control , 2012, IEEE/RSJ International Conference on Intelligent Robots and Systems.
[13] Michel Tokic,et al. Adaptive epsilon-Greedy Exploration in Reinforcement Learning Based on Value Difference , 2010, KI.
[14] Günther Palm,et al. Value-Difference Based Exploration: Adaptive Control between Epsilon-Greedy and Softmax , 2011, KI.
[15] Alex Graves,et al. Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.
[16] Shimon Whiteson,et al. Fast Efficient Hyperparameter Tuning for Policy Gradient Methods , 2019, NeurIPS.
[17] David Budden,et al. Distributed Prioritized Experience Replay , 2018, ICLR.
[18] Herke van Hoof,et al. Addressing Function Approximation Error in Actor-Critic Methods , 2018, ICML.
[19] Kenneth O. Stanley,et al. Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning , 2017, ArXiv.
[20] Brian J. Ross,et al. A Lamarckian Evolution Strategy for Genetic Algorithms , 1998, Practical Handbook of Genetic Algorithms.
[21] David E. Goldberg,et al. Genetic Algorithms, Tournament Selection, and the Effects of Noise , 1995, Complex Syst.
[22] Elliot Meyerson,et al. Evolving Deep Neural Networks , 2017, Artificial Intelligence in the Age of Neural Networks and Brain Computing.
[23] Kenneth O. Stanley,et al. A Hypercube-Based Encoding for Evolving Large-Scale Neural Networks , 2009, Artificial Life.
[24] Damien Ernst,et al. How to Discount Deep Reinforcement Learning: Towards New Dynamic Strategies , 2015, ArXiv.
[25] Frank Hutter,et al. Learning to Design RNA , 2018, ICLR.
[26] Shimon Whiteson,et al. Fast Efficient Hyperparameter Tuning for Policy Gradients , 2019, NeurIPS.
[27] Marc G. Bellemare,et al. The Arcade Learning Environment: An Evaluation Platform for General Agents , 2012, J. Artif. Intell. Res.
[28] Stephen Roberts,et al. Provably Efficient Online Hyperparameter Optimization with Population-Based Bandits , 2020, NeurIPS.
[29] Tom Schaul,et al. Rainbow: Combining Improvements in Deep Reinforcement Learning , 2017, AAAI.
[30] Long-Ji Lin,et al. Self-improving reactive agents based on reinforcement learning, planning and teaching , 1992, Machine Learning.
[31] Max Jaderberg,et al. Population Based Training of Neural Networks , 2017, ArXiv.
[32] Junhyuk Oh,et al. A Self-Tuning Actor-Critic Algorithm , 2020, NeurIPS.
[33] Changhu Wang,et al. Network Morphism , 2016, ICML.
[34] Quoc V. Le,et al. Neural Architecture Search with Reinforcement Learning , 2016, ICLR.
[35] Risto Miikkulainen,et al. Evolving Neural Networks through Augmenting Topologies , 2002, Evolutionary Computation.
[36] Anthony G. Francis,et al. Evolving Rewards to Automate Reinforcement Learning , 2019, ArXiv.
[37] Aaron Klein,et al. Hyperparameter Optimization , 2017, Encyclopedia of Machine Learning and Data Mining.
[38] Sergey Levine,et al. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor , 2018, ICML.
[39] Shane Legg,et al. IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures , 2018, ICML.
[40] Kagan Tumer,et al. Evolution-Guided Policy Gradient in Reinforcement Learning , 2018, NeurIPS.
[41] Karen Simonyan,et al. Off-Policy Actor-Critic with Shared Experience Replay , 2020, ICML.
[42] Risto Miikkulainen,et al. Designing neural networks through neuroevolution , 2019, Nat. Mach. Intell.
[43] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.
[44] Byoung-Tak Zhang,et al. Evolving Optimal Neural Networks Using Genetic Algorithms with Occam's Razor , 1993, Complex Syst.
[45] James Bergstra,et al. Benchmarking Reinforcement Learning Algorithms on Real-World Robots , 2018, CoRL.