[1] Tim Salimans,et al. Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks , 2016, NIPS.
[2] Olivier Sigaud,et al. Policy Improvement Methods: Between Black-Box Optimization and Episodic Reinforcement Learning , 2012 .
[3] Philip Bachman,et al. Deep Reinforcement Learning that Matters , 2017, AAAI.
[4] Thomas Bäck,et al. Evolutionary algorithms in theory and practice - evolution strategies, evolutionary programming, genetic algorithms , 1996 .
[5] Alec Radford,et al. Proximal Policy Optimization Algorithms , 2017, ArXiv.
[6] Marcin Andrychowicz,et al. Parameter Space Noise for Exploration , 2017, ICLR.
[7] Jascha Sohl-Dickstein,et al. Guided evolutionary strategies: escaping the curse of dimensionality in random search , 2018, ArXiv.
[8] Sergey Levine,et al. Meta-Reinforcement Learning of Structured Exploration Strategies , 2018, NeurIPS.
[9] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.
[10] Kagan Tumer,et al. Evolution-Guided Policy Gradient in Reinforcement Learning , 2018, NeurIPS.
[11] Zeb Kurth-Nelson,et al. Learning to reinforcement learn , 2016, CogSci.
[12] Pieter Abbeel,et al. Evolved Policy Gradients , 2018, NeurIPS.
[13] Frank Hutter,et al. Neural Architecture Search: A Survey , 2018, J. Mach. Learn. Res.
[14] Jascha Sohl-Dickstein,et al. Guided evolutionary strategies: augmenting random search with surrogate gradients , 2018, ICML.
[15] Olivier Sigaud,et al. Importance mixing: Improving sample reuse in evolutionary policy search methods , 2018, ArXiv.
[16] Benjamin Van Roy,et al. Deep Exploration via Bootstrapped DQN , 2016, NIPS.
[17] Olivier Sigaud,et al. Policy Search in Continuous Action Domains: an Overview , 2018, Neural Networks.
[18] Herke van Hoof,et al. Addressing Function Approximation Error in Actor-Critic Methods , 2018, ICML.
[19] Kagan Tumer,et al. Evolutionary Reinforcement Learning , 2018, NIPS.
[20] Kenneth O. Stanley,et al. Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning , 2017, ArXiv.
[21] Pieter Abbeel,et al. Benchmarking Deep Reinforcement Learning for Continuous Control , 2016, ICML.
[22] Anne Auger,et al. Information-Geometric Optimization Algorithms: A Unifying Picture via Invariance Principles , 2011, J. Mach. Learn. Res.
[23] Tom Schaul,et al. Efficient natural evolution strategies , 2009, GECCO.
[24] Kenneth O. Stanley,et al. ES is more than just a traditional finite-difference approximator , 2017, GECCO.
[25] Nikolaus Hansen,et al. The CMA Evolution Strategy: A Tutorial , 2016, ArXiv.
[26] Xi Chen,et al. Evolution Strategies as a Scalable Alternative to Reinforcement Learning , 2017, ArXiv.
[27] J. A. Lozano,et al. Estimation of Distribution Algorithms: A New Tool for Evolutionary Computation , 2001 .
[28] Olivier Sigaud,et al. Path Integral Policy Improvement with Covariance Matrix Adaptation , 2012, ICML.
[29] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
[30] Quoc V. Le,et al. Neural Architecture Search with Reinforcement Learning , 2016, ICLR.
[31] Olivier Sigaud,et al. Robot Skill Learning: From Reinforcement Learning to Evolution Strategies , 2013, Paladyn J. Behav. Robotics.
[32] Kenneth O. Stanley,et al. Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents , 2017, NeurIPS.
[33] Larry Bull,et al. On the Baldwin Effect , 1999, Artificial Life.
[34] Pierre-Yves Oudeyer,et al. GEP-PG: Decoupling Exploration and Exploitation in Deep Reinforcement Learning Algorithms , 2017, ICML.
[35] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.