暂无分享,去创建一个
Olivier Sigaud | Timothy M. Hospedales | Freek Stulp | Chenyang Zhao | F. Stulp | Olivier Sigaud | Chenyang Zhao
[1] Yuval Tassa,et al. DeepMind Control Suite , 2018, ArXiv.
[2] Sergey Levine,et al. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor , 2018, ICML.
[3] Pierre-Yves Oudeyer,et al. How Many Random Seeds? Statistical Power Analysis in Deep Reinforcement Learning Experiments , 2018, ArXiv.
[4] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[5] Philip Bachman,et al. Deep Reinforcement Learning that Matters , 2017, AAAI.
[6] Antoine Cully,et al. Robots that can adapt like animals , 2014, Nature.
[7] Demis Hassabis,et al. Mastering the game of Go without human knowledge , 2017, Nature.
[8] Abhinav Gupta,et al. Robust Adversarial Reinforcement Learning , 2017, ICML.
[9] Stéphane Doncieux,et al. The Transferability Approach: Crossing the Reality Gap in Evolutionary Robotics , 2013, IEEE Transactions on Evolutionary Computation.
[10] Alec Radford,et al. Proximal Policy Optimization Algorithms , 2017, ArXiv.
[11] Stefan Schaal,et al. Learning to grasp under uncertainty , 2011, 2011 IEEE International Conference on Robotics and Automation.
[12] Dawn Xiaodong Song,et al. Assessing Generalization in Deep Reinforcement Learning , 2018, ArXiv.
[13] Gabriela Csurka,et al. Domain Adaptation in Computer Vision Applications , 2017, Advances in Computer Vision and Pattern Recognition.
[14] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.
[15] Pieter Abbeel,et al. Benchmarking Deep Reinforcement Learning for Continuous Control , 2016, ICML.
[16] Girish Chowdhary,et al. Robust Deep Reinforcement Learning with Adversarial Attacks , 2017, AAMAS.
[17] Joelle Pineau,et al. A Dissection of Overfitting and Generalization in Continuous Reinforcement Learning , 2018, ArXiv.
[18] Silvio Savarese,et al. Adversarially Robust Policy Learning: Active construction of physically-plausible perturbations , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[19] J. Andrew Bagnell,et al. Modeling Purposeful Adaptive Behavior with the Principle of Maximum Causal Entropy , 2010 .
[20] Wolfram Burgard,et al. The limits and potentials of deep learning for robotics , 2018, Int. J. Robotics Res..
[21] Yuval Tassa,et al. MuJoCo: A physics engine for model-based control , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[22] Samy Bengio,et al. A Study on Overfitting in Deep Reinforcement Learning , 2018, ArXiv.
[23] Sandy H. Huang,et al. Adversarial Attacks on Neural Network Policies , 2017, ICLR.
[24] Stefano Soatto,et al. Entropy-SGD: biasing gradient descent into wide valleys , 2016, ICLR.
[25] Jian Zhang,et al. Structured Control Nets for Deep Reinforcement Learning , 2018, ICML.
[26] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.
[27] Wojciech Zaremba,et al. Domain randomization for transferring deep neural networks from simulation to the real world , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[28] Taehoon Kim,et al. Quantifying Generalization in Reinforcement Learning , 2018, ICML.