Sergey Levine | Pieter Abbeel | Gregory Kahn | Adam Villaflor | Vitchyr H. Pong
[1] B. Efron, et al. The Jackknife: The Bootstrap and Other Resampling Plans, 1983.
[2] B. Efron. The jackknife, the bootstrap, and other resampling plans, 1987.
[3] Robert Tibshirani, et al. An Introduction to the Bootstrap, 1994.
[4] S. T. Buckland, et al. An Introduction to the Bootstrap, 1994.
[5] Jeff G. Schneider, et al. Exploiting Model Uncertainty Estimates for Safe Dynamic Control Learning, 1996, NIPS.
[6] T. Fearn. The Jackknife, 2000.
[7] Andrew G. Barto, et al. Lyapunov Design for Safe Reinforcement Learning, 2003, J. Mach. Learn. Res.
[8] Stefan Schaal, et al. Policy Gradient Methods for Robotics, 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[9] Pierre-Brice Wieber, et al. Viability and predictive control for safe locomotion, 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[10] William Whittaker, et al. Autonomous driving in urban environments: Boss and the Urban Challenge, 2008, J. Field Robotics.
[11] Geoffrey E. Hinton, et al. Rectified Linear Units Improve Restricted Boltzmann Machines, 2010, ICML.
[12] Carl E. Rasmussen, et al. PILCO: A Model-Based and Data-Efficient Approach to Policy Search, 2011, ICML.
[13] Purnamrita Sarkar, et al. The Big Data Bootstrap, 2012, ICML.
[14] Claire J. Tomlin, et al. Guaranteed Safe Online Learning via Reachability: tracking a ground target using a quadrotor, 2012, 2012 IEEE International Conference on Robotics and Automation.
[15] Pieter Abbeel, et al. Safe Exploration in Markov Decision Processes, 2012, ICML.
[16] Claire J. Tomlin, et al. Reducing Conservativeness in Safety Guarantees by Learning Disturbances Online: Iterated Guaranteed Safe Online Learning, 2012, Robotics: Science and Systems.
[17] Alex Graves, et al. Playing Atari with Deep Reinforcement Learning, 2013, ArXiv.
[18] Jan Peters, et al. Reinforcement learning in robotics: A survey, 2013, Int. J. Robotics Res.
[19] Jan Peters, et al. A Survey on Policy Search for Robotics, 2013, Found. Trends Robotics.
[20] Nitish Srivastava, et al. Dropout: a simple way to prevent neural networks from overfitting, 2014, J. Mach. Learn. Res.
[21] Daniel Cownden, et al. Random feedback weights support learning in deep neural networks, 2014, ArXiv.
[22] Sergey Levine, et al. Trust Region Policy Optimization, 2015, ICML.
[23] Jimmy Ba, et al. Adam: A Method for Stochastic Optimization, 2014, ICLR.
[24] Vijay Kumar, et al. Safe receding horizon control for aggressive MAV flight with limited range sensing, 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[25] Charles Richter, et al. Bayesian Learning for Safe High-Speed Navigation in Unknown Environments, 2015, ISRR.
[26] Yuval Tassa, et al. Continuous control with deep reinforcement learning, 2015, ICLR.
[27] Martial Hebert, et al. Introspective perception: Learning to predict failures in vision systems, 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[28] C. Rasmussen, et al. Improving PILCO with Bayesian Neural Network Dynamics Models, 2016.
[29] Benjamin Van Roy, et al. Deep Exploration via Bootstrapped DQN, 2016, NIPS.
[30] Raffaello D'Andrea, et al. Relaxed hover solutions for multicopters: Application to algorithmic redundancy and novel vehicles, 2016, Int. J. Robotics Res.
[31] Sergey Levine, et al. End-to-End Training of Deep Visuomotor Policies, 2015, J. Mach. Learn. Res.
[32] Zoubin Ghahramani, et al. Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning, 2015, ICML.
[33] Russ Tedrake, et al. Funnel libraries for real-time robust feedback motion planning, 2016, Int. J. Robotics Res.
[34] Sergey Levine, et al. PLATO: Policy learning using adaptive trajectory optimization, 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).
[35] Andreas Krause, et al. Bayesian optimization with safety constraints: safe and automatic parameter tuning in robotics, 2016, Machine Learning.