Recovery RL: Safe Reinforcement Learning With Learned Recovery Zones
暂无分享,去创建一个
Brijen Thananjeyan | Ashwin Balakrishna | Minho Hwang | Chelsea Finn | Joseph Gonzalez | Kenneth Y. Goldberg | Julian Ibarz | Suraj Nair | Michael Luo | Krishnan Srinivasan | Ken Goldberg | Chelsea Finn | Julian Ibarz | Joseph E. Gonzalez | Suraj Nair | Michael Luo | Brijen Thananjeyan | K. Srinivasan | M. Hwang | A. Balakrishna
[1] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.
[2] Mohammad Ghavamzadeh,et al. Lyapunov-based Safe Policy Optimization for Continuous Control , 2019, ArXiv.
[3] Fritz Wysotzki,et al. Risk-Sensitive Reinforcement Learning Applied to Control under Constraints , 2005, J. Artif. Intell. Res..
[4] S. Savarese,et al. Goal-Aware Prediction: Learning to Model What Matters , 2020, ICML.
[5] Sergey Levine,et al. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor , 2018, ICML.
[6] Pieter Abbeel,et al. Constrained Policy Optimization , 2017, ICML.
[7] Henry Zhu,et al. Soft Actor-Critic Algorithms and Applications , 2018, ArXiv.
[8] Klaus Obermayer,et al. Risk-Sensitive Reinforcement Learning , 2013, Neural Computation.
[9] Sergey Levine,et al. Learning compound multi-step controllers under unknown dynamics , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).
[10] Shuo Li,et al. Robust Model Predictive Shielding for Safe Reinforcement Learning with Stochastic Dynamics , 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA).
[11] Shie Mannor,et al. Reward Constrained Policy Optimization , 2018, ICLR.
[12] Brijen Thananjeyan,et al. Efficiently Calibrating Cable-Driven Surgical Robots With RGBD Sensing, Temporal Windowing, and Linear and Recurrent Neural Network Compensation , 2020, ArXiv.
[13] Jaime F. Fisac,et al. A General Safety Framework for Learning-Based Control in Uncertain Robotic Systems , 2017, IEEE Transactions on Automatic Control.
[14] Jaime F. Fisac,et al. Bridging Hamilton-Jacobi Safety Analysis and Reinforcement Learning , 2019, 2019 International Conference on Robotics and Automation (ICRA).
[15] Ofir Nachum,et al. A Lyapunov-based Approach to Safe Reinforcement Learning , 2018, NeurIPS.
[16] Francesco Borrelli,et al. Learning Model Predictive Control for Iterative Tasks , 2016, ArXiv.
[17] Dario Amodei,et al. Benchmarking Safe Exploration in Deep Reinforcement Learning , 2019 .
[18] Sergey Levine,et al. Leave no Trace: Learning to Reset for Safe and Autonomous Reinforcement Learning , 2017, ICLR.
[19] E. Altman. Constrained Markov Decision Processes , 1999 .
[20] Matthias Heger,et al. Consideration of Risk in Reinforcement Learning , 1994, ICML.
[21] Ruslan Salakhutdinov,et al. Worst Cases Policy Gradients , 2019, CoRL.
[22] Brijen Thananjeyan,et al. ABC-LMPC: Safe Sample-Based Learning MPC for Stochastic Nonlinear Dynamical Systems with Adjustable Boundary Conditions , 2020, WAFR.
[23] Francesco Borrelli,et al. Learning Model Predictive Control for Iterative Tasks. A Data-Driven Control Framework , 2016, IEEE Transactions on Automatic Control.
[24] Andreas Krause,et al. Safe Model-based Reinforcement Learning with Stability Guarantees , 2017, NIPS.
[25] Claire J. Tomlin,et al. Guaranteed Safe Online Learning via Reachability: tracking a ground target using a quadrotor , 2012, 2012 IEEE International Conference on Robotics and Automation.
[26] Shie Mannor,et al. Policy Gradients Beyond Expectations: Conditional Value-at-Risk , 2014, ArXiv.
[27] Sergey Levine,et al. Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models , 2018, NeurIPS.
[28] Mo Chen,et al. Hamilton-Jacobi reachability: A brief overview and recent advances , 2017, 2017 IEEE 56th Annual Conference on Decision and Control (CDC).
[29] Sehoon Ha,et al. Learning to be Safe: Deep RL with a Safety Critic , 2020, ArXiv.
[30] Ufuk Topcu,et al. Safe Reinforcement Learning via Shielding , 2017, AAAI.
[31] Sergey Levine,et al. Visual Foresight: Model-Based Deep Reinforcement Learning for Vision-Based Robotic Control , 2018, ArXiv.
[32] Sergey Levine,et al. QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation , 2018, CoRL.
[33] Peter Kazanzides,et al. An open-source research kit for the da Vinci® Surgical System , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).
[34] Andreas Krause,et al. Safe Exploration in Finite Markov Decision Processes with Gaussian Processes , 2016, NIPS.
[35] Charles Richter,et al. Safe Visual Navigation via Deep Learning and Novelty Detection , 2017, Robotics: Science and Systems.
[36] Sergey Levine,et al. Deep Dynamics Models for Learning Dexterous Manipulation , 2019, CoRL.
[37] S. Levine,et al. Safety Augmented Value Estimation From Demonstrations (SAVED): Safe Deep Model-Based RL for Sparse Cost Robotic Tasks , 2019, IEEE Robotics and Automation Letters.