Safe Human-Interactive Control via Shielding

Ensuring safety for human-interactive robotics is important due to the potential for human injury. The key challenge is defining safety in a way that accounts for the complex range of human behaviors without modeling the human as an unconstrained adversary. We propose a novel approach to ensuring safety in these settings. Our approach focuses on defining backup actions that we believe human always considers taking to avoid an accident—e.g., brake to avoid rear-ending the other agent. Given such a definition, we consider a safety constraint that guarantees safety as long as the human takes the appropriate backup actions when necessary to ensure safety. Then, we propose an algorithm that overrides an arbitrary given controller as needed to ensure that the robot is safe. We evaluate our approach in a simulated environment, interacting with both real and simulated humans.

[1]  Anca D. Dragan,et al.  Probabilistically Safe Robot Planning with Confidence-Based Human Predictions , 2018, Robotics: Science and Systems.

[2]  Ufuk Topcu,et al.  Safe Reinforcement Learning via Shielding , 2017, AAAI.

[3]  Jaime F. Fisac,et al.  Reachability-based safe learning with Gaussian processes , 2014, 53rd IEEE Conference on Decision and Control.

[4]  Kerstin Eder,et al.  Towards the safety of human-in-the-loop robotics: Challenges and opportunities for safety assurance of robotic co-workers' , 2014, The 23rd IEEE International Symposium on Robot and Human Interactive Communication.

[5]  Amnon Shashua,et al.  On a Formal Model of Safe and Scalable Self-driving Cars , 2017, ArXiv.

[6]  Shuo Li,et al.  Robust Model Predictive Shielding for Safe Reinforcement Learning with Stochastic Dynamics , 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[7]  Olaf Stursberg,et al.  Control of Uncertain Nonlinear Systems Using Ellipsoidal Reachability Calculus , 2013, NOLCOS.

[8]  Siddhartha S. Srinivasa,et al.  Toward seamless human-robot handovers , 2013, Journal of Human-Robot Interaction.

[9]  Anind K. Dey,et al.  Maximum Entropy Inverse Reinforcement Learning , 2008, AAAI.

[10]  Tatiana F. Filippova,et al.  Ellipsoidal Estimates of Reachable Sets for Control Systems with Nonlinear Terms , 2017 .

[11]  Russ Tedrake,et al.  Linear Encodings for Polytope Containment Problems , 2019, 2019 IEEE 58th Conference on Decision and Control (CDC).

[12]  Sebastian Thrun,et al.  Towards fully autonomous driving: Systems and algorithms , 2011, 2011 IEEE Intelligent Vehicles Symposium (IV).

[13]  Sebastian Thrun,et al.  Junior: The Stanford entry in the Urban Challenge , 2008, J. Field Robotics.

[14]  Sanjit A. Seshia,et al.  Verifying Robustness of Human-Aware Autonomous Cars , 2019, IFAC-PapersOnLine.

[15]  Helbing,et al.  Social force model for pedestrian dynamics. , 1995, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[16]  Anca D. Dragan,et al.  Planning for Autonomous Cars that Leverage Effects on Human Actions , 2016, Robotics: Science and Systems.

[17]  Kim Peter Wabersich,et al.  Linear Model Predictive Safety Certification for Learning-Based Control , 2018, 2018 IEEE Conference on Decision and Control (CDC).

[18]  Siddhartha S. Srinivasa,et al.  Legibility and predictability of robot motion , 2013, 2013 8th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[19]  Anca D. Dragan,et al.  Information gathering actions over human internal state , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[20]  O. Stursberg,et al.  Computing Reachable Sets of Hybrid Systems Using a Combination of Zonotopes and Polytopes , 2010 .

[21]  Işıl Dillig,et al.  Interpretation , 1994, A Bibliography of Islamic Law, 1980-1993.