Probabilistic Safety Constraints for Learned High Relative Degree System Dynamics

This paper focuses on learning a model of system dynamics online while satisfying safety constraints.Our motivation is to avoid offline system identification or hand-specified dynamics models and allowa system to safely and autonomously estimate and adapt its own model during online operation.Given streaming observations of the system state, we use Bayesian learning to obtain a distributionover the system dynamics. In turn, the distribution is used to optimize the system behavior andensure safety with high probability, by specifying a chance constraint over a control barrier function.

[1]  Paulo Tabuada,et al.  Control Barrier Functions: Theory and Applications , 2019, 2019 18th European Control Conference (ECC).

[2]  Tara Javidi,et al.  Gaussian Process bandits with adaptive discretization , 2017, ArXiv.

[3]  Evangelos Theodorou,et al.  Bayesian Learning-Based Adaptive Control for Safety Critical Systems , 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[4]  Guang Yang,et al.  Self-triggered Control for Safety Critical Systems Using Control Barrier Functions , 2019, 2019 American Control Conference (ACC).

[5]  Thomas Lew,et al.  Safe Learning and Control using Meta-Learning , 2010 .

[6]  Dimos V. Dimarogonas,et al.  Learning Control Barrier Functions from Expert Demonstrations , 2020, 2020 59th IEEE Conference on Decision and Control (CDC).

[7]  Koushil Sreenath,et al.  Optimal robust control for constrained nonlinear hybrid systems with application to bipedal locomotion , 2016, 2016 American Control Conference (ACC).

[8]  Jaime F. Fisac,et al.  A General Safety Framework for Learning-Based Control in Uncertain Robotic Systems , 2017, IEEE Transactions on Automatic Control.

[9]  Aaron D. Ames,et al.  Episodic Learning with Control Lyapunov Functions for Uncertain Robotic Systems* , 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[10]  George J. Pappas,et al.  A Framework for Worst-Case and Stochastic Safety Verification Using Barrier Certificates , 2007, IEEE Transactions on Automatic Control.

[11]  Sandra Hirche,et al.  Feedback Linearization Based on Gaussian Processes With Event-Triggered Online Learning , 2019, IEEE Transactions on Automatic Control.

[12]  Paulo Tabuada,et al.  Realizing simultaneous lane keeping and adaptive speed regulation on accessible mobile robot testbeds , 2017, 2017 IEEE Conference on Control Technology and Applications (CCTA).

[13]  Nikolai Matni,et al.  On the Sample Complexity of the Linear Quadratic Regulator , 2017, Foundations of Computational Mathematics.

[14]  Lawrence Carin,et al.  Learning Structured Weight Uncertainty in Bayesian Neural Networks , 2017, AISTATS.

[15]  S. Ghosal,et al.  Posterior consistency of Gaussian process prior for nonparametric binary regression , 2006, math/0702686.

[16]  Jonathan P. How,et al.  Bayesian Nonparametric Adaptive Control Using Gaussian Processes , 2015, IEEE Transactions on Neural Networks and Learning Systems.

[17]  Carl E. Rasmussen,et al.  PILCO: A Model-Based and Data-Efficient Approach to Policy Search , 2011, ICML.

[18]  Vijay Kumar,et al.  Approximating Explicit Model Predictive Control Using Constrained Neural Networks , 2018, 2018 Annual American Control Conference (ACC).

[19]  Andreas Krause,et al.  Information-Theoretic Regret Bounds for Gaussian Process Optimization in the Bandit Setting , 2009, IEEE Transactions on Information Theory.

[20]  Andreas Krause,et al.  Safe Model-based Reinforcement Learning with Stability Guarantees , 2017, NIPS.

[21]  Neil D. Lawrence,et al.  Kernels for Vector-Valued Functions: a Review , 2011, Found. Trends Mach. Learn..

[22]  Li Wang,et al.  Safe Learning of Quadrotor Dynamics Using Barrier Certificates , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[23]  Gábor Orosz,et al.  End-to-End Safe Reinforcement Learning through Barrier Functions for Safety-Critical Continuous Control Tasks , 2019, AAAI.

[24]  Marco Pavone,et al.  Meta-Learning Priors for Efficient Online Bayesian Regression , 2018, WAFR.

[25]  Aaron D. Ames,et al.  Adaptive Safety with Control Barrier Functions , 2019, 2020 American Control Conference (ACC).

[26]  Osbert Bastani,et al.  Safe Planning via Model Predictive Shielding , 2019 .

[27]  Paulo Tabuada,et al.  Control Barrier Function Based Quadratic Programs for Safety Critical Systems , 2016, IEEE Transactions on Automatic Control.

[28]  Massimo Franceschetti,et al.  Authentication of cyber-physical systems under learning-based attacks , 2018, IFAC-PapersOnLine.

[29]  Ali-akbar Agha-mohammadi,et al.  Deep Learning Tubes for Tube MPC , 2020, RSS 2020.

[30]  Soon-Jo Chung,et al.  Robust Regression for Safe Exploration in Control , 2019, L4DC.

[31]  Xiaojing Zhang,et al.  Data-Driven Predictive Control for Autonomous Systems , 2018, Annu. Rev. Control. Robotics Auton. Syst..

[32]  Andreas Krause,et al.  Bayesian optimization with safety constraints: safe and automatic parameter tuning in robotics , 2016, Machine Learning.

[33]  P. Olver Nonlinear Systems , 2013 .

[34]  Kim Peter Wabersich,et al.  Safe exploration of nonlinear dynamical systems: A predictive safety filter for reinforcement learning , 2018, ArXiv.

[35]  Munther A. Dahleh,et al.  Finite-Time System Identification for Partially Observed LTI Systems of Unknown Order , 2019, ArXiv.

[36]  Aaron D. Ames,et al.  Control barrier function based quadratic programs with application to bipedal robotic walking , 2015, 2015 American Control Conference (ACC).

[37]  Dorsa Sadigh,et al.  Efficient and Safe Exploration in Deterministic Markov Decision Processes with Unknown Transition Models , 2019, 2019 American Control Conference (ACC).

[38]  Andrew Clark,et al.  Control Barrier Functions for Complete and Incomplete Information Stochastic Systems , 2019, 2019 American Control Conference (ACC).

[39]  D. Sengupta Linear models , 2003 .

[40]  Munther A. Dahleh,et al.  Nonparametric Finite Time LTI System Identification , 2019, 1902.01848.

[41]  Andreas Krause,et al.  Safe controller optimization for quadrotors with Gaussian processes , 2015, 2016 IEEE International Conference on Robotics and Automation (ICRA).

[42]  Torsten Koller,et al.  Learning-based Model Predictive Control for Safe Exploration and Reinforcement Learning , 2019, ArXiv.

[43]  Aaron D. Ames,et al.  Formal Test Synthesis for Safety-Critical Autonomous Systems based on Control Barrier Functions , 2020, 2020 59th IEEE Conference on Decision and Control (CDC).

[44]  Ashwin P. Dani,et al.  Active Sampling based Safe Identification of Dynamical Systems using Extreme Learning Machines and Barrier Certificates , 2019, 2019 International Conference on Robotics and Automation (ICRA).

[45]  Paulo Tabuada,et al.  Robustness of Control Barrier Functions for Safety Critical Control , 2016, ADHS.

[46]  Zoubin Ghahramani,et al.  Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning , 2015, ICML.

[47]  Samuel Coogan,et al.  A Barrier Function Approach to Finite-Time Stochastic System Verification and Control , 2019, Autom..

[48]  Benjamin Recht,et al.  Least-Squares Temporal Difference Learning for the Linear Quadratic Regulator , 2017, ICML.

[49]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[50]  John Lygeros,et al.  Data-Enabled Predictive Control: In the Shallows of the DeePC , 2018, 2019 18th European Control Conference (ECC).

[51]  Max Welling,et al.  Structured and Efficient Variational Deep Learning with Matrix Gaussian Posteriors , 2016, ICML.

[52]  Anders Rantzer,et al.  Concentration Bounds for Single Parameter Adaptive Control , 2018, 2018 Annual American Control Conference (ACC).

[53]  Koushil Sreenath,et al.  Exponential Control Barrier Functions for enforcing high relative-degree safety-critical constraints , 2016, 2016 American Control Conference (ACC).