Safe Nonlinear Control Using Robust Neural Lyapunov-Barrier Functions

Safety and stability are common requirements for robotic control systems; however, designing safe, stable controllers remains difficult for nonlinear and uncertain models. We develop a model-based learning approach to synthesize robust feedback controllers with safety and stability guarantees. We take inspiration from robust convex optimization and Lyapunov theory to define robust control Lyapunov barrier functions that generalize despite model uncertainty. We demonstrate our approach in simulation on problems including car trajectory tracking, nonlinear control with obstacle avoidance, satellite rendezvous with safety constraints, and flight control with a learned ground effect model. Simulation results show that our approach yields controllers that match or exceed the capabilities of robust MPC while reducing computational costs by an order of magnitude.

[1]  Andreas Krause,et al.  The Lyapunov Neural Network: Adaptive Stability Certification for Safe Learning of Dynamical Systems , 2018, CoRL.

[2]  Matthias Althoff,et al.  CommonRoad: Composable benchmarks for motion planning on roads , 2017, 2017 IEEE Intelligent Vehicles Symposium (IV).

[3]  Sicun Gao,et al.  Neural Lyapunov Control , 2020, NeurIPS.

[4]  Zengyi Qin,et al.  Learning Safe Multi-Agent Control with Decentralized Neural Barrier Certificates , 2021, ArXiv.

[5]  Z. Artstein Stabilization with relaxed controls , 1983 .

[6]  Yuichi Yoshida,et al.  Spectral Normalization for Generative Adversarial Networks , 2018, ICLR.

[7]  Nikolai Matni,et al.  Learning Stability Certificates from Data , 2020, CoRL.

[8]  Alessandro Abate,et al.  Automated Formal Synthesis of Neural Barrier Certificates for Dynamical Models , 2020, ArXiv.

[9]  Pieter Abbeel,et al.  Constrained Policy Optimization , 2017, ICML.

[10]  P. Giesl,et al.  Review on computational methods for Lyapunov functions , 2015 .

[11]  Lixian Zhang,et al.  Actor-Critic Reinforcement Learning for Control With Stability Guarantee , 2020, IEEE Robotics and Automation Letters.

[12]  Koushil Sreenath,et al.  Reinforcement Learning for Safety-Critical Control under Model Uncertainty, using Control Lyapunov Functions and Control Barrier Functions , 2020, Robotics: Science and Systems.

[13]  Mykel J. Kochenderfer,et al.  Algorithms for Verifying Deep Neural Networks , 2019, Found. Trends Optim..

[14]  William S. Levine,et al.  Handbook of Model Predictive Control , 2018, Control Engineering.

[15]  Soon-Jo Chung,et al.  Robust Regression for Safe Exploration in Control , 2019, L4DC.

[16]  Koushil Sreenath,et al.  Rapidly Exponentially Stabilizing Control Lyapunov Functions and Hybrid Zero Dynamics , 2014, IEEE Transactions on Automatic Control.

[17]  Susmit Jha,et al.  Learning Certified Control using Contraction Metric , 2020, CoRL.

[18]  Ian M. Mitchell,et al.  A Toolbox of Hamilton-Jacobi Solvers for Analysis of Nondeterministic Continuous and Hybrid Systems , 2005, HSCC.

[19]  Ali-akbar Agha-mohammadi,et al.  Deep Learning Tubes for Tube MPC , 2020, Robotics: Science and Systems.

[20]  Paulo Tabuada,et al.  Control Barrier Function Based Quadratic Programs for Safety Critical Systems , 2016, IEEE Transactions on Automatic Control.

[21]  Wei Xiao,et al.  High Order Control Lyapunov-Barrier Functions for Temporal Logic Specifications , 2021, 2021 American Control Conference (ACC).

[22]  Aaron D. Ames,et al.  Guaranteeing Safety of Learned Perception Modules via Measurement-Robust Control Barrier Functions , 2020, CoRL.

[23]  Hiroyasu Tsukamoto,et al.  Neural Contraction Metrics for Robust Estimation and Control: A Convex Optimization Approach , 2021, IEEE Control Systems Letters.

[24]  Mo Chen,et al.  Hamilton-Jacobi reachability: A brief overview and recent advances , 2017, 2017 IEEE 56th Annual Conference on Decision and Control (CDC).

[25]  R. Scott Erwin,et al.  A spacecraft benchmark problem for hybrid control and estimation , 2016, 2016 IEEE 55th Conference on Decision and Control (CDC).

[26]  Alberto Bemporad,et al.  Robust model predictive control: A survey , 1998, Robustness in Identification and Control.

[27]  Amir Ali Ahmadi,et al.  Some applications of polynomial optimization in operations research and real-time decision making , 2015, Optimization Letters.

[28]  Alessandro Abate,et al.  Formal Synthesis of Lyapunov Neural Networks , 2020, IEEE Control Systems Letters.

[29]  Natalia Gimelshein,et al.  PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[30]  Mircea Lazar,et al.  Automated-Sampling-Based Stability Verification and DOA Estimation for Nonlinear Systems , 2018, IEEE Transactions on Automatic Control.

[31]  Johan Löfberg,et al.  Automatic robust convex programming , 2012, Optim. Methods Softw..

[32]  Aaron D. Ames,et al.  Episodic Learning with Control Lyapunov Functions for Uncertain Robotic Systems* , 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[33]  Amir Ali Ahmadi,et al.  Control design along trajectories with sums of squares programming , 2012, 2013 IEEE International Conference on Robotics and Automation.

[34]  W. H. Clohessy,et al.  Terminal Guidance System for Satellite Rendezvous , 2012 .

[35]  Richard M. Murray,et al.  Feedback Systems An Introduction for Scientists and Engineers , 2007 .

[36]  Andrew Singletary,et al.  Comparative Analysis of Control Barrier Functions and Artificial Potential Fields for Obstacle Avoidance , 2020, 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[37]  Daniel Liberzon,et al.  Almost Lyapunov functions for nonlinear systems , 2018, Autom..

[38]  Bayu Jayawardhana,et al.  Stabilization with guaranteed safety using Control Lyapunov-Barrier Function , 2016, Autom..

[39]  Sergey Levine,et al.  PLATO: Policy learning using adaptive trajectory optimization , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[40]  Paulo Tabuada,et al.  Control Barrier Functions: Theory and Applications , 2019, 2019 18th European Control Conference (ECC).

[41]  Claire J. Tomlin,et al.  Gaussian Process-based Min-norm Stabilizing Controller for Control-Affine Systems with Uncertain Input Effects and Dynamics , 2020, 2021 American Control Conference (ACC).

[42]  Gábor Orosz,et al.  End-to-End Safe Reinforcement Learning through Barrier Functions for Safety-Critical Continuous Control Tasks , 2019, AAAI.