Sequential Neural Barriers for Scalable Dynamic Obstacle Avoidance

There are two major challenges for scaling up robot navigation around dynamic obstacles: the complex interaction dynamics of the obstacles can be hard to model analytically, and the complexity of planning and control grows exponentially in the number of obstacles. Data-driven and learning-based methods are thus particularly valuable in this context. However, data-driven methods are sensitive to distribution drift, making it hard to train and generalize learned models across different obstacle densities. We propose a novel method for compositional learning of Sequential Neural Control Barrier models (SNCBFs) to achieve scalability. Our approach exploits an important observation: the spatial interaction patterns of multiple dynamic obstacles can be decomposed and predicted through temporal sequences of states for each obstacle. Through decomposition, we can generalize control policies trained only with a small number of obstacles, to environments where the obstacle density can be 100x higher. We demonstrate the benefits of the proposed methods in improving dynamic collision avoidance in comparison with existing methods including potential fields, end-to-end reinforcement learning, and model-predictive control. We also perform hardware experiments and show the practical effectiveness of the approach in the supplementary video.

[1]  M. Tomizuka,et al.  Learning Differentiable Safety-Critical Control using Control Barrier Functions for Generalization to Novel Environments , 2022, 2022 European Control Conference (ECC).

[2]  Sicun Gao,et al.  Safe Nonlinear Control Using Robust Neural Lyapunov-Barrier Functions , 2021, CoRL.

[3]  J. Breeden,et al.  Robust Control Barrier Functions under High Relative Degree and Input Constraints for Satellite Trajectories , 2021, Autom..

[4]  Shengbo Eben Li,et al.  Model-based Constrained Reinforcement Learning using Generalized Control Barrier Function , 2021, 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[5]  K. Zhang,et al.  Learning Safe Multi-Agent Control with Decentralized Neural Barrier Certificates , 2021, ICLR.

[6]  Andrew Singletary,et al.  Comparative Analysis of Control Barrier Functions and Artificial Potential Fields for Obstacle Avoidance , 2020, 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[7]  Yan Chen,et al.  Switched Control Barrier Function With Applications to Vehicle Safety Control , 2020 .

[8]  Li Li,et al.  Multi-agent system motion planning under temporal logic specifications and control barrier function , 2020, Control Theory and Technology.

[9]  Shaoshuai Mou,et al.  Neural Certificates for Safe Control Policies , 2020, ArXiv.

[10]  Dimos V. Dimarogonas,et al.  Learning Control Barrier Functions from Expert Demonstrations , 2020, 2020 59th IEEE Conference on Decision and Control (CDC).

[11]  Vipin Kumar,et al.  Integrating Physics-Based Modeling with Machine Learning: A Survey , 2020, ArXiv.

[12]  Samuel Coogan,et al.  Synthesis of Control Barrier Functions Using a Supervised Machine Learning Approach , 2020, 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[13]  Yisong Yue,et al.  Learning for Safety-Critical Control with Control Barrier Functions , 2019, L4DC.

[14]  Matteo Saveriano,et al.  Learning Barrier Functions for Constrained Motion Planning with Dynamical Systems , 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[15]  Jonathan P. How,et al.  Collision Avoidance in Pedestrian-Rich Environments With Deep Reinforcement Learning , 2019, IEEE Access.

[16]  Mengdi Wang,et al.  Reinforcement Leaning in Feature Space: Matrix Bandit, Kernels, and Regret Bound , 2019, ICML.

[17]  Dariu M. Gavrila,et al.  Human motion trajectory prediction: a survey , 2019, Int. J. Robotics Res..

[18]  Paulo Tabuada,et al.  Control Barrier Functions: Theory and Applications , 2019, 2019 18th European Control Conference (ECC).

[19]  Calin Belta,et al.  Control Barrier Functions for Systems with High Relative Degree , 2019, 2019 IEEE 58th Conference on Decision and Control (CDC).

[20]  Magnus Egerstedt,et al.  Control of Multi-Agent Systems with Finite Time Control Barrier Certificates and Temporal Logic , 2018, 2018 IEEE Conference on Decision and Control (CDC).

[21]  Magnus Egerstedt,et al.  Boolean Composability of Constraints and Control Synthesis for Multi-Robot Systems via Nonsmooth Control Barrier Functions , 2018, 2018 IEEE Conference on Control Technology and Applications (CCTA).

[22]  Aaron D. Ames,et al.  Towards a Framework for Realizable Safety Critical Control through Active Set Invariance , 2018, 2018 ACM/IEEE 9th International Conference on Cyber-Physical Systems (ICCPS).

[23]  Marcin Andrychowicz,et al.  Overcoming Exploration in Reinforcement Learning with Demonstrations , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[24]  Hao Zhang,et al.  Towards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[25]  Alec Radford,et al.  Proximal Policy Optimization Algorithms , 2017, ArXiv.

[26]  Magnus Egerstedt,et al.  Nonsmooth Barrier Functions With Applications to Multi-Robot Systems , 2017, IEEE Control Systems Letters.

[27]  Jonathan P. How,et al.  Socially aware motion planning with deep reinforcement learning , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[28]  Aaron D. Ames,et al.  Safety Barrier Certificates for Collisions-Free Multirobot Systems , 2017, IEEE Transactions on Robotics.

[29]  Yanjun Huang,et al.  Path Planning and Tracking for Vehicle Collision Avoidance Based on Model Predictive Control With Multiconstraints , 2017, IEEE Transactions on Vehicular Technology.

[30]  Koushil Sreenath,et al.  3D dynamic walking on stepping stones with control barrier functions , 2016, 2016 IEEE 55th Conference on Decision and Control (CDC).

[31]  Li Wang,et al.  Multi-objective compositions for collision-free connectivity maintenance in teams of mobile robots , 2016, 2016 IEEE 55th Conference on Decision and Control (CDC).

[32]  Juš Kocijan,et al.  Modelling and Control of Dynamic Systems Using Gaussian Process Models , 2015 .

[33]  Alex Graves,et al.  Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[34]  Rachid Alami,et al.  Human-aware robot navigation: A survey , 2013, Robotics Auton. Syst..

[35]  Jun Wang,et al.  Model Predictive Control of Unknown Nonlinear Dynamical Systems Based on Recurrent Neural Networks , 2012, IEEE Transactions on Industrial Electronics.

[36]  Dinesh Manocha,et al.  Reciprocal n-Body Collision Avoidance , 2011, ISRR.

[37]  Andreas Krause,et al.  Unfreezing the robot: Navigation in dense, interacting crowds , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[38]  Shuzhi Sam Ge,et al.  Dynamic Motion Planning for Mobile Robots Using Potential Field Method , 2002, Auton. Robots.

[39]  Helbing,et al.  Congested traffic states in empirical observations and microscopic simulations , 2000, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[40]  Anders Stenman,et al.  Model-free predictive control , 1999, Proceedings of the 38th IEEE Conference on Decision and Control (Cat. No.99CH36304).

[41]  S. Hochreiter,et al.  Long Short-Term Memory , 1997, Neural Computation.

[42]  Wolfram Burgard,et al.  The dynamic window approach to collision avoidance , 1997, IEEE Robotics Autom. Mag..

[43]  Jean-Claude Latombe,et al.  Numerical potential field techniques for robot path planning , 1991, Fifth International Conference on Advanced Robotics 'Robots in Unstructured Environments.

[44]  O. Khatib,et al.  Real-Time Obstacle Avoidance for Manipulators and Mobile Robots , 1985, Proceedings. 1985 IEEE International Conference on Robotics and Automation.

[45]  L. F. Henderson,et al.  The Statistics of Crowd Fluids , 1971, Nature.

[46]  Marko Bacic,et al.  Model predictive control , 2003 .

[47]  Alberto Bemporad,et al.  Robust model predictive control: A survey , 1998, Robustness in Identification and Control.