Incremental minimum-violation control synthesis for robots interacting with external agents

We consider the problem of control strategy synthesis for robots that interact with external agents, together known as the environment. Both the robot and the environment are modeled as dynamical systems with differential constraints and take part in a nonzero-sum two-player differential game to fulfill their respective task specifications while satisfying a set of safety rules. They minimize a cost function that is representative of the level of unsafety with respect to these safety rules. Throughout, the problem is motivated by an autonomous car in an urban environment that interacts with other cars in situations such as navigating stop signs at road junctions and single-lane roads. Ideas behind sampling-based motion-planning algorithms are used to incrementally construct a finite Kripke structure abstraction of a continuous dynamical system. Model-checking techniques for safety rules expressed using Linear Temporal Logic (LTL) are then leveraged to propose an algorithm which synthesizes a control strategy for the two-player game. We analyze the algorithm to show that, with probability one, it converges to the Stackelberg equilibrium asymptotically. This algorithm is also demonstrated in a number of simulation experiments.

[1]  L. Dubins On Curves of Minimal Length with a Constraint on Average Curvature, and with Prescribed Initial and Terminal Positions and Tangents , 1957 .

[2]  C. Chen,et al.  Stackelburg solution for two-person games with biased information patterns , 1972 .

[3]  T. Başar,et al.  Dynamic Noncooperative Game Theory , 1982 .

[4]  H. J. Pesch,et al.  Complex differential games of pursuit-evasion type with state constraints, part 2: Numerical computation of optimal open-loop strategies , 1993 .

[5]  Michael Sipser,et al.  Introduction to the Theory of Computation , 1996, SIGA.

[6]  Leonidas J. Guibas,et al.  A Visibility-Based Pursuit-Evasion Problem , 1999, Int. J. Comput. Geom. Appl..

[7]  James A. Sethian,et al.  Level Set Methods and Fast Marching Methods , 1999 .

[8]  Pierpaolo Soravia Optimal control with discontinuous running cost: Eikonal equation and shape-from-shading , 2000, Proceedings of the 39th IEEE Conference on Decision and Control (Cat. No.00CH37187).

[9]  T. Raivio Capture Set Computation of an Optimally Guided Missile , 2001 .

[10]  Philippe Schnoebelen,et al.  On Model Checking Durational Kripke Structures , 2002, FoSSaCS.

[11]  Doron A. Peled,et al.  Temporal Debugging for Concurrent Systems , 2002, TACAS.

[12]  Christel Baier,et al.  PROBMELA: a modeling language for communicating probabilistic processes , 2004, Proceedings. Second ACM and IEEE International Conference on Formal Methods and Models for Co-Design, 2004. MEMOCODE '04..

[13]  Alexandre M. Bayen,et al.  A time-dependent Hamilton-Jacobi formulation of reachable sets for continuous dynamic games , 2005, IEEE Transactions on Automatic Control.

[14]  Paulo Tabuada,et al.  Linear Time Logic Control of Discrete-Time Linear Systems , 2006, IEEE Transactions on Automatic Control.

[15]  Steven M. LaValle,et al.  Time-optimal paths for a Dubins airplane , 2007, 2007 46th IEEE Conference on Decision and Control.

[16]  Luke Fletcher,et al.  A perception‐driven autonomous urban vehicle , 2008, J. Field Robotics.

[17]  Christel Baier,et al.  Principles of model checking , 2008 .

[18]  Luke Fletcher,et al.  A perception-driven autonomous urban vehicle , 2008 .

[19]  Emilio Frazzoli,et al.  Incremental Sampling-Based Algorithms for a Class of Pursuit-Evasion Games , 2010, WAFR.

[20]  Ufuk Topcu,et al.  Receding horizon control for temporal logic specifications , 2010, HSCC '10.

[21]  Emilio Frazzoli,et al.  Sampling-based algorithms for optimal motion planning , 2011, Int. J. Robotics Res..

[22]  Calin Belta,et al.  MDP optimal control under temporal logic constraints , 2011, IEEE Conference on Decision and Control and European Control Conference.

[23]  Thomas Sauerwald,et al.  Diameter and Broadcast Time of Random Geometric Graphs in Arbitrary Dimensions , 2011, Algorithmica.

[24]  Emilio Frazzoli,et al.  Intention-Aware Motion Planning , 2013, WAFR.

[25]  Emilio Frazzoli,et al.  Sampling-based algorithms for optimal motion planning with deterministic μ-calculus specifications , 2012, 2012 American Control Conference (ACC).

[26]  Emilio Frazzoli,et al.  Least-violating control strategy synthesis with safety rules , 2013, HSCC '13.

[27]  Emilio Frazzoli,et al.  Incremental sampling-based algorithm for minimum-violation motion planning , 2013, 52nd IEEE Conference on Decision and Control.

[28]  Emilio Frazzoli,et al.  Sampling-based optimal motion planning for non-holonomic dynamical systems , 2013, 2013 IEEE International Conference on Robotics and Automation.