Probabilistic Control of Heterogeneous Swarms Subject to Graph Temporal Logic Specifications: A Decentralized and Scalable Approach

We develop a probabilistic control algorithm, GTLProCo, for swarms of agents with heterogeneous dynamics and objectives, subject to high-level task specifications. The resulting algorithm achieves decentralized control of the swarm and significantly improves scalability over existing state-of-the-art algorithms. Specifically, we study a setting in which the agents move along the nodes of a graph, and the high-level task specifications for the swarm are expressed in a recently proposed language called graph temporal logic (GTL). By constraining the distribution of the swarm over the nodes of the graph, GTL can specify a wide range of properties, including safety, progress, and response. GTLProCo is agnostic to the number of agents in the swarm and controls the density distribution of the swarm in a decentralized and probabilistic manner. To this end, it synthesizes a time-varying Markov chain that models the time evolution of the density distribution under the GTL constraints. We first identify a subset of GTL, namely reach-avoid specifications, for which the synthesis of such a Markov chain reduces to either a linear or a semidefinite program. Then, in the general case, we formulate the synthesis of the Markov chain as a mixed-integer nonlinear program (MINLP). We exploit the structure of the problem to provide an efficient sequential mixed-integer linear programming scheme with trust regions to solve the MINLP. We empirically demonstrate that our sequential scheme is at least three orders of magnitude faster than off-the-shelf MINLP solvers and illustrate the effectiveness of GTLProCo in several swarm scenarios.
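
The idea of controlling a swarm's density through a Markov chain can be illustrated with a minimal sketch. It is not the GTLProCo synthesis procedure: the 3-node graph, the matrix `M`, and all function names are hypothetical, and the chain is assumed to be already given rather than synthesized from a GTL specification. The sketch assumes the convention that `M[i][j]` is the probability of moving from node `j` to node `i`, so the density evolves as x(t+1) = M(t) x(t), and each agent samples its next node using only its own current node.

```python
import random

def is_column_stochastic(M, tol=1e-9):
    """Each column must be a probability distribution over next nodes."""
    n = len(M)
    return all(
        all(M[i][j] >= -tol for i in range(n))
        and abs(sum(M[i][j] for i in range(n)) - 1.0) <= tol
        for j in range(n)
    )

def respects_graph(M, adjacency):
    """Transitions may only use edges of the graph (adjacency[j][i]: j -> i)."""
    n = len(M)
    return all(M[i][j] == 0.0 or adjacency[j][i]
               for i in range(n) for j in range(n))

def propagate(M, x):
    """One step of the density evolution x(t+1) = M x(t)."""
    n = len(x)
    return [sum(M[i][j] * x[j] for j in range(n)) for i in range(n)]

def agent_step(M, node, rng):
    """Decentralized rule: an agent samples its next node from column `node`."""
    weights = [M[i][node] for i in range(len(M))]
    return rng.choices(range(len(M)), weights=weights, k=1)[0]

# Hypothetical 3-node path graph 0 - 1 - 2 with self-loops.
adjacency = [
    [True, True, False],
    [True, True, True],
    [False, True, True],
]
# Assumed (not synthesized) chain that pushes the swarm toward node 2.
M = [
    [0.5, 0.0, 0.0],
    [0.5, 0.5, 0.0],
    [0.0, 0.5, 1.0],
]
# A time-varying chain is a list of matrices, one per step; here both are M.
chain = [M, M]

x = [1.0, 0.0, 0.0]           # all of the swarm starts at node 0
densities = [x]
for M_t in chain:
    x = propagate(M_t, x)
    densities.append(x)

# The same chain drives any number of agents; each acts independently.
rng = random.Random(0)
agents = [0] * 1000
for M_t in chain:
    agents = [agent_step(M_t, a, rng) for a in agents]
empirical = [agents.count(i) / len(agents) for i in range(3)]
```

Because every agent applies the same sampling rule, the empirical distribution in `empirical` approaches the predicted density in `densities[-1]` as the number of agents grows, which is why an approach of this kind does not depend on the swarm size.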
