论文信息 - Probabilistic Control of Heterogeneous Swarms Subject to Graph Temporal Logic Specifications: A Decentralized and Scalable Approach

Probabilistic Control of Heterogeneous Swarms Subject to Graph Temporal Logic Specifications: A Decentralized and Scalable Approach

We develop a probabilistic control algorithm, GTLProCo, for swarms of agents with heterogeneous dynamics and objectives, subject to high-level task specifications. The resulting algorithm not only achieves decentralized control of the swarm but also significantly improves scalability over state-of-the-art existing algorithms. Specifically, we study a setting in which the agents move along the nodes of a graph, and the high-level task specifications for the swarm are expressed in a recently-proposed language called graph temporal logic (GTL). By constraining the distribution of the swarm over the nodes of the graph, GTL can specify a wide range of properties, including safety, progress, and response. GTLProCo, agnostic to the number of agents comprising the swarm, controls the density distribution of the swarm in a decentralized and probabilistic manner. To this end, it synthesizes a time-varying Markov chain modeling the time evolution of the density distribution under the GTL constraints. We first identify a subset of GTL, namely reach-avoid specifications, for which we can reduce the synthesis of such a Markov chain to either linear or semi-definite programs. Then, in the general case, we formulate the synthesis of the Markov chain as a mixed-integer nonlinear program (MINLP). We exploit the structure of the problem to provide an efficient sequential mixed-integer linear programming scheme with trust regions to solve the MINLP. We empirically demonstrate that our sequential scheme is at least three orders of magnitude faster than off-the-shelf MINLP solvers and illustrate the effectiveness of GTLProCo in several swarm scenarios.

[1] Benjamin Müller,et al. The SCIP Optimization Suite 5.0 , 2017, 2112.08872.

[2] Calin Belta,et al. A Fully Automated Framework for Control of Linear Systems from Temporal Logic Specifications , 2008, IEEE Transactions on Automatic Control.

[3] Calin Belta,et al. SpaTeL: a novel spatial-temporal logic and its applications to networked systems , 2015, HSCC.

[4] Sonia Martínez,et al. Coverage control for mobile sensing networks , 2002, IEEE Transactions on Robotics and Automation.

[5] E. Seneta. Non-negative Matrices and Markov Chains , 2008 .

[6] Ufuk Topcu,et al. Probabilistic Swarm Guidance Subject to Graph Temporal Logic Specifications , 2020, Robotics: Science and Systems.

[7] Fred Kröger,et al. Temporal Logic of Programs , 1987, EATCS Monographs on Theoretical Computer Science.

[8] Vijay Kumar,et al. Towards a swarm of agile micro quadrotors , 2012, Autonomous Robots.

[9] Ufuk Topcu,et al. Policy Synthesis for Factored MDPs with Graph Temporal Logic Specifications , 2020, AAMAS.

[10] Radhika Nagpal,et al. Programmable self-assembly in a thousand-robot swarm , 2014, Science.

[11] Zhe Xu,et al. Swarm Signal Temporal Logic Inference for Swarm Behavior Analysis , 2019, IEEE Robotics and Automation Letters.

[12] Petter Nilsson,et al. Provably-Correct Coordination of Large Collections of Agents with Counting Temporal Logic Constraints , 2017, 2017 ACM/IEEE 8th International Conference on Cyber-Physical Systems (ICCPS).

[13] Asok Ray,et al. Supervised Self-Organization of Homogeneous Swarms Using Ergodic Projections of Markov Chains , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[14] Ufuk Topcu,et al. Receding Horizon Temporal Logic Planning , 2012, IEEE Transactions on Automatic Control.

[15] Benjamin Müller,et al. The SCIP Optimization Suite 3.2 , 2016 .

[16] Behçet Açikmese,et al. Successive Convexification: A Superlinearly Convergent Algorithm for Non-convex Optimal Control Problems , 2018, 1804.06539.

[17] Stephen P. Boyd,et al. Fastest Mixing Markov Chain on a Graph , 2004, SIAM Rev..

[18] Ufuk Topcu,et al. Optimization-based trajectory generation with linear temporal logic specifications , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[19] Thomas A. Henzinger,et al. Alternating-time temporal logic , 1997, Proceedings 38th Annual Symposium on Foundations of Computer Science.

[20] Jie Chen,et al. Towards Energy Optimization: Emergent Task Allocation in a Swarm of Foraging Robots , 2007, Adapt. Behav..

[21] Ufuk Topcu,et al. Controller Synthesis for Multi-Agent Systems With Intermittent Communication. A Metric Temporal Logic Approach , 2019, 2019 57th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[22] Behçet Açikmese,et al. A Markov chain approach to probabilistic swarm guidance , 2012, 2012 American Control Conference (ACC).

[23] Nikolaos V. Sahinidis,et al. Convexification and Global Optimization in Continuous and Mixed-Integer Nonlinear Programming , 2002 .

[24] J. A. Fill. Eigenvalue bounds on convergence to stationarity for nonreversible markov chains , 1991 .

[25] Hadas Kress-Gazit,et al. Automated synthesis of decentralized controllers for robot swarms from high-level temporal logic specifications , 2020, Auton. Robots.

[26] Christel Baier,et al. Principles of Model Checking (Representation and Mind Series) , 2008 .

[27] Alessio Lomuscio,et al. Formal Verification of Opinion Formation in Swarms , 2016, AAMAS.

[28] Ufuk Topcu,et al. Correct, Reactive, High-Level Robot Control , 2011, IEEE Robotics & Automation Magazine.

[29] Calin Belta,et al. Robotic swarm control from spatio-temporal specifications , 2016, 2016 IEEE 55th Conference on Decision and Control (CDC).

[30] Elwood S. Buffa,et al. Graph Theory with Applications , 1977 .

[31] Ufuk Topcu,et al. Graph Temporal Logic Inference for Classification and Identification , 2019, 2019 IEEE 58th Conference on Decision and Control (CDC).

[32] Behçet Açikmese,et al. Decentralized probabilistic density control of autonomous swarms with safety constraints , 2015, Auton. Robots.

[33] Spring Berman,et al. Optimized Stochastic Policies for Task Allocation in Swarms of Robots , 2009, IEEE Transactions on Robotics.

[34] Yurii Nesterov,et al. Interior-point polynomial algorithms in convex programming , 1994, Siam studies in applied mathematics.

[35] A. Agung Julius,et al. Census Signal Temporal Logic Inference for Multiagent Group Behavior Analysis , 2016, IEEE Transactions on Automation Science and Engineering.

[36] Aldo Jaimes,et al. An approach to surveillance an area using swarm of fixed wing and quad-rotor unmanned aerial vehicles UAV(s) , 2008, 2008 IEEE International Conference on System of Systems Engineering.