Decentralized probabilistic density control of autonomous swarms with safety constraints

This paper presents a Markov chain based approach for the probabilistic density control of a large number, swarm, of autonomous agents. The proposed approach specifies the time evolution of the probabilistic density distribution by using a Markov chain, which guides the swarm to a desired steady-state distribution, while satisfying the prescribed ergodicity, motion, and safety constraints. This paper generalizes our previous results on density upper bound constraints and captures a general class of linear safety constraints that bound the flow of agents. The safety constraints are formulated as equivalent linear inequality conditions on the Markov chain matrices by using the duality theory of convex optimization which is our first contribution. With the safety constraints, we can facilitate proper low-level conflict avoidance policies to compute and execute the detailed agent state trajectories. Our second contribution is to develop (i) linear matrix inequality based offline methods, and (ii) quadratic programming based online methods that can incorporate these constraints into the Markov chain synthesis. The offline method provides a feasible solution for Markov matrix when there is no density feedback. The online method utilizes realtime estimates of the swarm density distribution to continuously update the Markov matrices to maximize the convergence rates within the problem constraints. The paper also introduces a decentralized method to compute the density estimates needed for the online synthesis method.

[1]  Behcet Acikmese,et al.  Density Control for Decentralized Autonomous Agents with Conflict Avoidance , 2014, IFAC Proceedings Volumes.

[2]  W. M. Wonham,et al.  Supervisory control of probabilistic discrete event systems , 1993, Proceedings of 36th Midwest Symposium on Circuits and Systems.

[3]  Karl Johan Åström,et al.  Optimotaxis: A Stochastic Multi-agent Optimization Procedure with Point Measurements , 2008, HSCC.

[4]  R. E. Kalman,et al.  Control System Analysis and Design Via the “Second Method” of Lyapunov: II—Discrete-Time Systems , 1960 .

[5]  Jonathan P. How,et al.  Spacecraft trajectory planning with avoidance constraints using mixed-integer linear programming , 2002 .

[6]  Martin Corless,et al.  Linear systems and control : an operator perspective , 2003 .

[7]  S. Ploen,et al.  A survey of spacecraft formation flying guidance and control (part 1): guidance , 2003, Proceedings of the 2003 American Control Conference, 2003..

[8]  John N. Tsitsiklis,et al.  Comments on "Coordination of Groups of Mobile Autonomous Agents Using Nearest Neighbor Rules" , 2007, IEEE Trans. Autom. Control..

[9]  Stephen P. Boyd,et al.  Fastest Mixing Markov Chain on a Graph , 2004, SIAM Rev..

[10]  Magnus Egerstedt,et al.  Graph Theoretic Methods in Multiagent Networks , 2010, Princeton Series in Applied Mathematics.

[11]  Emilio Frazzoli,et al.  Adaptive and Distributed Algorithms for Vehicle Routing in a Stochastic and Dynamic Environment , 2009, IEEE Transactions on Automatic Control.

[12]  Asok Ray,et al.  Supervised self-organization of large homogeneous Swarms using Ergodic Projections of Markov Chains , 2009, 2009 American Control Conference.

[13]  N. Metropolis,et al.  The Monte Carlo method. , 1949 .

[14]  J. LumelskyV.,et al.  Decentralized Motion Planning for Multiple Mobile Robots , 1997 .

[15]  Antonio Bicchi,et al.  Decentralized Cooperative Policy for Conflict Resolution in Multivehicle Systems , 2007, IEEE Transactions on Robotics.

[16]  Charles R. Johnson,et al.  Matrix analysis , 1985, Statistical Inference for Engineers and Data Scientists.

[17]  Stephen P. Boyd,et al.  ECOS: An SOCP solver for embedded systems , 2013, 2013 European Control Conference (ECC).

[18]  Behçet Açikmese,et al.  Finite-horizon controllability and reachability for deterministic and stochastic linear control systems with convex constraints , 2014, 2014 American Control Conference.

[19]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[20]  Kai Lai Chung,et al.  A Course in Probability Theory , 1949 .

[21]  Vladimir J. Lumelsky,et al.  Decentralized Motion Planning for Multiple Mobile Robots: The Cocktail Party Model , 1997, Auton. Robots.

[22]  Jie Lin,et al.  Coordination of groups of mobile autonomous agents using nearest neighbor rules , 2003, IEEE Trans. Autom. Control..

[23]  Stephen P. Boyd,et al.  Fastest Mixing Markov Chain on Graphs with Symmetries , 2007, SIAM J. Optim..

[24]  Å. Lindahl Convexity and Optimization , 2015 .

[25]  Spring Berman,et al.  Biologically inspired redistribution of a swarm of robots among multiple sites , 2008, Swarm Intelligence.

[26]  Stephen P. Boyd,et al.  Real-Time Convex Optimization in Signal Processing , 2010, IEEE Signal Processing Magazine.

[27]  Thomas A. Henzinger,et al.  Hybrid Systems: Computation and Control , 1998, Lecture Notes in Computer Science.

[28]  L. Berkovitz Convexity and Optimization in Rn , 2001 .

[29]  Kim-Chuan Toh,et al.  Solving semidefinite-quadratic-linear programs using SDPT3 , 2003, Math. Program..

[30]  N.R. Malik,et al.  Graph theory with applications to engineering and computer science , 1975, Proceedings of the IEEE.

[31]  Tanja Neumann Convexity And Optimization In R N , 2016 .

[32]  M. Ani Hsieh,et al.  Decentralized controllers for shape generation with robotic swarms , 2008, Robotica.

[33]  Jonathan P. How,et al.  Co‐ordination and control of distributed spacecraft systems using convex optimization techniques , 2002 .

[34]  Charles R. Johnson,et al.  Topics in Matrix Analysis , 1991 .

[35]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[37]  Antoine Girard,et al.  Zonotope/Hyperplane Intersection for Hybrid Systems Reachability Analysis , 2008, HSCC.

[38]  J. Geromel,et al.  A new discrete-time robust stability condition , 1999 .

[39]  M. Fiedler Special matrices and their applications in numerical mathematics , 1986 .

[40]  Karl Johan Åström,et al.  Optimotaxis: A Stochastic Multi-agent on Site Optimization Procedure , 2008 .

[41]  Sekhar Tangirala,et al.  Controlled Markov chains with safety upper bound , 2003, IEEE Trans. Autom. Control..

[42]  Johan Löfberg,et al.  YALMIP : a toolbox for modeling and optimization in MATLAB , 2004 .

[43]  Emilio Frazzoli,et al.  Distributed control of spacecraft formation via cyclic pursuit: Theory and experiments , 2009, 2009 American Control Conference.

[44]  Ilya V. Kolmanovsky,et al.  Predictive energy management of a power-split hybrid electric vehicle , 2009, 2009 American Control Conference.

[45]  Mehran Mesbahi,et al.  Multiple-Spacecraft Reconfiguration Through Collision Avoidance, Bouncing, and Stalemate , 2004 .

[46]  Emilio Frazzoli,et al.  On synchronous robotic networks Part II: Time complexity of rendezvous and deployment algorithms , 2005, Proceedings of the 44th IEEE Conference on Decision and Control.

[47]  E. Yaz Linear Matrix Inequalities In System And Control Theory , 1998, Proceedings of the IEEE.

[48]  Behçet Açikmese,et al.  Convex Necessary and Sufficient Conditions for Density Safety Constraints in Markov Chain Synthesis , 2015, IEEE Transactions on Automatic Control.

[49]  F. Bullo,et al.  Motion Coordination with Distributed Information , 2007 .

[50]  Emilio Frazzoli,et al.  Distributed and Adaptive Algorithms for Vehicle Routing in a Stochastic and Dynamic Environment , 2009, ArXiv.

[51]  Sonia Martínez,et al.  Coverage control for mobile sensing networks , 2002, IEEE Transactions on Robotics and Automation.

[52]  Jiming Peng,et al.  Self-regularity - a new paradigm for primal-dual interior-point algorithms , 2002, Princeton series in applied mathematics.

[53]  Narsingh Deo,et al.  Graph Theory with Applications to Engineering and Computer Science , 1975, Networks.

[54]  M. Fiedler Algebraic connectivity of graphs , 1973 .

[55]  Emilio Frazzoli,et al.  Decentralized Policies for Geometric Pattern Formation and Path Coverage , 2007 .

[56]  Johan Efberg,et al.  YALMIP : A toolbox for modeling and optimization in MATLAB , 2004 .

[57]  Stephen P. Boyd,et al.  CVXGEN: a code generator for embedded convex optimization , 2011, Optimization and Engineering.

[58]  Behçet Açikmese,et al.  Convex optimization formulation of density upper bound constraints in Markov chain synthesis , 2014, 2014 American Control Conference.

[59]  David S. Bayard,et al.  Markov Chain Approach to Probabilistic Guidance for Swarms of Autonomous Agents , 2015 .

[60]  Robert J. Plemmons,et al.  Nonnegative Matrices in the Mathematical Sciences , 1979, Classics in Applied Mathematics.

[61]  Stephen P. Boyd,et al.  Linear Matrix Inequalities in Systems and Control Theory , 1994 .

[62]  Jing Zhang,et al.  Automated Custom Code Generation for Embedded, Real-time Second Order Cone Programming , 2014 .

[63]  Fred Y. Hadaegh,et al.  Formulation and analysis of stability for spacecraft formations , 2007 .

[64]  Ian M. Mitchell,et al.  Lagrangian methods for approximating the viability kernel in high-dimensional systems , 2013, Autom..

[65]  Jason R. Marden,et al.  Autonomous Vehicle-Target Assignment: A Game-Theoretical Formulation , 2007 .

[66]  Behçet Açikmese,et al.  A Markov chain approach to probabilistic swarm guidance , 2012, 2012 American Control Conference (ACC).

[67]  Fred Y. Hadaegh,et al.  Guidance and Control of Formation Flying Spacecraft , 2009 .

[68]  R. Kumar,et al.  On optimal control of Markov chains with safety constraint , 2006, 2006 American Control Conference.

[69]  John T. Wen,et al.  Adaptive design for reference velocity recovery in motion coordination , 2008, Syst. Control. Lett..

[70]  Chee Khiang Pang,et al.  Special Issue on “Distributed and Networked Control Systems” , 2015 .