Random Finite Set Theory and Optimal Control for Large Spacecraft Swarms

Controlling large swarms of robotic agents has many challenges including, but not limited to, computational complexity due to the number of agents, uncertainty in the functionality of each agent in the swarm, and uncertainty in the swarm's configuration. This work generalizes the swarm state using Random Finite Set (RFS) theory and solves the control problem using model predictive control (MPC) which naturally handles the challenges. To determine more computationally efficient solutions, iterative linear quadratic regulator (ILQR) is also explored. This work uses information divergence to define the distance between swarm RFS and a desired distribution. A stochastic optimal control problem is formulated using a modified L2^2 distance. Simulation results using MPC and ILQR show that swarm intensities converge to a target destination, and the RFS control formulation can vary in the number of target destinations. ILQR also provides a more computationally efficient solution to the RFS swarm problem when compared to the MPC solution. Lastly, the RFS control solution is applied to the spacecraft relative motion problem showing the viability for this real-world scenario.

[1]  Howard D. Curtis,et al.  Orbital Mechanics for Engineering Students , 2005 .

[2]  Fu Lin,et al.  Sparse quadratic regulator , 2013, 2013 European Control Conference (ECC).

[3]  Soon-Jo Chung,et al.  INHOMOGENEOUS MARKOV CHAIN APPROACH TO PROBABILISTIC SWARM GUIDANCE ALGORITHMS , 2013 .

[4]  L. Liao,et al.  Advantages of Differential Dynamic Programming Over Newton''s Method for Discrete-time Optimal Control Problems , 1992 .

[5]  Masayuki Inaba,et al.  Robotics Research: The 16th International Symposium ISRR , 2016 .

[6]  Jan Lunze,et al.  Feedback control of large-scale systems , 1992 .

[7]  Asok Ray,et al.  Supervised Self-Organization of Homogeneous Swarms Using Ergodic Projections of Markov Chains , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[8]  Hongyan Wang,et al.  Social potential fields: A distributed behavioral control for autonomous robots , 1995, Robotics Auton. Syst..

[9]  R. Mahler Multitarget Bayes filtering via first-order multitarget moments , 2003 .

[10]  Bryce Doerr,et al.  Control of Large Swarms via Random Finite Set Theory , 2018, 2018 Annual American Control Conference (ACC).

[11]  Luca Maria Gambardella,et al.  The cooperation of swarm-bots: physical interactions in collective robotics , 2005, IEEE Robotics & Automation Magazine.

[12]  Soon-Jo Chung,et al.  Spacecraft Swarm Guidance Using a Sequence of Decentralized Convex Optimizations , 2012 .

[13]  R Bellman,et al.  On the Theory of Dynamic Programming. , 1952, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Hong Zhang,et al.  Collective Robotics: From Social Insects to Robots , 1993, Adapt. Behav..

[15]  Ba-Ngu Vo,et al.  The Cauchy-Schwarz divergence for poisson point processes , 2014, 2014 IEEE Workshop on Statistical Signal Processing (SSP).

[16]  Frank Allgöwer,et al.  An Introduction to Nonlinear Model Predictive Control , 2002 .

[17]  J.K. Hedrick,et al.  An overview of emerging results in cooperative UAV control , 2004, 2004 43rd IEEE Conference on Decision and Control (CDC) (IEEE Cat. No.04CH37601).

[18]  E. Todorov,et al.  A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems , 2005, Proceedings of the 2005, American Control Conference, 2005..

[19]  Soon-Jo Chung,et al.  Decentralized Model Predictive Control of Swarms of Spacecraft Using Sequential Convex Programming , 2013 .

[20]  Michael G. Hinchey,et al.  Towards an ASSL specification model for NASA swarm-based exploration missions , 2008, SAC '08.

[21]  Silvia Ferrari,et al.  Distributed optimal control for multi-agent trajectory optimization , 2014, Autom..

[22]  A. Plastino,et al.  On the use and abuse of Newton's second law for variable mass problems , 1992 .

[23]  Inderjit S. Dhillon,et al.  Clustering with Bregman Divergences , 2005, J. Mach. Learn. Res..

[24]  Ilya Kolmanovsky,et al.  Model Predictive Control approach for guidance of spacecraft rendezvous and proximity maneuvering , 2012 .

[25]  Soon-Jo Chung,et al.  On Development of 100-Gram-Class Spacecraft for Swarm Applications , 2016, IEEE Systems Journal.

[26]  Silvia Ferrari,et al.  A generalized reduced gradient method for the optimal control of multiscale dynamical systems , 2013, 52nd IEEE Conference on Decision and Control.

[27]  Marko Bacic,et al.  Model predictive control , 2003 .

[28]  Silvia Ferrari,et al.  Distributed Optimal Control of Multiscale Dynamical Systems: A Tutorial , 2016, IEEE Control Systems.

[29]  Saptarshi Bandyopadhyay,et al.  Probabilistic swarm guidance using optimal transport , 2014, 2014 IEEE Conference on Control Applications (CCA).

[30]  Behcet Acikmese,et al.  Velocity Field Generation for Density Control of Swarms using Heat Equation and Smoothing Kernels , 2017, 1710.00299.

[31]  Ba-Ngu Vo,et al.  The Gaussian Mixture Probability Hypothesis Density Filter , 2006, IEEE Transactions on Signal Processing.

[32]  M. The Swarm / Potential Model : Modeling Robotics Swarms with Measure-valued Recursions Associated to Random Finite Sets , 2013 .

[33]  Behçet Açikmese,et al.  A Markov chain approach to probabilistic swarm guidance , 2012, 2012 American Control Conference (ACC).

[34]  Silvia Ferrari,et al.  Distributed Optimal Control of Sensor Networks for Dynamic Target Tracking , 2018, IEEE Transactions on Control of Network Systems.

[35]  F. Y. Hadaegh,et al.  A model predictive control-based approach for spacecraft formation keeping and attitude control , 1999, Proceedings of the 1999 American Control Conference (Cat. No. 99CH36251).

[36]  Soon-Jo Chung,et al.  Swarm Assignment and Trajectory Optimization Using Variable-Swarm, Distributed Auction Assignment and Model Predictive Control , 2015 .

[37]  Radhika Nagpal,et al.  Programmable self-assembly in a thousand-robot swarm , 2014, Science.

[38]  L. Liao,et al.  Convergence in unconstrained discrete-time differential dynamic programming , 1991 .

[39]  Ba-Ngu Vo,et al.  Tracking an unknown time-varying number of speakers using TDOA measurements: a random finite set approach , 2006, IEEE Transactions on Signal Processing.