Decentralized Ergodic Control: Distribution-Driven Sensing and Exploration for Multiagent Systems

We present a decentralized ergodic control policy for time-varying area coverage problems for multiple agents with nonlinear dynamics. Ergodic control allows us to specify distributions as objectives for area coverage problems for nonlinear robotic systems as a closed-form controller. We derive a variation to the ergodic control policy that can be used with consensus to enable a fully decentralized multiagent control policy. Examples are presented to illustrate the applicability of our method for multiagent terrain mapping as well as target localization. An analysis on ergodic policies as a Nash equilibrium is provided for game theoretic applications.

[1]  Mei Wu,et al.  Distributed SLAM Using Improved Particle Filter for Mobile Robot Localization , 2014, TheScientificWorldJournal.

[2]  T. Başar,et al.  A New Approach to Linear Filtering and Prediction Problems , 2001 .

[3]  Michael R. M. Jenkin,et al.  A taxonomy for multi-agent robotics , 1996, Auton. Robots.

[4]  Hugh F. Durrant-Whyte,et al.  Gaussian Process modeling of large scale terrain , 2009, 2009 IEEE International Conference on Robotics and Automation.

[5]  Mohamed A. Khamis,et al.  Adaptive multi-objective reinforcement learning with hybrid exploration for traffic signal control based on cooperative multi-agent framework , 2014, Eng. Appl. Artif. Intell..

[6]  Martial Hebert,et al.  Experimental Comparison of Techniques for Localization and Mapping Using a Bearing-Only Sensor , 2000, ISER.

[7]  Sourabh Bhattacharya,et al.  On the Existence of Nash Equilibrium for a Two-player Pursuit - Evasion Game with Visibility Constraints , 2008, Int. J. Robotics Res..

[8]  Wolfram Burgard,et al.  Learning predictive terrain models for legged robot locomotion , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[9]  Sonia Martínez,et al.  Coverage control for mobile sensing networks , 2002, IEEE Transactions on Robotics and Automation.

[10]  Michael M. Zavlanos,et al.  Distributed Active State Estimation With User-Specified Accuracy , 2017, IEEE Transactions on Automatic Control.

[11]  S. Shankar Sastry,et al.  Consistent Approximations for the Optimal Control of Constrained Switched Systems - Part 1: A Conceptual Algorithm , 2013, SIAM J. Control. Optim..

[12]  Anthony Tzes,et al.  Distributed coverage control for concave areas by a heterogeneous Robot-Swarm with visibility sensing constraints , 2015, Autom..

[13]  L. Armijo Minimization of functions having Lipschitz continuous first partial derivatives. , 1966 .

[14]  Jeffrey K. Uhlmann,et al.  New extension of the Kalman filter to nonlinear systems , 1997, Defense, Security, and Sensing.

[15]  John N. Tsitsiklis,et al.  Parallel and distributed computation , 1989 .

[16]  Christoph Manss,et al.  Decentralized multi-agent entropy-driven exploration under sparsity constraints , 2016, 2016 4th International Workshop on Compressed Sensing Theory and its Applications to Radar, Sonar and Remote Sensing (CoSeRa).

[17]  Todd D. Murphey,et al.  Optimal planning for target localization and coverage using range sensing , 2015, 2015 IEEE International Conference on Automation Science and Engineering (CASE).

[18]  John N. Tsitsiklis,et al.  Parallel and distributed computation , 1989 .

[19]  I. Mezić,et al.  Metrics for ergodicity and design of ergodic dynamics for multi-agent systems , 2011 .

[20]  Sung G. Lee,et al.  Multirobot Control Using Time-Varying Density Functions , 2014, IEEE Transactions on Robotics.

[21]  N.R. Malik,et al.  Graph theory with applications to engineering and computer science , 1975, Proceedings of the IEEE.

[22]  Todd D. Murphey,et al.  Ergodic Exploration of Distributed Information , 2016, IEEE Transactions on Robotics.

[23]  Mohammad Bagher Menhaj,et al.  Decentralized Coverage Control for Multi-Agent Systems with Nonlinear Dynamics , 2011, IEICE Trans. Inf. Syst..

[24]  E. Damme Refinements of the Nash Equilibrium Concept , 1983 .

[25]  Erik I. Verriest,et al.  Gradient Descent Approach to Optimal Mode Scheduling in Hybrid Dynamical Systems , 2008 .

[26]  Maja J. Mataric,et al.  Ergodic Dynamics for Large-Scale Distributed Robot Systems , 2006, UC.

[27]  Ruggero Carli,et al.  Distributed Kalman filtering based on consensus strategies , 2008, IEEE Journal on Selected Areas in Communications.

[28]  Todd D. Murphey,et al.  Trajectory optimization for continuous ergodic exploration , 2013, 2013 American Control Conference.

[29]  David Carmel,et al.  Exploration Strategies for Model-based Learning in Multi-agent Systems: Exploration Strategies , 1999, Autonomous Agents and Multi-Agent Systems.

[30]  Pratap Tokekar,et al.  Algorithms for Cooperative Active Localization of Static Targets With Mobile Bearing Sensors Under Communication Constraints , 2015, IEEE Transactions on Robotics.

[31]  Todd D. Murphey,et al.  Real-Time Area Coverage and Target Localization Using Receding-Horizon Ergodic Exploration , 2017, IEEE Transactions on Robotics.

[32]  Todd D. Murphey,et al.  Sequential Action Control: Closed-Form Optimal Control for Nonlinear and Nonsmooth Systems , 2016, IEEE Transactions on Robotics.

[33]  M. Hartmann,et al.  Ergodic Exploration Using Binary Sensing for Nonparametric Shape Estimation , 2017, IEEE Robotics and Automation Letters.

[34]  Todd Murphey,et al.  Projection-based iterative mode scheduling for switched systems , 2016 .

[35]  Magnus Egerstedt,et al.  Transition-time optimization for switched-mode dynamical systems , 2006, IEEE Transactions on Automatic Control.

[36]  Alberto Viseras Ruiz,et al.  Efficient Multi-Agent Exploration with Gaussian Processes , 2014 .

[37]  Philippe Martin,et al.  The true role of accelerometer feedback in quadrotor control , 2010, 2010 IEEE International Conference on Robotics and Automation.

[38]  Davide Spinello,et al.  Generalized non-autonomous metric optimization for area coverage problems with mobile autonomous agents , 2017, Autom..