Distributed learning and cooperative control for multi-agent systems

This paper presents an algorithm and analysis of distributed learning and cooperative control for a multi-agent system so that a global goal of the overall system can be achieved by locally acting agents. We consider a resource-constrained multi-agent system, in which each agent has limited capabilities in terms of sensing, computation, and communication. The proposed algorithm is executed by each agent independently to estimate an unknown field of interest from noisy measurements and to coordinate multiple agents in a distributed manner to discover peaks of the unknown field. Each mobile agent maintains its own local estimate of the field and updates the estimate using collective measurements from itself and nearby agents. Each agent then moves towards peaks of the field using the gradient of its estimated field while avoiding collision and maintaining communication connectivity. The proposed algorithm is based on a recursive spatial estimation of an unknown field. We show that the closed-loop dynamics of the proposed multi-agent system can be transformed into a form of a stochastic approximation algorithm and prove its convergence using Ljung's ordinary differential equation (ODE) approach. We also present extensive simulation results supporting our theoretical results.

[1]  R. Olfati-Saber,et al.  Consensus Filters for Sensor Networks and Distributed Sensor Fusion , 2005, Proceedings of the 44th IEEE Conference on Decision and Control.

[2]  Joonho Lee,et al.  Biologically-inspired navigation strategies for swarm intelligence using spatial Gaussian processes , 2008 .

[3]  Linda Brus,et al.  Recursive black-box identification of nonlinear state-space ODE models , 2006 .

[4]  T. Wigren Convergence analysis of recursive identification algorithms based on the nonlinear Wiener model , 1994, IEEE Trans. Autom. Control..

[5]  Craig W. Reynolds Flocks, herds, and schools: a distributed behavioral model , 1987, SIGGRAPH.

[6]  Reza Olfati-Saber,et al.  Flocking for multi-agent dynamic systems: algorithms and theory , 2006, IEEE Transactions on Automatic Control.

[7]  Robert Haining,et al.  Statistics for spatial data: by Noel Cressie, 1991, John Wiley & Sons, New York, 900 p., ISBN 0-471-84336-9, US $89.95 , 1993 .

[8]  D. Grünbaum Schooling as a strategy for taxis in a noisy environment , 1998, Evolutionary Ecology.

[9]  Gaurav S. Sukhatme,et al.  Connecting the Physical World with Pervasive Networks , 2002, IEEE Pervasive Comput..

[10]  Lennart Ljung,et al.  Theorems for the Asymptotic Analysis of Recursive Stochastic Algorithms , 1975 .

[11]  Sonia Martínez,et al.  Distributed Interpolation Schemes for Field Estimation by Mobile Sensor Networks , 2010, IEEE Transactions on Control Systems Technology.

[12]  Jorge Cortés,et al.  Asymptotic Optimality of Multicenter Voronoi Configurations for Random Field Estimation , 2009, IEEE Transactions on Automatic Control.

[13]  Jie Lin,et al.  Coordination of groups of mobile autonomous agents using nearest neighbor rules , 2003, IEEE Trans. Autom. Control..

[14]  Mani Srivastava,et al.  Overview of sensor networks , 2004 .

[15]  George J. Pappas,et al.  Stability of Flocking Motion , 2003 .

[16]  Iain Murray,et al.  Introduction to Gaussian Processes , 2008 .

[17]  Mark G. Terwilliger,et al.  Overview of Sensor Networks , 2004 .

[18]  Jorge Cortés,et al.  Distributed Kriged Kalman Filter for Spatial Estimation , 2009, IEEE Transactions on Automatic Control.

[19]  Joonho Lee,et al.  Swarm intelligence for achieving the global maximum using spatio-temporal Gaussian processes , 2008, 2008 American Control Conference.

[20]  Jongeun Choi,et al.  Cooperatively learning mobile agents for gradient climbing , 2007, 2007 46th IEEE Conference on Decision and Control.

[21]  Gaurav S. Sukhatme,et al.  Bacterium-inspired robots for environmental monitoring , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[22]  J. Adler Chemotaxis in Bacteria , 1966, Science.

[23]  Petter Ögren,et al.  Cooperative control of mobile sensor networks:Adaptive gradient climbing in a distributed environment , 2004, IEEE Transactions on Automatic Control.

[24]  Barbara Webb,et al.  Swarm Intelligence: From Natural to Artificial Systems , 2002, Connect. Sci..

[25]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[26]  S. Shankar Sastry,et al.  Tracking and Coordination of Multiple Agents Using Sensor Networks: System Design, Algorithms and Experiments , 2007, Proceedings of the IEEE.

[27]  Sonia Martínez,et al.  Coverage control for mobile sensing networks , 2002, IEEE Transactions on Robotics and Automation.

[28]  Gordon F. Royle,et al.  Algebraic Graph Theory , 2001, Graduate texts in mathematics.

[29]  Tamer Basar,et al.  Analysis of Recursive Stochastic Algorithms , 2001 .

[30]  I. H. Öğüş,et al.  NATO ASI Series , 1997 .

[31]  P. Kumar,et al.  Theory and practice of recursive identification , 1985, IEEE Transactions on Automatic Control.

[32]  Randal W. Beard,et al.  Consensus seeking in multiagent systems under dynamically changing interaction topologies , 2005, IEEE Transactions on Automatic Control.

[33]  Carlos S. Kubrusly,et al.  Stochastic approximation algorithms and applications , 1973, CDC 1973.

[34]  Gaurav S. Sukhatme,et al.  Mobile Robot Sensing for Environmental Applications , 2007, FSR.

[35]  Karl Johan Åström,et al.  Adaptive Control , 1989, Embedded Digital Control with Microcontrollers.