Distributed Coverage Control by Robot Networks in Unknown Environments Using a Modified EM Algorithm

In this paper, we study a distributed control algorithm for covering an unknown area with a network of robots. The coverage objective is to locate a set of targets in the area while minimizing the robots' energy consumption. The robots have no prior knowledge of either the number or the locations of the targets. One efficient way to compensate for this lack of knowledge is to incorporate an auxiliary learning algorithm into the control scheme, which allows the robots to explore the unknown environment and gradually learn what they need to know about it. The control algorithm itself is modeled using game theory: the robots in the network use their collective information to play a non-cooperative potential game. The algorithm is tested in simulations to verify its performance and adaptability.

Keywords: Distributed control, game theory, multi-agent learning, reinforcement learning.
