Optimizing Consensus-based Multi-target Tracking with Multiagent Rollout Control Policies

This paper considers a multiagent, connected, robotic fleet where the primary functionality of the agents is sensing. A distributed multi-sensor control strategy maximizes the value of the collective sensing capability of the fleet, using an information-driven approach. Each agent individually performs sensor processing (Kalman Filtering and Joint Probabilistic Data Association) to identify trajectories (and associated distributions). Using communication with neighbors, the agent enhances the prediction of the trajectories by a Consensus of Information approach that iteratively calculates the Kullback-Leibler average of trajectory distributions, which enables the calculation of the collective information for the fleet. The dynamics of the agents, the evolution of the identified trajectories for each agent, and the dynamics of individual observed objects are captured as a Partially Observable Markov Decision Process (POMDP). Using this POMDP and applying rollout with receding horizon control, an optimized non-myopic control policy that maximizes the collective fleet information value is synthesized. Simulations are presented for a scenario with three heterogeneous UAVs performing coordinated target tracking that illustrate the proposed methodology and compare the centralized approach with a contemporary sequential multiagent distributed decision technique.

[1]  Roberto Naldi,et al.  Information driven path planning and control for collaborative aerial robotic sensors using artificial potential functions , 2014, 2014 American Control Conference.

[2]  R. Olfati-Saber Distributed Tracking for Mobile Sensor Networks with Information-Driven Mobility , 2007, 2007 American Control Conference.

[3]  Yaakov Bar-Shalom,et al.  Sonar tracking of multiple targets using joint probabilistic data association , 1983 .

[4]  Sonia Martínez,et al.  Coverage control for mobile sensing networks , 2002, IEEE Transactions on Robotics and Automation.

[5]  Feng Zhao,et al.  Information-driven dynamic sensor collaboration , 2002, IEEE Signal Process. Mag..

[6]  Branko Ristic,et al.  Intermittent Information-Driven Multi-Agent Area-Restricted Search , 2020, Entropy.

[7]  Dimitri Bertsekas Multiagent Rollout Algorithms and Reinforcement Learning , 2019, ArXiv.

[8]  Robert J. Lang,et al.  Circle Packing for Origami Design Is Hard , 2010, ArXiv.

[9]  Feng Zhao,et al.  Scalable Information-Driven Sensor Querying and Routing for Ad Hoc Heterogeneous Sensor Networks , 2002, Int. J. High Perform. Comput. Appl..

[10]  Nils F. Sandell,et al.  Distributed data association for Multi-target tracking in sensor networks , 2008, 2008 47th IEEE Conference on Decision and Control.

[11]  Stephen P. Boyd,et al.  A scheme for robust distributed sensor fusion based on average consensus , 2005, IPSN 2005. Fourth International Symposium on Information Processing in Sensor Networks, 2005..

[12]  Kaiyu Zheng,et al.  From Pixels to Buildings: End-to-end Probabilistic Deep Networks for Large-scale Semantic Mapping , 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[13]  Stergios I. Roumeliotis,et al.  Multirobot Active Target Tracking With Combinations of Relative Observations , 2011, IEEE Transactions on Robotics.

[14]  Zhi-Quan Luo,et al.  Universal decentralized estimation in a bandwidth constrained sensor network , 2005, IEEE Transactions on Information Theory.

[15]  Edwin K. P. Chong,et al.  Simultaneous Non-Myopic Optimization of UAV Guidance and Camera Gimbal Control for Target Tracking , 2018, 2018 IEEE Conference on Control Technology and Applications (CCTA).

[16]  Liam Paull,et al.  Sensor driven online coverage planning for autonomous underwater vehicles , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[17]  Giorgio Battistelli,et al.  Kullback-Leibler average, consensus on probability densities, and distributed state estimation with guaranteed stability , 2014, Autom..

[18]  Giorgio Battistelli,et al.  Consensus-Based Linear and Nonlinear Filtering , 2015, IEEE Transactions on Automatic Control.

[19]  Petros G. Voulgaris,et al.  On optimal ℓ∞ to ℓ∞ filtering , 1995, Autom..

[20]  Edwin K. P. Chong,et al.  Sensor scheduling for target tracking: A Monte Carlo sampling approach , 2006, Digit. Signal Process..

[21]  Ba-Ngu Vo,et al.  A Consistent Metric for Performance Evaluation of Multi-Object Filters , 2008, IEEE Transactions on Signal Processing.

[22]  Gaurav S. Sukhatme,et al.  Resilience in multi-robot target tracking through reconfiguration , 2020, 2020 IEEE International Conference on Robotics and Automation (ICRA).

[23]  Dimitri P. Bertsekas,et al.  Rollout Algorithms for Discrete Optimization: A Survey , 2012 .