Multi-target tracking for unmanned aerial vehicle swarms using deep reinforcement learning

[1]  Xia Lei,et al.  Deep Reinforcement Learning with Experience Sharing for Power Control , 2020, 2020 IEEE 20th International Conference on Communication Technology (ICCT).

[2]  Akansel Cosgun,et al.  Decentralized Multi-Agent Pursuit Using Deep Reinforcement Learning , 2020, IEEE Robotics and Automation Letters.

[3]  Liujing Wang,et al.  Joint Optimization of Multi-UAV Target Assignment and Path Planning Based on Multi-Agent Reinforcement Learning , 2019, IEEE Access.

[4]  H. Snoussi,et al.  A reinforcement learning approach for UAV target searching and tracking , 2019, Multimedia Tools and Applications.

[5]  Gerhard Neumann,et al.  Deep Reinforcement Learning for Swarm Systems , 2018, J. Mach. Learn. Res..

[6]  Mykel J. Kochenderfer,et al.  Multi-Agent Reinforcement Learning for Multi-Object Tracking , 2018, AAMAS.

[7]  Nick-Marios T. Kokolakis,et al.  Coordinated Standoff Tracking of a Ground Moving Target and the Phase Separation Problem , 2018, 2018 International Conference on Unmanned Aircraft Systems (ICUAS).

[8]  Mariette Awad,et al.  Decision Making in Multiagent Systems: A Survey , 2018, IEEE Transactions on Cognitive and Developmental Systems.

[9]  Alberto Sanfeliu,et al.  Searching and tracking people with cooperative mobile robots , 2017, Autonomous Robots.

[10]  Xiao Zhang,et al.  Autonomous navigation of UAV in large-scale unknown complex environment with deep reinforcement learning , 2017, 2017 IEEE Global Conference on Signal and Information Processing (GlobalSIP).

[11]  Gerhard Neumann,et al.  Local Communication Protocols for Learning Complex Swarm Behaviors with Deep Reinforcement Learning , 2017, ANTS Conference.

[12]  Cameron K. Peterson Dynamic grouping of cooperating vehicles using a receding horizon controller for ground target search and track missions , 2017, 2017 IEEE Conference on Control Technology and Applications (CCTA).

[13]  Yi Wu,et al.  Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments , 2017, NIPS.

[14]  Jonathan P. How,et al.  Decentralized non-communicating multiagent collision avoidance with deep reinforcement learning , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[15]  James C. Spall,et al.  Multi-agent surveillance and tracking using cyclic stochastic gradient , 2016, 2016 American Control Conference (ACC).

[16]  Honglun Wang,et al.  Cooperative path planning with applications to target tracking and obstacle avoidance for multi-UAVs , 2016 .

[17]  D. Kudenko,et al.  Potential-based reward shaping for finite horizon online POMDP planning , 2016, Autonomous Agents and Multi-Agent Systems.

[18]  Dorian Kodelja,et al.  Multiagent cooperation and competition with deep reinforcement learning , 2015, PloS one.

[19]  Toru Namerikawa,et al.  Formation control with collision avoidance for a multi-UAV system using decentralized MPC and consensus-based control , 2015, 2015 European Control Conference (ECC).

[20]  Shane Legg,et al.  Human-level control through deep reinforcement learning , 2015, Nature.

[21]  Jie Shao,et al.  Swarm robots reinforcement learning convergence accuracy-based learning classifier systems with gradient descent (XCS-GD) , 2013, Proceedings of 2013 3rd International Conference on Computer Science and Network Technology.

[22]  Olivier Buffet,et al.  Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence Optimally Solving Dec-POMDPs as Continuous-State MDPs , 2022 .

[23]  Vincent Roberge,et al.  Comparison of Parallel Genetic Algorithm and Particle Swarm Optimization for Real-Time UAV Path Planning , 2013, IEEE Transactions on Industrial Informatics.

[24]  Naomi Ehrich Leonard,et al.  Starling Flock Networks Manage Uncertainty in Consensus at Low Cost , 2013, PLoS Comput. Biol..

[25]  X. Rong Li,et al.  UAV Route Planning for Joint Search and Track Missions—An Information-Value Approach , 2012, IEEE Transactions on Aerospace and Electronic Systems.

[26]  Ganesh K. Venayagamoorthy,et al.  Bio-inspired Algorithms for Autonomous Deployment and Localization of Sensor Nodes , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[27]  Han-Lim Choi,et al.  Consensus-Based Decentralized Auctions for Robust Task Allocation , 2009, IEEE Transactions on Robotics.

[28]  G. Parisi,et al.  Interaction ruling animal collective behavior depends on topological rather than metric distance: Evidence from a field study , 2007, Proceedings of the National Academy of Sciences.

[29]  Bernhard Rinner,et al.  Cooperative Robots to Observe Moving Targets: Review , 2018, IEEE Transactions on Cybernetics.

[30]  Joarder Kamruzzaman,et al.  Search and tracking algorithms for swarms of robots: A survey , 2016, Robotics Auton. Syst..

[31]  Huang Ning A UAV Route Planning Method Based on Voronoi Diagram and Quantum Genetic Algorithm , 2013 .

[32]  Carsten Peterson,et al.  Explorations of the mean field theory learning algorithm , 1989, Neural Networks.