Cooperatively pursuing a target unmanned aerial vehicle by multiple unmanned aerial vehicles based on multiagent reinforcement learning
[1] Liangjun Ke,et al. Large-Scale Traffic Signal Control Using a Novel Multiagent Reinforcement Learning , 2019, IEEE Transactions on Cybernetics.
[2] Lantao Yu,et al. A Study of AI Population Dynamics with Million-agent Reinforcement Learning , 2017, AAMAS.
[3] Sergey Levine,et al. High-Dimensional Continuous Control Using Generalized Advantage Estimation , 2015, ICLR.
[4] Yi Wu,et al. Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments , 2017, NIPS.
[5] Darius Burschka,et al. Toward a Fully Autonomous UAV: Research Platform for Indoor and Outdoor Urban Search and Rescue , 2012, IEEE Robotics & Automation Magazine.
[6] Ming Zhou,et al. Mean Field Multi-Agent Reinforcement Learning , 2018, ICML.
[7] Shin Ishii,et al. Multiagent reinforcement learning applied to a chase problem in a continuous world , 2001, Artificial Life and Robotics.
[8] Jonathan P. How,et al. Increasing autonomy of UAVs , 2009, IEEE Robotics & Automation Magazine.
[9] Yang Yang,et al. Energy-efficient multi-UAV coverage deployment in UAV networks: A game-theoretic framework , 2018, China Communications.
[10] Ming Tan,et al. Multi-Agent Reinforcement Learning: Independent versus Cooperative Agents , 1997, ICML.
[11] Peter Dayan,et al. Q-learning , 1992, Machine Learning.
[12] Sergio Salazar,et al. Adaptive consensus algorithms for real‐time operation of multi‐agent systems affected by switching network events , 2017 .
[13] M. M. Flood. The Hide and Seek Game of Von Neumann , 1972 .
[14] Yishay Mansour,et al. Policy Gradient Methods for Reinforcement Learning with Function Approximation , 1999, NIPS.
[15] Sven Koenig,et al. ESP: pursuit evasion on series-parallel graphs , 2010, AAMAS.
[16] Geoffrey E. Hinton,et al. Deep Learning , 2015, Nature.
[17] Volkan Isler,et al. The role of information in the cop-robber game , 2008, Theor. Comput. Sci.
[18] Noe Casas,et al. Deep Deterministic Policy Gradient for Urban Traffic Light Control , 2017, ArXiv.
[19] Pratap Tokekar,et al. Sensor Planning for a Symbiotic UAV and UGV System for Precision Agriculture , 2016, IEEE Trans. Robotics.
[20] Dorian Kodelja,et al. Multiagent cooperation and competition with deep reinforcement learning , 2015, PLoS ONE.
[21] Sampath Kannan,et al. Randomized pursuit-evasion in a polygonal environment , 2005, IEEE Transactions on Robotics.
[22] Peng Peng,et al. Multiagent Bidirectionally-Coordinated Nets: Emergence of Human-level Coordination in Learning to Play StarCraft Combat Games , 2017, arXiv:1703.10069.
[23] Yongcan Cao,et al. Band-reconfigurable Multi-UAV-based Cooperative Remote Sensing for Real-time Water Management and Distributed Irrigation Control , 2008 .
[24] Xiaogang Wang,et al. Distributed task allocation for multiple heterogeneous UAVs based on consensus algorithm and online cooperative strategy , 2018, Aircraft Engineering and Aerospace Technology.
[25] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[26] Peter Dayan,et al. Technical Note: Q-Learning , 2004, Machine Learning.
[27] S. Shankar Sastry,et al. Probabilistic pursuit-evasion games: theory, implementation, and experimental evaluation , 2002, IEEE Trans. Robotics Autom..