Cooperative Multi-agent Control Using Deep Reinforcement Learning
[1] Jason Weston,et al. Curriculum learning , 2009, ICML '09.
[2] Jonathan P. How,et al. Decentralized control of partially observable Markov decision processes , 2015, 52nd IEEE Conference on Decision and Control.
[3] J. Schulman,et al. Variational Information Maximizing Exploration , 2016 .
[4] Camillo J. Taylor,et al. A vision-based formation control framework , 2002, IEEE Trans. Robotics Autom..
[5] Dorian Kodelja,et al. Multiagent cooperation and competition with deep reinforcement learning , 2015, PloS one.
[6] Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
[7] Neil Immerman,et al. The Complexity of Decentralized Control of Markov Decision Processes , 2000, UAI.
[8] Sean Luke,et al. Cooperative Multi-Agent Learning: The State of the Art , 2005, Autonomous Agents and Multi-Agent Systems.
[9] Reza Olfati-Saber,et al. Consensus and Cooperation in Networked Multi-Agent Systems , 2007, Proceedings of the IEEE.
[11] Bart De Schutter,et al. Multi-Agent Reinforcement Learning: A Survey , 2006, 2006 9th International Conference on Control, Automation, Robotics and Vision.
[12] S. Shankar Sastry,et al. Probabilistic pursuit-evasion games: theory, implementation, and experimental evaluation , 2002, IEEE Trans. Robotics Autom..
[13] Gerald Tesauro,et al. Extending Q-Learning to General Adaptive Multi-Agent Systems , 2003, NIPS.
[14] Shin Ishii,et al. Multiagent reinforcement learning applied to a chase problem in a continuous world , 2001, Artificial Life and Robotics.
[15] Joshua B. Tenenbaum,et al. Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation , 2016, NIPS.
[16] Martin Lauer,et al. An Algorithm for Distributed Reinforcement Learning in Cooperative Multi-Agent Systems , 2000, ICML.
[17] Stuart J. Russell,et al. Reinforcement Learning with Hierarchies of Machines , 1997, NIPS.
[18] Richard L. Lewis,et al. Variance-Based Rewards for Approximate Bayesian Reinforcement Learning , 2010, UAI.
[19] Yuval Tassa,et al. Continuous control with deep reinforcement learning , 2015, ICLR.
[20] Michael I. Jordan,et al. Learning Without State-Estimation in Partially Observable Markovian Decision Processes , 1994, ICML.
[21] L. E. Parker. Learning in Large Cooperative Multi-Robot Domains , 2001 .
[22] Andrew Y. Ng,et al. Policy Invariance Under Reward Transformations: Theory and Application to Reward Shaping , 1999, ICML.
[23] Sonia Martínez,et al. Coverage control for mobile sensing networks , 2002, IEEE Transactions on Robotics and Automation.
[24] Bikramjit Banerjee,et al. Sample Bounded Distributed Reinforcement Learning for Decentralized POMDPs , 2012, AAAI.
[25] Kee-Eung Kim,et al. Learning to Cooperate via Policy Search , 2000, UAI.
[26] Rob Fergus,et al. Learning Multiagent Communication with Backpropagation , 2016, NIPS.
[27] Sergey Levine,et al. Trust Region Policy Optimization , 2015, ICML.
[28] Jonathan P. How,et al. Graph-based Cross Entropy method for solving multi-robot decentralized POMDPs , 2016, 2016 IEEE International Conference on Robotics and Automation (ICRA).
[29] Long-Ji Lin,et al. Reinforcement learning for robots using neural networks , 1992 .
[30] Milos Hauskrecht,et al. Incremental Methods for Computing Bounds in Partially Observable Markov Decision Processes , 1997, AAAI/IAAI.
[31] Alex Graves,et al. Asynchronous Methods for Deep Reinforcement Learning , 2016, ICML.
[32] Sergey Levine,et al. End-to-End Training of Deep Visuomotor Policies , 2015, J. Mach. Learn. Res..
[33] Andrew Y. Ng,et al. On Local Rewards and Scaling Distributed Reinforcement Learning , 2005, NIPS.
[34] Makoto Yokoo,et al. Taming Decentralized POMDPs: Towards Efficient Policy Computation for Multiagent Settings , 2003, IJCAI.
[35] Stefano Ermon,et al. Model-Free Imitation Learning with Policy Optimization , 2016, ICML.
[36] Norihiko Ono,et al. A Modular Approach to Multi-Agent Reinforcement Learning , 1996, ECAI Workshop LDAIS / ICMAS Workshop LIOME.
[37] Andrew Y. Ng,et al. Near-Bayesian exploration in polynomial time , 2009, ICML '09.
[38] Sergey Levine,et al. High-Dimensional Continuous Control Using Generalized Advantage Estimation , 2015, ICLR.
[39] Shimon Whiteson,et al. Learning to Communicate with Deep Multi-Agent Reinforcement Learning , 2016, NIPS.
[40] Michail G. Lagoudakis,et al. Coordinated Reinforcement Learning , 2002, ICML.
[41] Karl Tuyls,et al. Evolutionary Dynamics of Multi-Agent Learning: A Survey , 2015, J. Artif. Intell. Res..