[1] Sandip Sen, et al. Learning to Coordinate without Sharing Information, 1994, AAAI.
[2] Jeffrey M. Bradshaw, et al. Ten Challenges for Making Automation a "Team Player" in Joint Human-Agent Activity, 2004, IEEE Intell. Syst.
[3] Guillaume J. Laurent, et al. Independent reinforcement learners in cooperative Markov games: a survey regarding coordination problems, 2012, The Knowledge Engineering Review.
[4] Craig Boutilier, et al. The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems, 1998, AAAI/IAAI.
[5] Christopher D. Wickens, et al. A model for types and levels of human interaction with automation, 2000, IEEE Trans. Syst. Man Cybern. Part A.
[6] George Sugihara, et al. Detecting Causality in Complex Ecosystems, 2012, Science.
[7] Raja Parasuraman, et al. Effects of Imperfect Automation on Decision Making in a Simulated Command and Control Task, 2007, Hum. Factors.
[8] Wojciech Zaremba, et al. OpenAI Gym, 2016, ArXiv.
[9] Shane Legg, et al. Human-level control through deep reinforcement learning, 2015, Nature.
[10] Shimon Whiteson, et al. Counterfactual Multi-Agent Policy Gradients, 2017, AAAI.
[11] Yi Wu, et al. Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments, 2017, NIPS.
[12] Guillaume J. Laurent, et al. Hysteretic Q-learning: an algorithm for decentralized reinforcement learning in cooperative multi-agent teams, 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.
[13] Martin Lauer, et al. An Algorithm for Distributed Reinforcement Learning in Cooperative Multi-Agent Systems, 2000, ICML.