Improving Space Representation in Multiagent Learning via Tile Coding
[1] Craig Boutilier, et al. The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems, 1998, AAAI/IAAI.
[2] Chris Watkins, et al. Learning from delayed rewards, 1989.
[3] Richard S. Sutton, et al. Generalization in Reinforcement Learning: Successful Examples Using Sparse Coarse, 1996.
[4] Richard S. Sutton, et al. Reinforcement Learning: An Introduction, 1998, IEEE Trans. Neural Networks.
[5] Nikos A. Vlassis, et al. Collaborative Multiagent Reinforcement Learning by Payoff Propagation, 2006, J. Mach. Learn. Res.
[6] S. Whiteson, et al. Adaptive Tile Coding for Value Function Approximation, 2007.
[7] Ana L. C. Bazzan, et al. Multiagent Learning on Traffic Lights Control, 2009, Multi-Agent Systems for Traffic and Transportation Engineering.
[8] Michail G. Lagoudakis, et al. Coordinated Reinforcement Learning, 2002, ICML.
[9] Peter Stone, et al. Improving Action Selection in MDP's via Knowledge Transfer, 2005, AAAI.
[10] Richard S. Sutton, et al. Introduction to Reinforcement Learning, 1998.
[11] Ana L. C. Bazzan, et al. Opportunities for multiagent systems and multiagent reinforcement learning in traffic control, 2009, Autonomous Agents and Multi-Agent Systems.