Keepaway Soccer: From Machine Learning Testbed to Benchmark
Peter Stone | Matthew E. Taylor | Yaxin Liu | Gregory Kuhlmann
[1] Peter Stone, et al. Keepaway Soccer: A Machine Learning Testbed, 2001, RoboCup.
[2] Peter Stone, et al. Progress in Learning 3 vs. 2 Keepaway, 2003, RoboCup.
[3] Ian Frank, et al. Soccer Server: A Tool for Research on Multiagent Systems, 1998, Appl. Artif. Intell.
[4] Michael O. Duff, et al. Reinforcement Learning Methods for Continuous-Time Markov Decision Problems, 1994, NIPS.
[5] Martin L. Puterman, et al. Markov Decision Processes: Discrete Stochastic Dynamic Programming, 1994.
[6] Peter Stone, et al. Reinforcement Learning for RoboCup Soccer Keepaway, 2005, Adapt. Behav.
[7] Risto Miikkulainen, et al. Evolving Soccer Keepaway Players Through Task Decomposition, 2005, Machine Learning.
[8] Gerald Tesauro, et al. TD-Gammon, a Self-Teaching Backgammon Program, Achieves Master-Level Play, 1994, Neural Computation.
[9] R. Lyndon While, et al. Learning In RoboCup Keepaway Using Evolutionary Algorithms, 2002, GECCO.
[10] Richard S. Sutton, et al. Reinforcement Learning: An Introduction, 1998, IEEE Trans. Neural Networks.
[11] Peter Stone, et al. Behavior transfer for value-function-based reinforcement learning, 2005, AAMAS '05.
[12] Mahesan Niranjan, et al. On-line Q-learning using connectionist systems, 1994.
[13] Richard S. Sutton, et al. Introduction to Reinforcement Learning, 1998.
[14] Jelle R. Kok, et al. The Incremental Development of a Synthetic Multi-Agent System: The UvA Trilearn 2001 Robotic Soccer Simulation Team, 2002.
[15] James S. Albus, et al. Brains, behavior, and robotics, 1981.
[16] Catherine Blake, et al. UCI Repository of machine learning databases, 1998.
[17] Risto Miikkulainen, et al. Evolving Keepaway Soccer Players through Task Decomposition, 2003, GECCO.
[18] Steven M. Gustafson, et al. Genetic Programming And Multi-agent Layered Learning By Reinforcements, 2002, GECCO.
[19] Andrew G. Barto, et al. Improving Elevator Performance Using Reinforcement Learning, 1995, NIPS.
[20] Trevor Walker. Relational Reinforcement Learning via Sampling the Space of First-Order Conjunctive Features, 2004.