Construction of a learning agent handling its rewards according to environmental situations
暂无分享,去创建一个
[1] Masayuki Numao,et al. Constructing an Autonomous Agent with an Interdependent Heuristics , 2000, PRICAI.
[2] S. Mikami. Cooperative reinforcement learning by Payoff filters , 1995 .
[3] Xin Yao,et al. An Experimental Study of N-Person Iterated Prisoner's Dilemma Games , 1993, Informatica.
[4] Peter Dayan,et al. Q-learning , 1992, Machine Learning.
[5] G. Hardin,et al. The Tragedy of the Commons , 1968, Green Planet Blues.
[6] Peter Dayan,et al. Technical Note: Q-Learning , 2004, Machine Learning.