Development of a reinforcement learning system to play Othello
[1] Shigenobu Kobayashi, et al. Rationality of Reward Sharing in Multi-agent Reinforcement Learning, 1999, PRIMA.
[2] Shigenobu Kobayashi, et al. Reinforcement Learning for Penalty Avoiding Policy Making, 2000, SMC 2000 Conference Proceedings. 2000 IEEE International Conference on Systems, Man and Cybernetics. 'Cybernetics Evolving to Systems, Humans, Organizations, and their Complex Interactions' (cat. no.0.
[3] Shigenobu Kobayashi, et al. k-Certainty Exploration Method: An Action Selector to Identify the Environment in Reinforcement Learning, 1997, Artif. Intell.
[4] Peter Dayan, et al. Technical Note: Q-Learning, 2004, Machine Learning.
[5] Richard S. Sutton, et al. Reinforcement Learning: An Introduction, 1998, IEEE Trans. Neural Networks.