Utility of Turning Spot Learning under complex goal search and the limit of memory usage
[1] Peter Dayan, et al. Q-learning, 1992, Machine Learning.
[2] Michael I. Jordan, et al. MASSACHUSETTS INSTITUTE OF TECHNOLOGY ARTIFICIAL INTELLIGENCE LABORATORY and CENTER FOR BIOLOGICAL AND COMPUTATIONAL LEARNING DEPARTMENT OF BRAIN AND COGNITIVE SCIENCES, 1996.
[3] Peter Dayan, et al. Technical Note: Q-Learning, 2004, Machine Learning.
[4] Hidetomo Ichihashi, et al. Simple Reinforcement Learning for Small-Memory Agent, 2011, 2011 10th International Conference on Machine Learning and Applications and Workshops.
[5] P. Y. Glorennec, et al. Fuzzy Q-learning and dynamical fuzzy Q-learning, 1994, Proceedings of 1994 IEEE 3rd International Fuzzy Systems Conference.
[6] Csaba Szepesvári, et al. Algorithms for Reinforcement Learning, 2010, Synthesis Lectures on Artificial Intelligence and Machine Learning.
[7] Hidetomo Ichihashi, et al. Chain Form Reinforcement Learning for Small-Memory Agent, 2012.
[8] Katsuhiro Honda, et al. Moratorium Effect on Estimation Values in Simple Reinforcement Learning, 2013.