k-Certainty Exploration Method: An Action Selector to Identify the Environment in Reinforcement Learning
暂无分享,去创建一个
Shigenobu Kobayashi | Kazuteru Miyazaki | Masayuki Yamamura | Shigenobu Kobayashi | K. Miyazaki | M. Yamamura | S. Kobayashi
[1] John H. Holland,et al. Cognitive systems based on adaptive algorithms , 1977, SGAR.
[2] Dimitri P. Bertsekas,et al. Dynamic Programming and Stochastic Control , 1977, IEEE Transactions on Systems, Man, and Cybernetics.
[3] Donald A. Waterman,et al. Pattern-Directed Inference Systems , 1981, IEEE Transactions on Pattern Analysis and Machine Intelligence.
[4] John N. Tsitsiklis,et al. The Complexity of Markov Decision Processes , 1987, Math. Oper. Res..
[5] Gunar E. Liepins,et al. Alternatives for Classifier System Credit Assignment , 1989, IJCAI.
[6] Dana H. Ballard,et al. Active Perception and Reinforcement Learning , 1990, Neural Computation.
[7] Richard S. Sutton,et al. Reinforcement learning architectures for animats , 1991 .
[8] Sridhar Mahadevan,et al. Automatic Programming of Behavior-Based Robots Using Reinforcement Learning , 1991, Artif. Intell..
[9] Satinder Singh. Transfer of Learning by Composing Solutions of Elemental Sequential Tasks , 1992, Mach. Learn..
[10] Steven J. Bradtke,et al. Reinforcement Learning Applied to Linear Quadratic Regulation , 1992, NIPS.
[11] Paul E. Utgoff,et al. A Teaching Method for Reinforcement Learning , 1992, ML.
[12] Ming Tan,et al. Multi-Agent Reinforcement Learning: Independent versus Cooperative Agents , 1997, ICML.
[13] Jonas Karlsson,et al. Learning via task decomposition , 1993 .
[14] Long Ji Lin,et al. Scaling Up Reinforcement Learning for Robot Control , 1993, International Conference on Machine Learning.
[15] Shigenobu Kobayashi,et al. Reinforcement Learning by Stochastic Hill Climbing on Discounted Reward , 1995, ICML.
[16] Shigenobu Kobayashi,et al. l-Certainty Exploration Method: An Action Selector to Identify the Environment by an Agent : An Extension of k-Certainty Exploration Method to Stochastic MDPs , 1996 .
[17] Andrew G. Barto,et al. Reinforcement learning , 1998 .