Paper Information - k-Certainty Exploration Method: An Action Selector to Identify the Environment in Reinforcement Learning - 字舞流文

k-Certainty Exploration Method: An Action Selector to Identify the Environment in Reinforcement Learning

Shigenobu Kobayashi | Kazuteru Miyazaki | Masayuki Yamamura

[1] John H. Holland, et al. Cognitive systems based on adaptive algorithms, 1977, SGAR.

[2] Dimitri P. Bertsekas, et al. Dynamic Programming and Stochastic Control, 1977, IEEE Transactions on Systems, Man, and Cybernetics.

[3] Donald A. Waterman, et al. Pattern-Directed Inference Systems, 1981, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4] John N. Tsitsiklis, et al. The Complexity of Markov Decision Processes, 1987, Math. Oper. Res.

[5] Gunar E. Liepins, et al. Alternatives for Classifier System Credit Assignment, 1989, IJCAI.

[6] Dana H. Ballard, et al. Active Perception and Reinforcement Learning, 1990, Neural Computation.

[7] Richard S. Sutton, et al. Reinforcement learning architectures for animats, 1991.

[8] Sridhar Mahadevan, et al. Automatic Programming of Behavior-Based Robots Using Reinforcement Learning, 1991, Artif. Intell.

[9] Satinder Singh. Transfer of Learning by Composing Solutions of Elemental Sequential Tasks, 1992, Mach. Learn.

[10] Steven J. Bradtke, et al. Reinforcement Learning Applied to Linear Quadratic Regulation, 1992, NIPS.

[11] Paul E. Utgoff, et al. A Teaching Method for Reinforcement Learning, 1992, ML.

[12] Ming Tan, et al. Multi-Agent Reinforcement Learning: Independent versus Cooperative Agents, 1993, ICML.

[13] Jonas Karlsson, et al. Learning via task decomposition, 1993.

[14] Long Ji Lin, et al. Scaling Up Reinforcement Learning for Robot Control, 1993, ICML.

[15] Shigenobu Kobayashi, et al. Reinforcement Learning by Stochastic Hill Climbing on Discounted Reward, 1995, ICML.

[16] Shigenobu Kobayashi, et al. l-Certainty Exploration Method: An Action Selector to Identify the Environment by an Agent: An Extension of k-Certainty Exploration Method to Stochastic MDPs, 1996.

[17] Andrew G. Barto, et al. Reinforcement learning, 1998.