k-Certainty Exploration Method: An Action Selector to Identify the Environment in Reinforcement Learning

[1]  John H. Holland,et al.  Cognitive systems based on adaptive algorithms , 1977, SGAR.

[2]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Stochastic Control , 1977, IEEE Transactions on Systems, Man, and Cybernetics.

[3]  Donald A. Waterman,et al.  Pattern-Directed Inference Systems , 1981, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  John N. Tsitsiklis,et al.  The Complexity of Markov Decision Processes , 1987, Math. Oper. Res..

[5]  Gunar E. Liepins,et al.  Alternatives for Classifier System Credit Assignment , 1989, IJCAI.

[6]  Dana H. Ballard,et al.  Active Perception and Reinforcement Learning , 1990, Neural Computation.

[7]  Richard S. Sutton,et al.  Reinforcement learning architectures for animats , 1991 .

[8]  Sridhar Mahadevan,et al.  Automatic Programming of Behavior-Based Robots Using Reinforcement Learning , 1991, Artif. Intell..

[9]  Satinder Singh Transfer of Learning by Composing Solutions of Elemental Sequential Tasks , 1992, Mach. Learn..

[10]  Steven J. Bradtke,et al.  Reinforcement Learning Applied to Linear Quadratic Regulation , 1992, NIPS.

[11]  Paul E. Utgoff,et al.  A Teaching Method for Reinforcement Learning , 1992, ML.

[12]  Ming Tan,et al.  Multi-Agent Reinforcement Learning: Independent versus Cooperative Agents , 1997, ICML.

[13]  Jonas Karlsson,et al.  Learning via task decomposition , 1993 .

[14]  Long Ji Lin,et al.  Scaling Up Reinforcement Learning for Robot Control , 1993, International Conference on Machine Learning.

[15]  Shigenobu Kobayashi,et al.  Reinforcement Learning by Stochastic Hill Climbing on Discounted Reward , 1995, ICML.

[16]  Shigenobu Kobayashi,et al.  l-Certainty Exploration Method: An Action Selector to Identify the Environment by an Agent : An Extension of k-Certainty Exploration Method to Stochastic MDPs , 1996 .

[17]  Andrew G. Barto,et al.  Reinforcement learning , 1998 .