Paper information: Intrinsically Motivated Reinforcement Learning

Psychologists call behavior intrinsically motivated when it is engaged in for its own sake rather than as a step toward solving a specific problem of clear practical value. But what we learn during intrinsically motivated behavior is essential for our development as competent autonomous entities able to efficiently solve a wide range of practical problems as they arise. In this paper we present initial results from a computational study of intrinsically motivated reinforcement learning aimed at allowing artificial agents to construct and extend hierarchies of reusable skills that are needed for competent autonomy.
