An intrinsic reward for affordance exploration

In this paper, we present preliminary results demonstrating how a robot can learn environmental affordances in terms of the features that predict successful control and interaction. We extend previous work in which we proposed a learning framework that allows a robot to develop a series of hierarchical, closed-loop manipulation behaviors. Here, we examine a complementary process where the robot builds probabilistic models about the conditions under which these behaviors are likely to succeed. To accomplish this, we present an intrinsic reward function that directs the robot's exploratory behavior towards gaining confidence in these models. We demonstrate how this single intrinsic motivator can lead to artifacts of behavior such as “novelty,” “habituation,” and “surprise.” We present results using the bimanual robot Dexter, and explore these results further in simulation.

[1]  Michael O. Duff,et al.  Design for an Optimal Probe , 2003, ICML.

[2]  Benjamin Kuipers,et al.  Autonomous Development of a Grounded Object Ontology by a Learning Robot , 2007, AAAI.

[3]  J. Kagan Motives and development. , 1972, Journal of personality and social psychology.

[4]  L. Festinger,et al.  A Theory of Cognitive Dissonance , 2017 .

[5]  Stephen Hart,et al.  Intrinsically motivated hierarchical manipulation , 2008, 2008 IEEE International Conference on Robotics and Automation.

[6]  Nuttapong Chentanez,et al.  Intrinsically Motivated Learning of Hierarchical Collections of Skills , 2004 .

[7]  Andrew G. Barto,et al.  Reinforcement learning , 1998 .

[8]  Roderic A. Grupen,et al.  Learning to Coordinate Controllers - Reinforcement Learning on a Control Basis , 1997, IJCAI.

[9]  Stephen Hart,et al.  Generalization and Transfer in Robot Control , 2008 .

[10]  M. Dogar,et al.  Afford or Not to Afford : A New Formalization of Affordances Toward Affordance-Based Robot , 2007 .

[11]  Andrew G. Barto,et al.  Intrinsically Motivated Hierarchical Skill Learning in Structured Environments , 2010, IEEE Transactions on Autonomous Mental Development.

[12]  R. Shaw,et al.  Perceiving, Acting and Knowing : Toward an Ecological Psychology , 1978 .

[13]  Richard S. Sutton,et al.  Reinforcement Learning , 1992, Handbook of Machine Learning.

[14]  Giulio Sandini,et al.  Learning about objects through action - initial steps towards artificial cognition , 2003, 2003 IEEE International Conference on Robotics and Automation (Cat. No.03CH37422).

[15]  Pierre-Yves Oudeyer,et al.  How can we define intrinsic motivation , 2008 .

[16]  Bernt Schiele,et al.  Functional Object Class Detection Based on Learned Affordance Cues , 2008, ICVS.

[17]  Stephen Hart,et al.  Natural task decomposition with intrinsic potential fields , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[18]  C. L. Hull Principles of behavior : an introduction to behavior theory , 1943 .

[19]  Ashutosh Saxena,et al.  Robotic Grasping of Novel Objects using Vision , 2008, Int. J. Robotics Res..

[20]  A. Stoytchev Toward Learning the Binding Affordances of Objects : A Behavior-Grounded Approach , 2022 .

[21]  三嶋 博之 The theory of affordances , 2008 .