Using Self-Imitation to Direct Learning

An evolutionary predecessor to observational imitation may have been self-imitation. Self-imitation is where an agent is able to learn and replicate actions it has experienced through the manipulation of its body by another. This form of imitative learning has the advantage of avoiding some of the complexities encountered in observational learning such as the correspondence problem. We investigate how a system using self-imitation can be constructed with reference to psychological models of motor control including ideomotor theory and ideas from social scaffolding seen in animals to allow us to construct a robotic control system. The system allows a human trainer to teach a robot new skills and modify existing skills. Additionally the system allows the robot to notify the trainer when it is being taught skills it already possesses. We argue that this mechanism may be the first step towards the transformation from self-imitation to observational imitation. We demonstrate the system on a physical Pioneer robot with a 5-DOF arm and pan/tilt camera which is taught using self-imitation to track and point to coloured objects

[1]  Ran,et al.  The correspondence problem , 1998 .

[2]  Robin I. M. Dunbar The thinking ape: Evolutionary origins of intelligence , 1995 .

[3]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[4]  M. Brass,et al.  Imitation: is cognitive neuroscience solving the correspondence problem? , 2005, Trends in Cognitive Sciences.

[5]  John E. Laird,et al.  Learning procedural knowledge through observation , 2001, K-CAP '01.

[6]  C. Heyes,et al.  Social learning in animals : the roots of culture , 1996 .

[7]  J. Saunders,et al.  An experimental comparison of imitation paradigms used in social robotics , 2004, RO-MAN 2004. 13th IEEE International Workshop on Robot and Human Interactive Communication (IEEE Catalog No.04TH8759).

[8]  J. Wertsch Vygotsky and the Social Formation of Mind , 1985 .

[9]  Rüdiger Dillmann,et al.  Teaching and learning of robot tasks via observation of human performance , 2004, Robotics Auton. Syst..

[10]  Chrystopher L. Nehaniv,et al.  Teaching robots by moulding behavior and scaffolding the environment , 2006, HRI '06.

[11]  S. Pattinson,et al.  Learning to fly. , 1998 .

[12]  Gillian M. Hayes,et al.  Imitation as a dual-route process featuring prediction and learning components: A biologically plaus , 2002 .

[13]  Christopher G. Atkeson,et al.  A Framework for Learning from Observation Using Primitives , 2002, RoboCup.

[14]  R. Gardner,et al.  Teaching sign language to a chimpanzee. , 1969, Science.

[15]  W. Prinz An ideomotor approach to imitation , 2005 .

[16]  T. Caro,et al.  Is There Teaching in Nonhuman Animals? , 1992, The Quarterly Review of Biology.

[17]  H. Harry Asada,et al.  Automatic program generation from teaching data for the hybrid control of robots , 1989, IEEE Trans. Robotics Autom..

[18]  Yiannis Demiris,et al.  Hierarchies of Coupled Inverse and Forward Models for Abstraction in Robot Action Planning, Recognition and Imitation , 2005 .

[19]  Walter Daelemans,et al.  TiMBL: Tilburg Memory-Based Learner , 2007 .

[20]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[21]  Chrystopher L. Nehaniv,et al.  An Approach for Programming Robots by Demonstration: Generalization Across Different Initial Configurations of Manipulated Objects , 2005, 2005 International Symposium on Computational Intelligence in Robotics and Automation.

[22]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[23]  A. Meltzoff Imitation as a mechanism of social cognition: Origins of empathy, theory of mind, and the representation of action. , 2007 .

[24]  C. Heyes Transformational and associative theories of imitation , 2002 .

[25]  Mitsuo Kawato,et al.  MOSAIC Model for Sensorimotor Learning and Control , 2001, Neural Computation.

[26]  B. R. Moore CHAPTER 12 – The Evolution of Imitative Learning , 1996 .