A Cognitive Model of Imitative Development in Humans and Machines

Several algorithms and models have recently been proposed for imitation learning in humans and robots. However, few proposals offer a framework for imitation learning in noisy stochastic environments where the imitator must learn and act under real-time performance constraints. We present a novel probabilistic framework for imitation learning in stochastic environments with unreliable sensors. Bayesian algorithms, based on Meltzoff and Moore's AIM hypothesis for action imitation, implement the core of an imitation learning framework. Our algorithms are computationally efficient, allowing real-time learning and imitation in an active stereo vision robotic head and on a humanoid robot. We present simulated and real-world robotics results demonstrating the viability of our approach. We conclude by advocating a research agenda that promotes interaction between cognitive and robotic studies of imitation.

[1]  Paul A. Viola,et al.  Robust Real-time Object Detection , 2001 .

[2]  R. Byrne,et al.  Priming primates: Human and otherwise , 1998, Behavioral and Brain Sciences.

[3]  Michael I. Jordan,et al.  Forward Models: Supervised Learning with a Distal Teacher , 1992, Cogn. Sci..

[4]  A. Meltzoff,et al.  Imitation of Facial and Manual Gestures by Human Neonates , 1977, Science.

[5]  Ying Wu,et al.  Wide-range, person- and illumination-insensitive head orientation estimation , 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580).

[6]  W. Prinz,et al.  The imitative mind : development, evolution, and brain bases , 2002 .

[7]  Sonia Ragir,et al.  “Language” and intelligence in monkeys and apes. Comparative developmental perspectives , 1991, International Journal of Primatology.

[8]  K. Gibson,et al.  “Language” and intelligence in monkeys and apes: Comparative developmental perspectives on ape “language” , 1990 .

[9]  Gillian M. Hayes,et al.  Imitation as a dual-route process featuring prediction and learning components: A biologically plaus , 2002 .

[10]  Leslie Pack Kaelbling,et al.  On the Complexity of Solving Markov Decision Problems , 1995, UAI.

[11]  Andrew Whiten,et al.  The Imitative Mind: The imitator's representation of the imitated: Ape and child , 2002 .

[12]  Carl E. Rasmussen,et al.  In Advances in Neural Information Processing Systems , 2011 .

[13]  Rajesh P. N. Rao,et al.  A Probabilistic Framework for Model-Based Imitation Learning , 2004 .

[14]  A. Meltzoff 1 Imitation and Other Minds: The "Like Me" Hypothesis , 2005 .

[15]  Joshua B. Tenenbaum,et al.  Bayesian models of human action understanding , 2005, NIPS.

[16]  A. Meltzoff,et al.  The importance of eyes: how infants interpret adult looking behavior. , 2002, Developmental psychology.

[17]  Aude Billard,et al.  Learning human arm movements by imitation: : Evaluation of a biologically inspired connectionist architecture , 2000, Robotics Auton. Syst..

[18]  A. Meltzoff Understanding the Intentions of Others: Re-Enactment of Intended Acts by 18-Month-Old Children. , 1995, Developmental psychology.

[19]  Chrystopher L. Nehaniv,et al.  Imitation with ALICE: learning to imitate corresponding actions across dissimilar embodiments , 2002, IEEE Trans. Syst. Man Cybern. Part A.

[20]  Matthew W. Hoffman,et al.  A probabilistic model of gaze imitation and shared attention , 2006, Neural Networks.

[21]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[22]  M. Lungarella,et al.  Beyond Gazing, Pointing, and Reaching: A Survey of Developmental Robotics , 2003 .

[23]  Y. Demiris,et al.  From motor babbling to hierarchical learning by imitation: a robot developmental pathway , 2005 .

[24]  Yiannis Demiris,et al.  Learning Forward Models for Robots , 2005, IJCAI.

[25]  Chrystopher L. Nehaniv,et al.  Imitation as a Dual-Route Process Featuring Predictive and Learning Components: A Biologically Plausible Computational Model , 2002 .

[26]  D M Wolpert,et al.  Multiple paired forward and inverse models for motor control , 1998, Neural Networks.

[27]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[28]  Mitsuo Kawato,et al.  MOSAIC Model for Sensorimotor Learning and Control , 2001, Neural Computation.

[29]  Chris L. Baker,et al.  Goal Inference as Inverse Planning , 2007 .

[30]  A. Meltzoff,et al.  Explaining Facial Imitation: A Theoretical Model. , 1997, Early development & parenting.

[31]  Yishay Mansour,et al.  On the Complexity of Policy Iteration , 1999, UAI.

[32]  Rajesh P. N. Rao,et al.  Goal-Based Imitation as Probabilistic Inference over Graphical Models , 2005, NIPS.

[33]  E. Visalberghi,et al.  “Language” and intelligence in monkeys and apes: Do monkeys ape? , 1990 .

[34]  K. Dautenhahn,et al.  The correspondence problem , 2002 .

[35]  Rajesh P. N. Rao,et al.  Imitation and Social Learning in Robots, Humans and Animals: A Bayesian model of imitation in infants and robots , 2007 .

[36]  R J HERRNSTEIN,et al.  Relative and absolute strength of response as a function of frequency of reinforcement. , 1961, Journal of the experimental analysis of behavior.

[37]  K. Dautenhahn,et al.  Do as I Do: Correspondences across Different Robotic Embodiments , 2002 .

[38]  Rajesh P. N. Rao,et al.  Towards a Real-Time Bayesian Imitation System for a Humanoid Robot , 2007, Proceedings 2007 IEEE International Conference on Robotics and Automation.

[39]  Illah R. Nourbakhsh,et al.  A survey of socially interactive robots , 2003, Robotics Auton. Syst..

[40]  A. Meltzoff The 'like me' framework for recognizing and becoming an intentional agent. , 2007, Acta psychologica.