Constructive Approach to Role-Reversal Imitation Through Unsegmented Interactions

This paper presents a novel method of a robot learning through imitation to acquire a user’s key motions automatically. The learning architecture mainly consists of three learning modules: a switching autoregressive model (SARM), a keyword extractor without a dictionary, and a keyword selection filter that references to the tutor’s reactions. Most previous research on imitation learning by autonomous robots targeted motions given to robots, were segmented into meaningful parts by the users or researchers in advance. To imitate certain behavior from continuous human motion, however, robots must find segments to be learned. To achieve this goal, the learning architecture converts a continuous time series into a discrete time series of letters using the SARM, finds meaningful segments using the keyword extractor without a dictionary, and removes less s meaningful segments from keywords using the user’s reactions. In experiments, an operator showed unsegmented motions to a robot, and reacted to the motions the robot had acquired. Results showed that this framework enabled the robot to obtain several meaningful motions that the operator hoped it would acquire.

[1]  P. Dunham,et al.  Effects of mother-infant social interactions on infants' subsequent contingency task performance. , 1990, Child development.

[2]  Kenneth Ward Church Empirical Estimates of Adaptation: The chance of Two Noriegas is closer to p/2 than p2 , 2000, COLING.

[3]  J. Stevenson The cultural origins of human cognition , 2001 .

[4]  C. Moore,et al.  Joint attention : its origins and role in development , 1995 .

[5]  Tetsuo Sawaragi,et al.  Incremental acquisition of multiple nonlinear forward models based on differentiation process of schema model , 2008, Neural Networks.

[6]  Eamonn J. Keogh,et al.  Probabilistic discovery of time series motifs , 2003, KDD '03.

[7]  Minoru Asada,et al.  Cognitive developmental robotics as a new paradigm for the design of humanoid robots , 2001, Robotics Auton. Syst..

[8]  Ran,et al.  The correspondence problem , 1998 .

[9]  Kuniaki Uehara,et al.  Discovery of Time-Series Motif from Multi-Dimensional Data Based on MDL Principle , 2005, Machine Learning.

[10]  Naoto Iwahashi,et al.  Learning object-manipulation verbs for human-robot communication , 2007, WMISI '07.

[11]  Kyoji Umemura,et al.  Related Word-pairs Extraction Without Dictionaries , 2004, LREC.

[12]  Kevin Murphy,et al.  Switching Kalman Filters , 1998 .

[13]  Jessica Lin,et al.  Finding Motifs in Time Series , 2002, KDD 2002.

[14]  Yoshihiko Nakamura,et al.  Embodied Symbol Emergence Based on Mimesis Theory , 2004, Int. J. Robotics Res..

[15]  Jun Tani,et al.  Dynamic and interactive generation of object handling behaviors by a small humanoid robot using a dynamic neural network model , 2006, Neural Networks.

[16]  Yoshihiko Nakamura,et al.  Polynomial design of the nonlinear dynamics for the brain-like information processing of whole body motion , 2002, Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292).

[17]  Gordon Cheng,et al.  Discovering optimal imitation strategies , 2004, Robotics Auton. Syst..

[18]  Tetsuya Ogata,et al.  Experience Based Imitation Using RNNPB , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[19]  Tetsuo Sawaragi,et al.  Incremental acquisition of behaviors and signs based on a reinforcement learning schemata model and a spike timing-dependent plasticity network , 2007, Adv. Robotics.