Learning, Generation and Recognition of Motions by Reference-Point-Dependent Probabilistic Models

This paper presents a novel method for learning object manipulation such as rotating an object or placing one object on another. In this method, motions are learned using reference-point-dependent probabilistic models, which can be used for the generation and recognition of motions. The method estimates (i) the reference point, (ii) the intrinsic coordinate system type, which is the type of coordinate system intrinsic to a motion, and (iii) the probabilistic model parameters of the motion that is considered in the intrinsic coordinate system. Motion trajectories are modeled by a hidden Markov model (HMM), and an HMM-based method using static and dynamic features is used for trajectory generation. The method was evaluated in physical experiments in terms of motion generation and recognition. In the experiments, users demonstrated the manipulation of puppets and toys so that the motions could be learned. A recognition accuracy of 90% was obtained for a test set of motions performed by three subjects. Furthermore, the results showed that appropriate motions were generated even if the object placement was changed.

[1]  K. Dautenhahn,et al.  Imitation in Animals and Artifacts , 2002 .

[2]  Katsushi Ikeuchi,et al.  Modeling manipulation interactions by hidden Markov models , 2002, IEEE/RSJ International Conference on Intelligent Robots and Systems.

[3]  Naoto Iwahashi,et al.  Robots That Learn Language: Developmental Approach to Human-Machine Conversations , 2006, EELC.

[4]  Yasuharu Koike,et al.  PII: S0893-6080(96)00043-3 , 1997 .

[5]  David E. Irwin,et al.  Frames of reference in vision and language: Where is above? , 1993, Cognition.

[6]  C. Breazeal,et al.  Robots that imitate humans , 2002, Trends in Cognitive Sciences.

[7]  Katsushi Ikeuchi,et al.  Generation of a task model by integrating multiple observations of human demonstrations , 2002, Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292).

[8]  Takao Kobayashi,et al.  Human Walking Motion Synthesis with Desired Pace and Stride Length Based on HSMM , 2005, IEICE Trans. Inf. Syst..

[9]  Aude Billard,et al.  A probabilistic Programming by Demonstration framework handling constraints in joint space and task space , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[10]  Naoto Iwahashi,et al.  Motion recognition and generation by combining reference-point-dependent probabilistic models , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[11]  Stefan Schaal,et al.  http://www.jstor.org/about/terms.html. JSTOR's Terms and Conditions of Use provides, in part, that unless you have obtained , 2007 .

[12]  Christopher W. Geib,et al.  The meaning of action: a review on action recognition and mapping , 2007, Adv. Robotics.

[13]  Yoshihiko Nakamura,et al.  Embodied Symbol Emergence Based on Mimesis Theory , 2004, Int. J. Robotics Res..

[14]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[15]  David J. Fleet,et al.  Gaussian Process Dynamical Models , 2005, NIPS.

[16]  Yoshihiko Nakamura,et al.  Self-organizing Symbol Acquisition and Motion Generation based on Dynamics-based Information Processing System , 2005 .

[17]  Aude Billard,et al.  Discriminative and adaptive imitation in uni-manual and bi-manual tasks , 2006, Robotics Auton. Syst..

[18]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[19]  Eamonn J. Keogh,et al.  On the Need for Time Series Data Mining Benchmarks: A Survey and Empirical Demonstration , 2002, Data Mining and Knowledge Discovery.

[20]  K. Tokuda,et al.  Speech parameter generation from HMM using dynamic features , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[21]  Ran,et al.  The correspondence problem , 1998 .

[22]  Masayuki Inaba,et al.  Learning by watching: extracting reusable task knowledge from visual observation of human performance , 1994, IEEE Trans. Robotics Autom..

[23]  D. Roy Grounding words in perception and action: computational insights , 2005, Trends in Cognitive Sciences.

[24]  Jun Tani,et al.  Self-organization of behavioral primitives as multiple attractor dynamics: A robot experiment , 2003, IEEE Trans. Syst. Man Cybern. Part A.

[25]  Tetsunari Inamura,et al.  Geometric proto-symbol manipulation towards language-based motion pattern synthesis and recognition , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[26]  Tetsuya Ogata,et al.  Two-way translation of compound sentences and arm motions by recurrent neural networks , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[27]  Naoto Iwahashi,et al.  Learning object-manipulation verbs for human-robot communication , 2007, WMISI '07.

[28]  Terry Regier,et al.  The Human Semantic Potential: Spatial Language and Constrained Connectionism , 1996 .

[29]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.