Robotic imitation from human motion capture using Gaussian processes

Programming by demonstration, also called "imitation learning," offers the possibility of flexible, easily modifiable robotic systems. Full-fledged robotic imitation learning comprises many difficult subtasks. However, we argue that, at its core, imitation learning reduces to a regression problem. We propose a two-step framework in which an imitating agent first performs a regression from a high-dimensional observation space to a low-dimensional latent variable space. In the second step, the agent performs a regression from the latent variable space to a high-dimensional space representing degrees of freedom of its motor system. We demonstrate the validity of the approach by learning to map motion capture data from human actors to a humanoid robot. We also contrast use of several low-dimensional latent variable spaces, each covering a subset of agents' degrees of freedom, with use of a single, higher-dimensional latent variable space. Our findings suggest that compositing several regression models together yields qualitatively better imitation results than using a single, more complex regression model

[1]  Stefan Schaal,et al.  Is imitation learning the route to humanoid robots? , 1999, Trends in Cognitive Sciences.

[2]  Bernhard Schölkopf,et al.  Kernel Dependency Estimation , 2002, NIPS.

[3]  Christopher K. I. Williams Computing with Infinite Networks , 1996, NIPS.

[4]  K. Dautenhahn,et al.  The correspondence problem , 2002 .

[5]  Yasuo Kuniyoshi,et al.  Deferred imitation of human head movements by an active stereo vision head , 1997, Proceedings 6th IEEE International Workshop on Robot and Human Communication. RO-MAN'97 SENDAI.

[6]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[7]  Aaron Hertzmann,et al.  Style-based inverse kinematics , 2004, ACM Trans. Graph..

[8]  J. Tenenbaum,et al.  A global geometric framework for nonlinear dimensionality reduction. , 2000, Science.

[9]  A. O'Hagan,et al.  Curve Fitting and Optimal Design for Prediction , 1978 .

[10]  Geoffrey E. Hinton,et al.  Evaluation of Gaussian processes and other methods for non-linear regression , 1997 .

[11]  Rajesh P. N. Rao,et al.  Gaussian Process CCA for Image Synthesis and Robotic Imitation , 2005 .

[12]  Christopher K. I. Williams Computation with Infinite Neural Networks , 1998, Neural Computation.

[13]  Michael Isard,et al.  Tracking loose-limbed people , 2004, CVPR 2004.

[14]  Neil D. Lawrence,et al.  Fast Sparse Gaussian Process Methods: The Informative Vector Machine , 2002, NIPS.

[15]  Masayuki Inaba,et al.  Learning by watching: extracting reusable task knowledge from visual observation of human performance , 1994, IEEE Trans. Robotics Autom..

[16]  Illah R. Nourbakhsh,et al.  A survey of socially interactive robots , 2003, Robotics Auton. Syst..