On learning the statistical representation of a task and generalizing it to various contexts

This paper presents an architecture for solving generically the problem of extracting the constraints of a given task in a programming by demonstration framework and the problem of generalizing the acquired knowledge to various contexts. We validate the architecture in a series of experiments, where a human demonstrator teaches a humanoid robot simple manipulatory tasks. First, the combined joint angles and hand path motions are projected into a generic latent space, composed of a mixture of Gaussians (GMM) spreading across the spatial dimensions of the motion. Second, the temporal variation of the latent representation of the motion is encoded in a hidden Markov model (HMM). This two-step probabilistic encoding provides a measure of the spatio-temporal correlations across the different modalities collected by the robot, which determines a metric of imitation performance. A generalization of the demonstrated trajectories is then performed using Gaussian mixture regression (GMR). Finally, to generalize skills across contexts, we compute formally the trajectory that optimizes the metric, given the new context and the robot's specific body constraints

[1]  H. Sung Gaussian Mixture Regression and Classification , 2004 .

[2]  Aude Billard,et al.  Recognition and reproduction of gestures using a probabilistic framework combining PCA, ICA and HMM , 2005, ICML.

[3]  Richard A. Volz,et al.  Acquiring robust, force-based assembly skills from human demonstration , 2000, IEEE Trans. Robotics Autom..

[4]  Gordon Cheng,et al.  Discovering optimal imitation strategies , 2004, Robotics Auton. Syst..

[5]  Stefan Schaal,et al.  Learning inverse kinematics , 2001, Proceedings 2001 IEEE/RSJ International Conference on Intelligent Robots and Systems. Expanding the Societal Role of Robotics in the the Next Millennium (Cat. No.01CH37180).

[6]  Ales Ude,et al.  Programming full-body movements for humanoid robots by observation , 2004, Robotics Auton. Syst..

[7]  Masayuki Inaba,et al.  Intent imitation using wearable motion capturing system with on-line teaching of task attention , 2005, 5th IEEE-RAS International Conference on Humanoid Robots, 2005..

[8]  Jun Nakanishi,et al.  Learning Attractor Landscapes for Learning Motor Primitives , 2002, NIPS.

[9]  A. Liegeois,et al.  Automatic supervisory control of the configuration and behavior of multi-body mechanisms , 1977 .

[10]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[11]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[12]  Helge J. Ritter,et al.  Situated robot learning for multi-modal instruction and imitation of grasping , 2004, Robotics Auton. Syst..

[13]  Jakob J. Verbeek,et al.  Mixture models for clustering and dimension reduction , 2004 .

[14]  Tamim Asfour,et al.  Human-like motion of a humanoid robot arm based on a closed-form solution of the inverse kinematics problem , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).

[15]  Stefan Schaal,et al.  Inverse kinematics for humanoid robots , 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065).

[16]  Jean Pierre Verriest,et al.  A geometric algorithm to predict the arm reach posture for computer‐aided ergonomic evaluation , 1998 .

[17]  Rüdiger Dillmann,et al.  Building elementary robot skills from human demonstration , 1996, Proceedings of IEEE International Conference on Robotics and Automation.

[18]  Manuele Bicego,et al.  Investigating hidden Markov models' capabilities in 2D shape classification , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Mohammed Yeasin,et al.  Toward automatic robot programming: learning human skill from visual data , 2000, IEEE Trans. Syst. Man Cybern. Part B.

[20]  Susumu Tachi,et al.  A modular neural network architecture for inverse kinematics model learning , 2001, Neurocomputing.

[21]  Rüdiger Dillmann,et al.  Towards Cognitive Robots: Building Hierarchical Task Representations of Manipulations from Human Demonstration , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.

[22]  Roland Siegwart,et al.  Robot learning from demonstration , 2004, Robotics Auton. Syst..

[23]  Rajesh P. N. Rao,et al.  Robotic imitation from human motion capture using Gaussian processes , 2005, 5th IEEE-RAS International Conference on Humanoid Robots, 2005..

[24]  Kerstin Dautenhahn,et al.  Of hummingbirds and helicopters: An algebraic framework for interdisciplinary studies of imitation a , 2000 .

[25]  Jianwei Zhang,et al.  Self-valuing learning and generalization with application in visually guided grasping of complex objects , 2004, Robotics Auton. Syst..

[26]  Chrystopher L. Nehaniv,et al.  An Approach for Programming Robots by Demonstration: Generalization Across Different Initial Configurations of Manipulated Objects , 2005, 2005 International Symposium on Computational Intelligence in Robotics and Automation.

[27]  Aude Billard,et al.  Goal-Directed Imitation in a Humanoid Robot , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.