Autonomous framework for segmenting robot trajectories of manipulation task

In manipulation tasks, motion trajectories are characterized by a set of key phases (i.e., motion primitives). It is therefore important to learn the motion primitives embedded in such tasks from a complete demonstration. In this paper, we propose a core framework that autonomously segments motion trajectories to support the learning of motion primitives. For this purpose, a set of segmentation points is estimated using a Gaussian Mixture Model (GMM) learned after investigating the dimensional subspaces reduced by Principal Component Analysis. The segmentation points can be acquired by two alternative approaches: (1) using a geometrical interpretation of the Gaussians obtained from the learned GMM, and (2) using the weights estimated along the time component of the learned GMM. The main contribution of this paper is the autonomous estimation of the segmentation points based on the GMM learned in a reduced dimensional space. The advantages of such an estimation are as follows: (1) segmentation points without any internal parameters to be manually predefined or pretuned (according to the types of given tasks and/or motion trajectories) can be estimated from a single training data, (2) segmentation points, in which non-linear motion trajectories can be better characterized than by using the original motion trajectories, can be estimated, and (3) natural motion trajectories can be retrieved by temporally rearranging motion segments. The capability of this autonomous segmentation framework is validated by four experiments. In the first experiment, motion segments are evaluated through a comparison with a human expert using a publicly available kitchen dataset. In the second experiment, motion segments are evaluated through a comparison with an existing approach using an open hand-writing database. In the third experiment, the segmentation performance is evaluated by retrieving motion trajectories from the reorganization of motion segments. In the fourth experiment, the segmentation performance is evaluated by clustering motion segments.

[1]  Michael I. Jordan,et al.  Supervised learning from incomplete data via an EM approach , 1993, NIPS.

[2]  Sean R. Eddy,et al.  Multiple Alignment Using Hidden Markov Models , 1995, ISMB.

[3]  Michael I. Jordan,et al.  On Convergence Properties of the EM Algorithm for Gaussian Mixtures , 1996, Neural Computation.

[4]  Michael I. Jordan Learning in Graphical Models , 1999, NATO ASI Series.

[5]  Geoffrey E. Hinton,et al.  A View of the Em Algorithm that Justifies Incremental, Sparse, and other Variants , 1998, Learning in Graphical Models.

[6]  Carl E. Rasmussen,et al.  The Infinite Gaussian Mixture Model , 1999, NIPS.

[7]  Shin Ishii,et al.  On-line EM Algorithm for the Normalized Gaussian Network , 2000, Neural Computation.

[8]  Wei Zhang,et al.  EM algorithms of Gaussian mixture model and hidden Markov model , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[9]  Christopher G. Atkeson,et al.  Learning from observation using primitives , 2001, Proceedings 2001 ICRA. IEEE International Conference on Robotics and Automation (Cat. No.01CH37164).

[10]  Anil K. Jain,et al.  Unsupervised Learning of Finite Mixture Models , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  Monica N. Nicolescu,et al.  Natural methods for robot task learning: instructive demonstrations, generalization and practice , 2003, AAMAS '03.

[12]  J. Kuha AIC and BIC , 2004 .

[13]  Maja J. Mataric,et al.  Exemplar-based primitives for humanoid movement classification and control , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[14]  Aude Billard,et al.  Recognition and reproduction of gestures using a probabilistic framework combining PCA, ICA and HMM , 2005, ICML.

[15]  Jianwei Zhang,et al.  Learning of demonstrated grasping skills by stereoscopic tracking of human head configuration , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[16]  Rajesh P. N. Rao,et al.  Dynamic Imitation in a Humanoid Robot through Nonparametric Probabilistic Inference , 2006, Robotics: Science and Systems.

[17]  Nicandro Cruz-Ramírez,et al.  How Good Are the Bayesian Information Criterion and the Minimum Description Length Principle for Model Selection? A Bayesian Network Analysis , 2006, MICAI.

[18]  Tamim Asfour,et al.  Imitation Learning of Dual-Arm Manipulation Tasks in Humanoid Robots , 2006, 2006 6th IEEE-RAS International Conference on Humanoid Robots.

[19]  Pat Langley,et al.  Learning hierarchical task networks by observation , 2006, ICML.

[20]  Aude Billard,et al.  Incremental learning of gestures by imitation in a humanoid robot , 2007, 2007 2nd ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[21]  Manuela M. Veloso,et al.  Confidence-based policy learning from demonstration using Gaussian mixture models , 2007, AAMAS '07.

[22]  Maya Cakmak,et al.  To Afford or Not to Afford: A New Formalization of Affordances Toward Affordance-Based Robot Control , 2007, Adapt. Behav..

[23]  Aude Billard,et al.  On Learning, Representing, and Generalizing a Task in a Humanoid Robot , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[24]  Héctor H. González-Baños,et al.  Multi-modal Motion Planning for a Humanoid Robot Manipulation Task , 2007, ISRR.

[25]  Aude Billard,et al.  Combining Dynamical Systems control and programming by demonstration for teaching discrete bimanual coordination tasks to a humanoid robot , 2008, 2008 3rd ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[26]  Manuel Lopes,et al.  Learning Object Affordances: From Sensory--Motor Coordination to Imitation , 2008, IEEE Transactions on Robotics.

[27]  J. Klafter,et al.  Nonergodicity mimics inhomogeneity in single particle tracking. , 2008, Physical review letters.

[28]  Gianluigi Mongillo,et al.  Online Learning with Hidden Markov Models , 2008, Neural Computation.

[29]  Moritz Tenorth,et al.  The TUM Kitchen Data Set of everyday manipulation activities for motion tracking and action recognition , 2009, 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops.

[30]  ChangHwan Kim,et al.  Human-like catching motion of humanoid using Evolutionary Algorithm(EA)-based imitation learning , 2009, RO-MAN 2009 - The 18th IEEE International Symposium on Robot and Human Interactive Communication.

[31]  Dana Kulic,et al.  Online Segmentation and Clustering From Continuous Observation of Whole Body Motions , 2009, IEEE Transactions on Robotics.

[32]  Aude Billard,et al.  Imitation learning of globally stable non-linear point-to-point robot motions using nonlinear programming , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[33]  Jochen J. Steil,et al.  Human-robot interaction for learning and adaptation of object movements , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[34]  Eric L. Sauser,et al.  Tactile guidance for policy refinement and reuse , 2010, 2010 IEEE 9th International Conference on Development and Learning.

[35]  Thomas Hellström,et al.  Behavior recognition for Learning from Demonstration , 2010, 2010 IEEE International Conference on Robotics and Automation.

[36]  Luc Van Gool,et al.  2D Action Recognition Serves 3D Human Pose Estimation , 2010, ECCV.

[37]  Maxim Likhachev,et al.  Search-based planning for manipulation with motion primitives , 2010, 2010 IEEE International Conference on Robotics and Automation.

[38]  James M. Rehg,et al.  CENTRIST: A Visual Descriptor for Scene Categorization , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[39]  Stefan Schaal,et al.  Skill learning and task outcome prediction for manipulation , 2011, 2011 IEEE International Conference on Robotics and Automation.

[40]  Victor Ng-Thow-Hing,et al.  Randomized multi-modal motion planning for a humanoid robot manipulation task , 2011, Int. J. Robotics Res..

[41]  Danica Kragic,et al.  Primitive-Based Action Representation and Recognition , 2011, Adv. Robotics.

[42]  Darwin G. Caldwell,et al.  Encoding the time and space constraints of a task in explicit-duration Hidden Markov Model , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[43]  Giulio Sandini,et al.  Imitation learning of non-linear point-to-point robot motions using dirichlet processes , 2012, 2012 IEEE International Conference on Robotics and Automation.

[44]  Sang Hyoung Lee,et al.  Motivation-Based Dependable Behavior Selection Using Probabilistic Affordance , 2012, Adv. Robotics.

[45]  Maya Cakmak,et al.  Trajectories and keyframes for kinesthetic teaching: A human-robot interaction perspective , 2012, 2012 7th ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[46]  Sang Hyoung Lee,et al.  Learning basis skills by autonomous segmentation of humanoid motion trajectories , 2012, 2012 12th IEEE-RAS International Conference on Humanoid Robots (Humanoids 2012).