Robust learning from demonstrations using multidimensional SAX

Learning from demonstrations (LfD) is gaining more popularity in robotics due to its promise of providing a human-friendly technique for teaching robots new skills by robotics-naive users. The two main approaches to LfD are dynamic motor primitives (DMP) which models demonstrated motions as dynamical systems with the advantage flexibility in changing the motion's starting position, goal or speed and Gaussian Mixture Modelling/ Gaussian Mixture Regression (GMM/GMR) which represents demonstrated motions as mixtures of Gaussians with the advantage of keeping track of the correlations between different dimensions of learned motions and automatic extraction of motion variability along these dimensions. This paper introduces a third approach that relies on symbolization of demonstrated motions by extending the Symbolic Aggregate approXimation (SAX) to handle multiple dimensions of data. The proposed approach is shown through several real-world evaluations to be more resistant to confusing demonstrations that usually arise when action segmentation is automated. The paper also discusses a possible way to combine SAX based LfD withGMM/GMR in order to preserve the advantages of these two approaches while providing superior confusion resistance.

[1]  VelosoManuela,et al.  A survey of robot learning from demonstration , 2009 .

[2]  Jun Nakanishi,et al.  Learning Attractor Landscapes for Learning Motor Primitives , 2002, NIPS.

[3]  Toyoaki Nishida,et al.  Exact Discovery of Length-Range Motifs , 2014, ACIIDS.

[4]  Aghil Jafari,et al.  Hybrid force-motion control of coordinated robots interacting with unknown environments , 2014, 2014 14th International Conference on Control, Automation and Systems (ICCAS 2014).

[5]  Li Wei,et al.  Experiencing SAX: a novel symbolic representation of time series , 2007, Data Mining and Knowledge Discovery.

[6]  Pieter Abbeel,et al.  Autonomous Helicopter Aerobatics through Apprenticeship Learning , 2010, Int. J. Robotics Res..

[7]  Stefan Schaal,et al.  Learning tasks from a single demonstration , 1997, Proceedings of International Conference on Robotics and Automation.

[8]  Stefan Schaal,et al.  Is imitation learning the route to humanoid robots? , 1999, Trends in Cognitive Sciences.

[9]  Eamonn J. Keogh,et al.  An online algorithm for segmenting time series , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[10]  Yiannis Demiris,et al.  A nonparametric Bayesian approach toward robot learning by demonstration , 2012, Robotics Auton. Syst..

[11]  Toyoaki Nishida,et al.  Fluid Imitation , 2012, Int. J. Soc. Robotics.

[12]  Michael Kipp,et al.  ANVIL - a generic annotation tool for multimodal dialogue , 2001, INTERSPEECH.

[13]  Aude Billard,et al.  Donut as I do: Learning from failed demonstrations , 2011, 2011 IEEE International Conference on Robotics and Automation.

[14]  Brett Browning,et al.  A survey of robot learning from demonstration , 2009, Robotics Auton. Syst..

[15]  Scott Kuindersma,et al.  Constructing Skill Trees for Reinforcement Learning Agents from Demonstration Trajectories , 2010, NIPS.

[16]  Darwin G. Caldwell,et al.  Encoding the time and space constraints of a task in explicit-duration Hidden Markov Model , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[17]  A. Meltzoff 1 Imitation and Other Minds: The "Like Me" Hypothesis , 2005 .

[18]  Jun Morimoto,et al.  Learning parametric dynamic movement primitives from multiple demonstrations , 2011, Neural Networks.

[19]  Eamonn J. Keogh,et al.  HOT SAX: efficiently finding the most unusual time series subsequence , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[20]  Keigo Watanabe,et al.  Estimation of human behaviors based on human actions using an ANN , 2014, 2014 14th International Conference on Control, Automation and Systems (ICCAS 2014).

[21]  Aude Billard,et al.  Learning Stable Nonlinear Dynamical Systems With Gaussian Mixture Models , 2011, IEEE Transactions on Robotics.

[22]  Darwin G. Caldwell,et al.  Human-robot skills transfer interfaces for a flexible surgical robot , 2014, Comput. Methods Programs Biomed..