Bagging for Gaussian mixture regression in robot learning from demonstration

Robot learning from demonstration (LfD) emerges as a promising solution to transfer human motion to the robot. However, because of the open-loop between the learner and task constraints, the precision of the reproduction at the desired task constraints cannot always be guaranteed and the model is not robust to changes of the training data. This paper proposes a closed-loop framework of LfD based on the bagging method of Gaussian Mixture Model and Gaussian Mixture Regression (GMM/GMR) to obtain a robust learner of LfD with high precision reproduction. The original demonstration data are divided into several sub-training data, from which multiple Gaussian mixture models are developed and combined through weighted average to provide predictions. A closed-loop is built between the reproduction of the combined learner and task constraints, and the weights that satisfy task constraints are estimated in the closed-loop. The prediction uncertainty of the models is automatically eliminated by the closed-loop, therefore, the low robustness of the LfD model to the training date is overcome. In experiments, tasks of the position and velocity are both constrained in dual closed-loop. It is shown that the proposed method can significantly meet the task constraints without increasing the complexity of the algorithm.

[1]  William Derigent,et al.  Industry 4.0: contributions of holonic manufacturing control architectures and future challenges , 2020, Journal of Intelligent Manufacturing.

[2]  Darwin G. Caldwell,et al.  On improving the extrapolation capability of task-parameterized movement models , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[3]  Naonori Ueda,et al.  Deterministic annealing EM algorithm , 1998, Neural Networks.

[4]  Aude Billard,et al.  BM: An iterative algorithm to learn stable non-linear dynamical systems with Gaussian mixture models , 2010, 2010 IEEE International Conference on Robotics and Automation.

[5]  Darwin G. Caldwell,et al.  Generalized Task-Parameterized Skill Learning , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[6]  Izabela Nielsen,et al.  A methodology for implementation of mobile robot in adaptive manufacturing environments , 2017, J. Intell. Manuf..

[7]  Aude Billard,et al.  A probabilistic Programming by Demonstration framework handling constraints in joint space and task space , 2008, 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[8]  Tao Chen,et al.  Bagging for Gaussian process regression , 2009, Neurocomputing.

[9]  Aude Billard,et al.  On Learning, Representing, and Generalizing a Task in a Humanoid Robot , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[10]  Jun Ni,et al.  Assembly process optimization for reducing the dimensional error of antenna assembly with abundant rivets , 2018, J. Intell. Manuf..

[11]  Aaron F. Bobick,et al.  Parametric Hidden Markov Models for Gesture Recognition , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Jie Zhang,et al.  Inferential estimation of polymer quality using bootstrap aggregated neural networks , 1999, Neural Networks.

[13]  Flavio Prieto,et al.  Trajectory generation for robotic assembly operations using learning by demonstration , 2019, Robotics and Computer-Integrated Manufacturing.

[14]  Aude Billard,et al.  Incremental learning of gestures by imitation in a humanoid robot , 2007, 2007 2nd ACM/IEEE International Conference on Human-Robot Interaction (HRI).

[15]  Jan Peters,et al.  Using probabilistic movement primitives in robotics , 2017, Autonomous Robots.

[16]  Nasser Mozayani,et al.  Acquiring reusable skills in intrinsically motivated reinforcement learning , 2020, Journal of Intelligent Manufacturing.

[17]  Kaare Brandt Petersen,et al.  The Matrix Cookbook , 2006 .

[18]  Konrad Wegener,et al.  Optimizing human–robot task allocation using a simulation tool based on standardized work descriptions , 2020, J. Intell. Manuf..

[19]  Sylvain Calinon,et al.  A tutorial on task-parameterized movement learning and retrieval , 2016, Intell. Serv. Robotics.

[20]  Kung-Jeng Wang,et al.  Skill transfer support model based on deep learning , 2020, J. Intell. Manuf..

[21]  Darwin G. Caldwell,et al.  Kernelized movement primitives , 2017, Int. J. Robotics Res..

[22]  Affan Pervez,et al.  Learning task-parameterized dynamic movement primitives using mixture of GMMs , 2018, Intell. Serv. Robotics.

[23]  Andrea Lockerd Thomaz,et al.  Robot Learning from Human Teachers , 2014, Robot Learning from Human Teachers.

[24]  J. Friedman,et al.  On bagging and nonlinear estimation , 2007 .

[25]  Aude Billard,et al.  Active Teaching in Robot Programming by Demonstration , 2007, RO-MAN 2007 - The 16th IEEE International Symposium on Robot and Human Interactive Communication.

[26]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[27]  Chih-Yi Chiu,et al.  Content-based retrieval for human motion data , 2004, J. Vis. Commun. Image Represent..

[28]  Jun Nakanishi,et al.  Dynamical Movement Primitives: Learning Attractor Models for Motor Behaviors , 2013, Neural Computation.

[29]  Sylvain Calinon,et al.  Learning adaptive dressing assistance from human demonstration , 2017, Robotics Auton. Syst..

[30]  Brett Browning,et al.  A survey of robot learning from demonstration , 2009, Robotics Auton. Syst..

[31]  Jochen J. Steil,et al.  Open-source benchmarking for learned reaching motion generation in robotics , 2015, Paladyn J. Behav. Robotics.

[32]  Yee Mey Goh,et al.  A transformation of human operation approach to inform system design for automation , 2020, J. Intell. Manuf..