Robot Cooperative Behavior Learning Using Single-Shot Learning From Demonstration and Parallel Hidden Markov Models

For robots to become collaborative assistants, they need to be capable of naturally interacting with users in real environments. They also need to be able to learn new skills from non-expert users. In this letter, we present a novel parallel hidden Markov model (PaHMM) architecture for learning from demonstration (LfD), which allows a robot to learn a sequence of cooperative and non-cooperative behaviors from a single demonstration (single-shot) of a task-based human–robot interaction from two interacting teachers. During teaching, the robot learns both a human–robot interaction model and an object interaction model in order to be able to effectively determine its own behaviors. Experiments with a Baxter robot and several teachers were conducted to validate the ability of the robot to learn both cooperative and non-cooperative behaviors during a task-based interaction. Comparison experiments also show the robustness of our approach to spatial variations from the demonstrated behaviors and tracking errors when compared to other approaches.

[1]  Brett Browning,et al.  A survey of robot learning from demonstration , 2009, Robotics Auton. Syst..

[2]  Satoshi Endo,et al.  Implementation and experimental validation of Dynamic Movement Primitives for object handover , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[3]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[4]  Satoshi Endo,et al.  Experimental testing of the CogLaboration prototype system for fluent Human-Robot object handover interactions , 2014, The 23rd IEEE International Symposium on Robot and Human Interactive Communication.

[5]  Carme Torras,et al.  A robot learning from demonstration framework to perform force-based manipulation tasks , 2013, Intelligent Service Robotics.

[6]  Satoshi Endo,et al.  Dynamic Movement Primitives for Human-Robot interaction: Comparison with human behavioral observation , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[7]  Sylvain Calinon,et al.  A tutorial on task-parameterized movement learning and retrieval , 2016, Intell. Serv. Robotics.

[8]  Oliver Kroemer,et al.  Probabilistic movement primitives for coordination of multiple human–robot collaborative tasks , 2017, Auton. Robots.

[9]  Andrea Lockerd Thomaz,et al.  Simultaneously learning actions and goals from demonstration , 2016, Auton. Robots.

[10]  Markus Vincze,et al.  Towards a Robot for Supporting Older People to Stay Longer Independent at Home , 2014, ISR 2014.

[11]  Goldie Nejat,et al.  A learning from demonstration system architecture for robots learning social group recreational activities , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[12]  Feng Duan,et al.  Human-robot collaboration in cellular manufacturing: Design and development , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[13]  Hynek Hermansky,et al.  Towards subband-based speech recognition , 1996, 1996 8th European Signal Processing Conference (EUSIPCO 1996).

[14]  Heni Ben Amor,et al.  A system for learning continuous human-robot interactions from human-human demonstrations , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[15]  Misha Pavel,et al.  Towards ASR on partially corrupted speech , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[16]  Jun Tani,et al.  On-line Imitative Interaction with a Humanoid Robot Using a Dynamic Neural Network Model of a Mirror System , 2004, Adapt. Behav..

[17]  Bernhard Schölkopf,et al.  Probabilistic movement modeling for intention inference in human–robot interaction , 2013, Int. J. Robotics Res..

[18]  Katsu Yamane,et al.  Primitive communication based on motion recognition and generation with hierarchical mimesis model , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[19]  Yoshihiko Nakamura,et al.  Mimetic Communication Model with Compliant Physical Contact in Human—Humanoid Interaction , 2010, Int. J. Robotics Res..

[20]  C. Perez,et al.  Development of a robotic scrub nurse for the operating theatre , 2010, 2010 3rd IEEE RAS & EMBS International Conference on Biomedical Robotics and Biomechatronics.

[21]  Feng Duan,et al.  Application of the Assembly Skill Transfer System in an Actual Cellular Manufacturing System , 2012, IEEE Transactions on Automation Science and Engineering.

[22]  Jan Peters,et al.  Learning responsive robot behavior by imitation , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[23]  Yangsheng Xu,et al.  Human action learning via hidden Markov model , 1997, IEEE Trans. Syst. Man Cybern. Part A.

[24]  Jeff A. Bilmes,et al.  A gentle tutorial of the em algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models , 1998 .