Learning Controllers for Reactive and Proactive Behaviors in Human–Robot Collaboration

Designed to safely share a workspace with humans and assist them in a variety of tasks, collaborative robots are targeting manufacturing and service applications that were once considered unattainable. The wide diversity of tasks, the unstructured environments, and the close interaction with humans require collaborative robots to adapt their behaviors seamlessly so that they can cooperate successfully with users in different and possibly new situations (characterized, for example, by the positions of objects or landmarks in the environment, or by the user's pose). This paper investigates how controllers capable of reactive and proactive behaviors in collaborative tasks can be learned from demonstrations. The proposed approach exploits the temporal coherence and dynamic characteristics of the task observed during the training phase to build a probabilistic model that enables the robot both to react to the user's actions and to lead the task when needed. The method extends the Hidden Semi-Markov Model (HSMM) by adapting the duration probability distribution according to the interaction with the user. This Adaptive Duration Hidden Semi-Markov Model (ADHSMM) is used to retrieve a sequence of states that governs a trajectory optimization, which in turn provides the reference and gain matrices to the robot controller. A proof-of-concept evaluation is first carried out in a pouring task. The proposed framework is then tested in a collaborative task using a 7-DOF backdrivable manipulator.
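
To make the idea of an interaction-dependent state duration concrete, the following is a minimal sketch and not the paper's implementation. The abstract does not specify how the duration distribution is adapted, so the sketch assumes that each state holds a joint Gaussian over (duration, scalar interaction signal) and that the duration is obtained by conditioning on the observed signal. The names `conditioned_duration`, `rollout`, `dur_models` and `user_input` are hypothetical, and the greedy rollout (expected duration, most likely transition) is a simplification chosen to keep the example deterministic.

```python
import numpy as np

def conditioned_duration(mu, sigma, u):
    """Condition a joint Gaussian over (duration, input) on an observed input u.

    mu    : [mean_duration, mean_input]
    sigma : 2x2 covariance of (duration, input)
    Returns the mean and variance of the duration given u.
    """
    mu = np.asarray(mu, dtype=float)
    sigma = np.asarray(sigma, dtype=float)
    gain = sigma[0, 1] / sigma[1, 1]
    mean = mu[0] + gain * (u - mu[1])
    var = sigma[0, 0] - gain * sigma[1, 0]
    return mean, var

def rollout(trans, dur_models, user_input, start, n_steps):
    """Greedy state sequence of length n_steps (assumed simplification).

    Each visited state is held for its expected conditioned duration
    (at least one step), then the most likely next state is entered.
    user_input(t) returns the interaction signal at time step t.
    """
    seq = []
    state = start
    while len(seq) < n_steps:
        u = user_input(len(seq))
        mean, _ = conditioned_duration(*dur_models[state], u)
        steps = max(1, int(round(mean)))
        seq.extend([state] * steps)
        state = int(np.argmax(trans[state]))
    return seq[:n_steps]

# Two-state toy model: the duration of state 0 grows with the input,
# while the duration of state 1 is independent of it.
trans = np.array([[0.0, 1.0],
                  [1.0, 0.0]])
dur_models = [
    ([5.0, 0.0], [[5.0, 2.0], [2.0, 1.0]]),
    ([3.0, 0.0], [[1.0, 0.0], [0.0, 1.0]]),
]

passive = rollout(trans, dur_models, lambda t: 0.0, start=0, n_steps=10)
delayed = rollout(trans, dur_models, lambda t: 2.0, start=0, n_steps=10)
print(passive)
print(delayed)
```

In this toy setting, a larger interaction signal keeps the model in state 0 for longer, which is one way to picture a robot that waits for the user, whereas a nominal signal lets the state sequence advance on its own timing.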
