How to train your robot - teaching service robots to reproduce human social behavior

Developing interactive behaviors for social robots presents a number of challenges. It is difficult to interpret the meaning of the details of people's behavior, particularly non-verbal behavior such as body positioning, yet a social robot needs to respond contingently to such subtle behaviors. It also needs to generate utterances and non-verbal behavior with good timing and coordination. The rules for such behavior are often based on implicit knowledge and are thus difficult for a designer to describe or program explicitly. We propose to teach such behaviors to a robot with a learning-by-demonstration approach, using recorded human-human interaction data to identify both the behaviors the robot should perform and the social cues it should respond to. In this study, we present a fully unsupervised approach that uses abstraction and clustering to identify behavior elements and joint interaction states, which are then used in a variable-length Markov model predictor to generate socially appropriate behavior commands for a robot. The proposed technique provides encouraging results despite high levels of sensor noise, especially in speech recognition. We demonstrate our system with a robot in a shopping scenario.
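
As a rough illustration of the prediction step only, the following sketch shows how a variable-length context over discrete interaction states might be used to choose the next behavior. It is not the implementation used in this study: it substitutes a simple longest-matching-context (back-off) counter for a full variable-length Markov model, and the class name `VariableOrderPredictor`, the `max_order` parameter, and the state labels such as `C:ask_price` are hypothetical stand-ins for the clustered behavior elements described above.

```python
from collections import Counter, defaultdict


class VariableOrderPredictor:
    """Back-off predictor over discrete states (illustrative assumption,
    not the paper's model): uses the longest previously seen context."""

    def __init__(self, max_order=2):
        self.max_order = max_order
        # Maps a context tuple (length 0..max_order) to next-state counts.
        self.counts = defaultdict(Counter)

    def fit(self, sequences):
        for seq in sequences:
            for i, target in enumerate(seq):
                # Record the target under every context length available.
                for k in range(self.max_order + 1):
                    if i - k < 0:
                        break
                    context = tuple(seq[i - k:i])
                    self.counts[context][target] += 1
        return self

    def predict(self, history):
        history = tuple(history)
        # Try the longest context first, then back off to shorter ones.
        for k in range(min(self.max_order, len(history)), -1, -1):
            context = history[len(history) - k:]
            if context in self.counts:
                return self.counts[context].most_common(1)[0][0]
        return None  # Only reached when the model has not been trained.


# Hypothetical customer (C) and shopkeeper (S) state sequences.
training = [
    ["C:enter", "S:greet", "C:ask_price", "S:tell_price"],
    ["C:enter", "S:greet", "C:browse", "S:wait"],
    ["C:ask_price", "S:tell_price", "C:browse", "S:wait"],
]
model = VariableOrderPredictor(max_order=2).fit(training)
next_after_question = model.predict(["S:greet", "C:ask_price"])
next_after_browse = model.predict(["C:browse"])
```

In this toy data, the two-state context ending in a price question leads to the shopkeeper's price answer, while an unseen history falls back to shorter contexts. In a real system the states would come from clustering noisy sensor and speech data rather than from hand-written labels.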
