Towards a platform-independent cooperative human-robot interaction system: II. Perception, execution and imitation of goal directed actions

If robots are to cooperate with humans in an increasingly human-like manner, then significant progress must be made in their abilities to observe and learn to perform novel goal directed actions in a flexible and adaptive manner. The current research addresses this challenge. In CHRIS.I [1], we developed a platform-independent perceptual system that learns from observation to recognize human actions in a way which abstracted from the specifics of the robotic platform, learning actions including “put X on Y” and “take X”. In the current research, we extend this system from action perception to execution, consistent with current developmental research in human understanding of goal directed action and teleological reasoning. We demonstrate the platform independence with experiments on three different robots. In Experiments 1 and 2 we complete our previous study of perception of actions “put” and “take” demonstrating how the system learns to execute these same actions, along with new related actions “cover” and “uncover” based on the composition of action primitives “grasp X” and “release X at Y”. Significantly, these compositional action execution specifications learned on one iCub robot are then executed on another, based on the abstraction layer of motor primitives. Experiment 3 further validates the platform-independence of the system, as a new action that is learned on the iCub in Lyon is then executed on the Jido robot in Toulouse. In Experiment 4 we extended the definition of action perception to include the notion of agency, again inspired by developmental studies of agency attribution, exploiting the Kinect motion capture system for tracking human motion. Finally in Experiment 5 we demonstrate how the combined representation of action in terms of perception and execution provides the basis for imitation. This provides the basis for an open ended cooperation capability where new actions can be learned and integrated into shared plans for cooperation. Part of the novelty of this research is the robots' use of spoken language understanding and visual perception to generate action representations in a platform independent manner based on physical state changes. This provides a flexible capability for goal-directed action imitation.

[1]  Rüdiger Dillmann,et al.  Teaching and learning of robot tasks via observation of human performance , 2004, Robotics Auton. Syst..

[2]  G. Csibra,et al.  Goal attribution without agency cues: the perception of ‘pure reason’ in infancy , 1999, Cognition.

[3]  Peter Ford Dominey,et al.  The basis of shared intentions in robot and human cognition , 2009 .

[4]  Renée Baillargeon,et al.  The development of calibration-based reasoning about collision events in young infants , 1998, Cognition.

[5]  Pattie Maes,et al.  Postural primitives: Interactive Behavior for a Humanoid Robot Arm , 1996 .

[6]  K. Dautenhahn,et al.  The correspondence problem , 2002 .

[7]  R. James Firby,et al.  Building symbolic primitives with continuous control routines , 1992 .

[8]  T. Kanda,et al.  Six-and-a-half-month-old children positively attribute goals to human action and to humanoid-robot motion , 2005 .

[9]  Wolfgang Prinz,et al.  The early origins of goal attribution in infancy , 2003, Consciousness and Cognition.

[10]  E. Spelke,et al.  Object permanence in five-month-old infants , 1985, Cognition.

[11]  Yonghong Yan,et al.  Universal speech tools: the CSLU toolkit , 1998, ICSLP.

[12]  Dare A. Baldwin,et al.  Intentions and Intentionality: Foundations of Social Cognition , 2001 .

[13]  Yiannis Demiris,et al.  Hierarchies of Coupled Inverse and Forward Models for Abstraction in Robot Action Planning, Recognition and Imitation , 2005 .

[14]  Oussama Khatib,et al.  Synthesis of Whole-Body Behaviors through Hierarchical Control of Behavioral Primitives , 2005, Int. J. Humanoid Robotics.

[15]  A. Woodward Infants' ability to distinguish between purposeful and non-purposeful behaviors , 1999 .

[16]  Peter Ford Dominey,et al.  Linking Language with Embodied and Teleological Representations of Action for Humanoid Cognition , 2010, Front. Neurorobot..

[17]  J. Mandler How to build a baby: II. Conceptual primitives. , 1992, Psychological review.

[18]  Pradeep K. Khosla,et al.  Manipulation task primitives for composing robot skills , 1997, Proceedings of International Conference on Robotics and Automation.

[19]  Peter Ford Dominey,et al.  The basis of shared intentions in human and robot cognition , 2011 .

[20]  Yiannis Demiris,et al.  Distributed, predictive perception of actions: a biologically inspired robotics architecture for imitation and learning , 2003, Connect. Sci..

[21]  E. Spelke,et al.  Origins of knowledge. , 1992, Psychological review.

[22]  H. Furth Object permanence in five-month-old infants. , 1987, Cognition.

[23]  Peter Ford Dominey,et al.  Towards a platform-independent cooperative human-robot interaction system: I. Perception , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[24]  E Bizzi,et al.  Motor learning through the combination of primitives. , 2000, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[25]  J. J. Guajardo,et al.  How infants make sense of intentional action , 2001 .

[26]  Giulio Sandini,et al.  The iCub humanoid robot: an open platform for research in embodied cognition , 2008, PerMIS.

[27]  Jean-Arcady Meyer,et al.  Behavior-Based Primitives for Articulated Control , 1998 .

[28]  Stewart W. Wilson,et al.  From Animals to Animats 5. Proceedings of the Fifth International Conference on Simulation of Adaptive Behavior , 1997 .

[29]  Chrystopher L. Nehaniv,et al.  Imitation with ALICE: learning to imitate corresponding actions across dissimilar embodiments , 2002, IEEE Trans. Syst. Man Cybern. Part A.

[30]  Giulio Sandini,et al.  An experimental evaluation of a novel minimum-jerk cartesian controller for humanoid robots , 2010, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[31]  György Gergely,et al.  What should a robot learn from an infant? Mechanisms of action interpretation and observational learning in infancy , 2003, Connect. Sci..

[32]  Brett Browning,et al.  A survey of robot learning from demonstration , 2009, Robotics Auton. Syst..

[33]  G. Rizzolatti,et al.  The mirror-neuron system. , 2004, Annual review of neuroscience.

[34]  Maja J. Matarić,et al.  Behavior-based primitives for articulated control , 1998 .

[35]  Matthew M. Williamson,et al.  Postural primitives: Interactive Behavior for a Humanoid Robot Arm , 1996 .

[36]  Peter Ford Dominey,et al.  Developmental stages of perception and language acquisition in a perceptually grounded robot , 2005, Cognitive Systems Research.

[37]  Aude Billard,et al.  Goal-Directed Imitation in a Humanoid Robot , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.

[38]  Peter Ford Dominey,et al.  Learning to talk about events from narrated video in a construction grammar framework , 2005, Artif. Intell..

[39]  Jeffrey Mark Siskind,et al.  Visual event perception , 1997 .

[40]  G. Sandini,et al.  Understanding mirror neurons. , 2006 .

[41]  Renée Baillargeon,et al.  Can infants attribute to an agent a disposition to perform a particular action? , 2005, Cognition.

[42]  G. Csibra,et al.  Teleological reasoning in infancy: the naı̈ve theory of rational action , 2003, Trends in Cognitive Sciences.

[43]  Deb Roy,et al.  Semiotic schemas: A framework for grounding language in action and perception , 2005, Artif. Intell..

[44]  E. Bizzi,et al.  Linear combinations of primitives in vertebrate motor control. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[45]  Jun Tani,et al.  Motor primitive and sequence self-organization in a hierarchical recurrent neural network , 2004, Neural Networks.

[46]  G. Aschersleben,et al.  The Theory of Event Coding (TEC): a framework for perception and action planning. , 2001, The Behavioral and brain sciences.

[47]  A. Woodward Infants selectively encode the goal object of an actor's reach , 1998, Cognition.

[48]  Leonard Talmy,et al.  Force Dynamics in Language and Cognition , 1987, Cogn. Sci..

[49]  Ulrike Thomas,et al.  Error-tolerant execution of complex robot tasks based on skill primitives , 2003, 2003 IEEE International Conference on Robotics and Automation (Cat. No.03CH37422).