Deictic Pronoun Learning and Mirror Self-Identification

The ability to identify the self in a mirror reflection and the ability to use the word “I” effectively are commonly seen as major milestones in a human infant’s development of a concept of self. In addition, deictic pronouns such as “I” and “you” present a technical challenge to computational methods for grounded word learning, which have commonly associated word definitions with sensory patterns instead of pragmatic roles. Here, a robot learns the usage of the words “I” and “you” by observing others playing a game of catch, then correctly uses the terms to refer to its mirror image and to a conversational partner, respectively. Word learning occurs by using already understood words (“got the ball”) to infer the referents of spoken sentences. The properties of those referents, including the conversational roles of “speaker” and “addressee” as well as properties unique to each person, are then associated with the unknown words, and the significance of these associations ranked via chi-square tests. After sufficient observation of others using “I” and “you,” the robot’s own usage is correct without any need for supervised learning. To achieve mirror selfrecognition, the robot uses the timing of the visual feedback that results from its arm’s movement. The part of the image that is labeled as “self” is then treated as the robot’s location in the image for the purpose of responding to the command, “Say who got the ball.”

[1]  Y. Oshima-Takane Analysis of pronominal errors: a case-study , 1992, Journal of Child Language.

[2]  Deb Roy,et al.  Mental imagery for a conversational robot , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[3]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[4]  Brian Scassellati,et al.  Motion-based robotic self-recognition , 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566).

[5]  Yuriko Oshima-Takane,et al.  Birth Order Effects on Early Language Development: Do Secondborn Children Learn from Overheard Speech? , 1996 .

[6]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[7]  C. Allen,et al.  The Cognitive Animal: Empirical and Theoretical Perspectives on Animal Cognition , 2002 .

[8]  G. Gallup Chimpanzees: Self-Recognition , 1970, Science.

[9]  Luc Steels,et al.  Aibo''s first words. the social learning of language and meaning. Evolution of Communication , 2002 .

[10]  B. Amsterdam,et al.  Mirror self-image reactions before age two. , 1972, Developmental psychobiology.

[11]  Alex Pentland,et al.  Learning words from sights and sounds: a computational model , 2002, Cogn. Sci..

[12]  Paul Lamere,et al.  Sphinx-4: a flexible open source framework for speech recognition , 2004 .

[13]  Paul M. Fitzpatrick,et al.  Developmentally deep perceptual system for a humanoid robot , 2003 .

[14]  T. Shultz,et al.  The learning of first and second person pronouns in English: network models and analysis , 1999, Journal of Child Language.

[15]  Brian Scassellati,et al.  Grounded Pronoun Learning and Pronoun Reversal , 2006 .

[16]  G. Gallup,et al.  The mirror test , 2002 .

[17]  D. Reiss,et al.  Mirror self-recognition in the bottlenose dolphin: A case of cognitive convergence , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[18]  Brian Scassellati,et al.  Using context and sensory data to learn first and second person pronouns , 2006, HRI '06.