Object Recognition from Multiple Percepts

This paper presents a perceptual system that uses human caregivers as catalysts for the humanoid robot Cog to perceive and learn about objects, scenes, people, and the robot itself. It addresses a broad spectrum of machine learning problems in object recognition across several levels of categorization. The paper introduces a new, complex approach to object recognition based on the integration of multiple percepts. Training data for all learning mechanisms is generated automatically from the actions of an embodied agent, so that the robot develops categorization autonomously. The cognitive capabilities of the humanoid robot are created developmentally, starting from infant-like abilities for detecting, segmenting, and recognizing percepts across multiple sensing modalities. Human caregivers lend a helping hand in communicating such information to the robot: by acting on objects, they induce the compliant perception of those objects through human-robot interaction.
