Birth of the Object: Detection of Objectness and Extraction of Object Shape through Object-Action Complexes

We describe a process in which the segmentation of objects and the extraction of object shape are realized through active exploration by a robot vision system. In the exploration process, two behavioral modules that link robot actions to the visual and haptic perception of objects interact. First, an object-independent grasping mechanism is used to gain physical control over potential objects. Once the initial grasp has been evaluated as successful, a second behavior extracts the object shape by making use of predictions based on the motion induced by the robot. This also leads to the concept of an "object" as a set of features that change predictably over different frames. The system is equipped with a certain degree of generic prior knowledge about the world: a sophisticated visual feature extraction process in an early cognitive vision system, knowledge about its own embodiment, and knowledge about geometric relationships such as rigid body motion. This prior knowledge allows the extraction of representations that are semantically richer than those of many other approaches.
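The idea of an "object" as a set of features that change predictably under robot-induced motion can be illustrated with a minimal sketch. It is not the system described here: the function names, the point-feature representation, the restriction to a rotation about the z-axis, and the tolerance value are all assumptions chosen for illustration. Features seen in one frame are moved by the known rigid body motion of the gripper, and only those whose predicted position is matched by an observation in the next frame are attributed to the grasped object.

```python
import math

def rigid_transform(point, angle_z, translation):
    """Rotate a 3D point about the z-axis by angle_z (radians), then translate.
    (Hypothetical helper; a real system would use a full 6-DOF pose.)"""
    x, y, z = point
    c, s = math.cos(angle_z), math.sin(angle_z)
    tx, ty, tz = translation
    return (c * x - s * y + tx, s * x + c * y + ty, z + tz)

def confirmed_features(features, observed, angle_z, translation, tol=0.01):
    """Return indices of features whose predicted position under the known
    motion lies within tol of some feature observed in the next frame."""
    kept = []
    for i, p in enumerate(features):
        predicted = rigid_transform(p, angle_z, translation)
        if any(math.dist(predicted, q) <= tol for q in observed):
            kept.append(i)
    return kept

# Hypothetical data: two features on the grasped object, one on the background.
frame0 = [(1.0, 0.0, 0.0), (0.0, 1.0, 0.5), (2.0, 2.0, 0.0)]
angle = math.pi / 2          # assumed gripper rotation between frames
shift = (0.0, 0.0, 0.1)      # assumed gripper translation between frames

# In the next frame the object features follow the gripper; the background stays put.
frame1 = [
    rigid_transform(frame0[0], angle, shift),
    rigid_transform(frame0[1], angle, shift),
    frame0[2],
]

object_indices = confirmed_features(frame0, frame1, angle, shift)
print(object_indices)
```

In this toy setting the background feature is rejected because it does not move as the known gripper motion predicts, which is the sense in which predictability separates the object from the rest of the scene.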
