Active Sensorimotor Object Recognition in Three-Dimensional Space

Spatial interaction of biological agents with their environment is based on the cognitive processing of sensory as well as motor information. There are many models for sole sensory processing but only a few for integrating sensory and motor information into a unifying sensorimotor approach. Additionally, neither the relations shaping the integration are yet clear nor how the integrated information can be used in an underlying representation. Therefore, we propose a probabilistic model for integrated processing of sensory and motor information by combining bottom-up feature extraction and top-down action selection embedded in a Bayesian inference approach. The integration of sensory perceptions and motor information brings about two main advantages: (i) Their statistical dependencies can be exploited by representing the spatial relationships of the sensor information in the underlying joint probability distribution and (ii) a top-down process can compute the next most informative region according to an information gain strategy. We evaluated our system in two different object recognition tasks. We found that the integration of sensory and motor information significantly improves active object recognition, in particular when these movements have been chosen by an information gain strategy.

[1]  L. Stark,et al.  Experimental metaphysics: The scanpath as an epistemological mechanism , 1996 .

[2]  Alan Yuille,et al.  Active Vision , 2014, Computer Vision, A Reference Guide.

[3]  J. Kevin O'Regan,et al.  What it is like to see: A sensorimotor theory of perceptual experience , 2001, Synthese.

[4]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[5]  Gerhard Krieger,et al.  Scene analysis with saccadic eye movements: Top-down and bottom-up modeling , 2001, J. Electronic Imaging.

[6]  Wolfram Burgard,et al.  Coastal navigation-mobile robot navigation with uncertainty in dynamic environments , 1999, Proceedings 1999 IEEE International Conference on Robotics and Automation (Cat. No.99CH36288C).

[7]  Joana Hois,et al.  A belief-based architecture for scene analysis: From sensorimotor features to knowledge and ontology , 2009, Fuzzy Sets Syst..

[8]  Dana H. Ballard,et al.  Animate Vision , 1991, Artif. Intell..

[9]  Alva Noë,et al.  Action in Perception , 2006, Representation and Mind.

[10]  Thomas F. Shipley,et al.  Spatial Cognition VII, International Conference, Spatial Cognition 2010, Mt. Hood/Portland, OR, USA, August 15-19, 2010. Proceedings , 2010, Spatial Cognition.

[11]  G. Griffin,et al.  Caltech-256 Object Category Dataset , 2007 .

[12]  Thomas Reineking,et al.  Bio-inspired Architecture for Active Sensorimotor Localization , 2010, Spatial Cognition.

[13]  Leslie Pack Kaelbling,et al.  Acting under uncertainty: discrete Bayesian models for mobile-robot navigation , 1996, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. IROS '96.

[14]  J. Gibson The Ecological Approach to Visual Perception , 1979 .

[15]  R. Bajcsy Active perception , 1988 .

[16]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[17]  W. Prinz A common-coding approach to perception and action , 1990 .

[18]  Karl J. Friston,et al.  A free energy principle for the brain , 2006, Journal of Physiology-Paris.

[19]  C. Freksa,et al.  Visual Attention and Cognition , 1996 .

[20]  U. Neisser Cognitive Psychology. (Book Reviews: Cognition and Reality. Principles and Implications of Cognitive Psychology) , 1976 .

[21]  Gerhard Krieger,et al.  Investigation of a sensorimotor system for saccadic scene analysis: an integrated approach , 1998 .

[22]  L. Stark,et al.  Scanpaths in saccadic eye movements while viewing and recognizing patterns. , 1971, Vision research.

[23]  A. Noë,et al.  A sensorimotor account of vision and visual consciousness. , 2001, The Behavioral and brain sciences.

[24]  Kerstin Schill Decision Support Systems with Adaptive Reasoning Strategies , 1997, Foundations of Computer Science: Potential - Theory - Cognition.

[25]  G. Aschersleben,et al.  The Theory of Event Coding (TEC): a framework for perception and action planning. , 2001, The Behavioral and brain sciences.

[26]  Nick Chater,et al.  Information gain explains relevance which explains the selection task , 1995, Cognition.

[27]  Christoph Zetzsche,et al.  Sensorimotor representation and knowledge-based reasoning for spatial exploration and localisation , 2008, Cognitive Processing.

[28]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[29]  Wolfram Burgard,et al.  Information Gain-based Exploration Using Rao-Blackwellized Particle Filters , 2005, Robotics: Science and Systems.

[30]  Ernst Pöppel,et al.  Completing Knowledge by Competing Hierarchies , 1991, UAI.

[31]  Antonio Torralba,et al.  Building the gist of a scene: the role of global image features in recognition. , 2006, Progress in brain research.