Demand-Driven Visual Information Acquisition

Fast, reliable and demand-driven acquisition of visual information is the key to represent visual scenes efficiently. To achieve this efficiency, a cognitive vision system must plan the utilization of its processing resources to acquire only information relevant for the task. Here, the incorporation of long-term knowledge plays a major role on deciding which information to gather. In this paper, we present a first approach to make use of the knowledge about the world and its structure to plan visual actions. We propose a method to schedule those visual actions to allow for a fast discrimination between objects that are relevant or irrelevant for the task. By doing so, we are able to reduce the system's computational demand. A first evaluation of our ideas is given using a proof-of-concept implementation.

[1]  S. Ullman Visual routines , 1984, Cognition.

[2]  Marc-Oliver Gewaltig,et al.  A model of computation in neocortical architecture , 1999, Neural Networks.

[3]  H Pashler,et al.  Familiarity and visual change detection , 1988, Perception & psychophysics.

[4]  Julian Eggert,et al.  A Cortex-Inspired Neural-Symbolic Network for Knowledge Representation , 2007, NeSy.

[5]  J. Henderson,et al.  The effects of semantic consistency on eye movements during complex scene viewing , 1999 .

[6]  L. Itti,et al.  Modeling the influence of task on attention , 2005, Vision Research.

[7]  Sven Rebhan,et al.  First steps towards an intentional vision system , 2007 .

[8]  Julian Eggert,et al.  Attention Modulation Using Short- and Long-Term Knowledge , 2008, ICVS.

[9]  H. Intraub The representation of visual scenes , 1997, Trends in Cognitive Sciences.

[10]  L. Itti,et al.  Search Goal Tunes Visual Features Optimally , 2007, Neuron.

[11]  A. L. I︠A︡rbus Eye Movements and Vision , 1967 .

[12]  M. Hayhoe Vision Using Routines: A Functional Account of Vision , 2000 .

[13]  D. Ballard,et al.  Memory Representations in Natural Tasks , 1995, Journal of Cognitive Neuroscience.

[14]  M. Just,et al.  Eye fixations and cognitive processes , 1976, Cognitive Psychology.

[15]  Julian Eggert,et al.  Consistent Modeling of Functional Dependencies along with World Knowledge , 2009 .

[16]  Dana H. Ballard,et al.  Animate Vision , 1991, Artif. Intell..

[17]  D. Ballard,et al.  What you see is what you need. , 2003, Journal of vision.

[18]  John K. Tsotsos On the relative complexity of active vs. passive visual search , 2004, International Journal of Computer Vision.