Knowledge-Driven Saliency: Attention to the Unseen

This paper deals with attention in 3D environments based upon knowledge-driven cues. Using learned 3D scenes as top-down influence, the proposed system is able to mark high saliency to locations occupied by objects that are new, changed, or even missing from their location as compared to the already learned situation. The proposal addresses a system level solution covering learning of 3D objects and scenes using visual, range and odometry sensors, storage of spatial knowledge using multiple-view theory from psychology, and validation of scenes using recognized objects as anchors. The proposed system is designed to handle the complex scenarios of recognition with partially visible objects during revisit to the scene from an arbitrary direction. Simulation results have shown success of the designed methodology under ideal sensor readings from range and odometry sensors.

[1]  M J Tarr,et al.  What Object Attributes Determine Canonical Views? , 1999, Perception.

[2]  Jan-Olof Eklundh,et al.  An Attentional System Combining Top-Down and Bottom-Up Influences , 2008, WAPCV.

[3]  Eiichi Hoshino,et al.  Memory Formation of Object Representation: Natural Scenes , 2008 .

[4]  Wolfram Burgard,et al.  Unsupervised learning of 3D object models from partial views , 2009, 2009 IEEE International Conference on Robotics and Automation.

[5]  Thierry Baccino,et al.  Modeling visual attention on scenes , 2010, Stud. Inform. Univ..

[6]  Martin A. Giese,et al.  Hooligan detection: The effects of saliency and expert knowledge , 2011, ICDP.

[7]  M. Posner,et al.  Attention and the detection of signals. , 1980, Journal of experimental psychology.

[8]  Xin Chen,et al.  Is visual search a top-down or bottom-up process? , 2010 .

[9]  A. Treisman,et al.  A feature-integration theory of attention , 1980, Cognitive Psychology.

[10]  Michael W. Levine,et al.  More ups and downs of visual processing , 2010 .

[11]  Maria Petrou,et al.  Proceedings of the 17th International Conference on Pattern Recognition (ICPR 2004) , 2004 .

[12]  Xin Chen,et al.  Real-world visual search is dominated by top-down guidance , 2006, Vision Research.

[13]  Laurent Itti,et al.  An Integrated Model of Top-Down and Bottom-Up Attention for Optimizing Detection Speed , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[14]  Andrew C. Beall,et al.  Three dimensional spatial memory and learning in real and virtual environments , 2001, Spatial Cogn. Comput..

[15]  John K. Tsotsos,et al.  Attention in Cognitive Systems, 5th International Workshop on Attention in Cognitive Systems, WAPCV 2008, Fira, Santorini, Greece, May 12, 2008, Revised Selected Papers , 2009, WAPCV.

[16]  Indriyati Atmosukarto,et al.  A Learning Approach to 3D Object Representation for Classification , 2008, SSPR/SPR.

[17]  Bärbel Mertsching,et al.  Modeling and Simulating Mobile Robot Environments , 2008, GRAPP.

[18]  Antonio Adán,et al.  Free-Shaped Object Recognition Method from Partial Views Using Weighted Cone Curvatures , 2005, CIARP.

[19]  Georg Hartmann,et al.  Knowledge-based view control of a neural 3-D object recognition system , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[20]  Hoi-Jun Yoo,et al.  Familiarity based unified visual attention model for fast and robust object recognition , 2010, Pattern Recognit..

[21]  Sven J. Dickinson,et al.  Selecting canonical views for view-based 3-D object recognition , 2004, ICPR 2004.

[22]  Mary M Hayhoe,et al.  Spatial memory and saccadic targeting in a natural task. , 2005, Journal of vision.

[23]  S. Palmer Vision Science : Photons to Phenomenology , 1999 .

[24]  J. Theeuwes,et al.  Feature-based memory-driven attentional capture: visual working memory content affects visual attention. , 2006, Journal of experimental psychology. Human perception and performance.

[25]  Lucas Paletta,et al.  Attention in Cognitive Systems. Theories and Systems from an Interdisciplinary Viewpoint , 2008, Lecture Notes in Computer Science.

[26]  Timothy P McNamara,et al.  Spatial memory and perspective taking , 2004, Memory & cognition.

[27]  Bärbel Mertsching,et al.  Validating Vision and Robotic Algorithms for Dynamic Real World Environments , 2010, SIMPAR.