Biologically Inspired Mobile Robot Vision Localization

We present a robot localization system using biologically inspired vision. Our system models two extensively studied human visual capabilities: (1) extracting the ldquogistrdquo of a scene to produce a coarse localization hypothesis and (2) refining it by locating salient landmark points in the scene. Gist is computed here as a holistic statistical signature of the image, thereby yielding abstract scene classification and layout. Saliency is computed as a measure of interest at every image location, which efficiently directs the time-consuming landmark-identification process toward the most likely candidate locations in the image. The gist features and salient regions are then further processed using a Monte Carlo localization algorithm to allow the robot to generate its position. We test the system in three different outdoor environments-building complex (38.4 m times 54.86 m area, 13 966 testing images), vegetation-filled park (82.3 m times 109.73 m area, 26 397 testing images), and open-field park (137.16 m times 178.31 m area, 34 711 testing images)-each with its own challenges. The system is able to localize, on average, within 0.98, 2.63, and 3.46 m, respectively, even with multiple kidnapped-robot instances.

[1]  M. Potter Meaning in visual search. , 1975, Science.

[2]  A. Treisman,et al.  A feature-integration theory of attention , 1980, Cognitive Psychology.

[3]  I Biederman,et al.  Do Background Depth Gradients Facilitate Object Identification? , 1981, Perception.

[4]  Leslie G. Ungerleider Two cortical visual systems , 1982 .

[5]  B. Tversky,et al.  Categories of environmental scenes , 1983, Cognitive Psychology.

[6]  T. McNamara Memory's view of space , 1991 .

[7]  J. Wolfe,et al.  Guided Search 2.0 A revised model of visual search , 1994, Psychonomic bulletin & review.

[8]  S. Thorpe,et al.  Speed of processing in the human visual system , 1996, Nature.

[9]  W. Epstein,et al.  Priming Spatial Layout of Scenes , 1997 .

[10]  Wolfram Burgard,et al.  A Probabilistic Approach to Concurrent Mapping and Localization for Mobile Robots , 1998, Auton. Robots.

[11]  BurgardWolfram,et al.  A Probabilistic Approach to Concurrent Mapping and Localization for Mobile Robots , 1998 .

[12]  Sebastian Thrun,et al.  Learning Metric-Topological Maps for Indoor Mobile Robot Navigation , 1998, Artif. Intell..

[13]  Wolfram Burgard,et al.  Monte Carlo Localization: Efficient Position Estimation for Mobile Robots , 1999, AAAI/IAAI.

[14]  Russell A. Epstein,et al.  The Parahippocampal Place Area Recognition, Navigation, or Encoding? , 1999, Neuron.

[15]  Wolfram Burgard,et al.  MINERVA: a second-generation museum tour-guide robot , 1999, Proceedings 1999 IEEE International Conference on Robotics and Automation (Cat. No.99CH36288C).

[16]  Ronald A. Rensink The Dynamic Representation of Scenes , 2000 .

[17]  Illah R. Nourbakhsh,et al.  Appearance-based place recognition for topological localization , 2000, Proceedings 2000 ICRA. Millennium Conference. IEEE International Conference on Robotics and Automation. Symposia Proceedings (Cat. No.00CH37065).

[18]  C. Koch,et al.  Models of bottom-up and top-down visual attention , 2000 .

[19]  A. Oliva,et al.  Diagnostic Colors Mediate Scene Recognition , 2000, Cognitive Psychology.

[20]  C. Koch,et al.  Computational modelling of visual attention , 2001, Nature Reviews Neuroscience.

[21]  Wolfram Burgard,et al.  Robust Monte Carlo localization for mobile robots , 2001, Artif. Intell..

[22]  Peter K. Allen,et al.  Topological mobile robot localization using fast vision techniques , 2002, Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292).

[23]  Rafael Murrieta-Cid,et al.  Visual Navigation in Natural Environments: From Range and Color Data to a Landmark-Based Model , 2002, Auton. Robots.

[24]  Sebastian Thrun,et al.  FastSLAM: a factored solution to the simultaneous localization and mapping problem , 2002, AAAI/IAAI.

[25]  P. Perona,et al.  Rapid natural scene categorization in the near absence of attention , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[26]  Antonio Torralba,et al.  Context-based vision system for place and object recognition , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[27]  Antonio Torralba,et al.  Modeling global scene factors in attention. , 2003, Journal of the Optical Society of America. A, Optics, image science, and vision.

[28]  Barbara Tversky,et al.  Navigating by Mind and by Body , 2003, Spatial Cognition.

[29]  Joachim Hertzberg,et al.  Indoor and outdoor localization for fast mobile robots , 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566).

[30]  Pietro Perona,et al.  Is bottom-up attention useful for object recognition? , 2004, CVPR 2004.

[31]  Wei Zhang,et al.  Localization Based on Building Recognition , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops.

[32]  James J. Little,et al.  Vision-based global localization and mapping for mobile robots , 2005, IEEE Transactions on Robotics.

[33]  Paolo Pirjanian,et al.  A Visual Front-end for Simultaneous Localization and Mapping , 2005, Proceedings of the 2005 IEEE International Conference on Robotics and Automation.

[34]  S. Thorpe,et al.  Rapid categorization of achromatic natural scenes: how robust at very low contrasts? , 2005, The European journal of neuroscience.

[35]  Laurent Itti,et al.  Robot steering with spectral image information , 2005, IEEE Transactions on Robotics.

[36]  Y. Shirai,et al.  A View-Based Outdoor Navigation Using Object Recognition Robust to Changes of Weather and Seasons , 2005 .

[37]  Kurt Konolige,et al.  Real-time Localization in Outdoor Environments using Stereo Vision and Inexpensive GPS , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[38]  Hongbin Zha,et al.  Coarse-to-fine vision-based localization by indexing scale-Invariant features , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[39]  Barbara Caputo,et al.  Visual Servoing to Help Camera Operators Track Better , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[40]  Henrik I. Christensen,et al.  Attentional Landmark Selection for Visual SLAM , 2006, 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[41]  Wei Zhang,et al.  Image Based Localization in Urban Environments , 2006, Third International Symposium on 3D Data Processing, Visualization, and Transmission (3DPVT'06).

[42]  Javier González,et al.  Consistent observation grouping for generating metric-topological maps that improves robot localization , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[43]  Frank Dellaert,et al.  A Rao-Blackwellized particle filter for topological mapping , 2006, Proceedings 2006 IEEE International Conference on Robotics and Automation, 2006. ICRA 2006..

[44]  Laurent Itti,et al.  Biologically-inspired robotics vision monte-carlo localization in the outdoor environment , 2007, 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[45]  Laurent Itti,et al.  Ieee Transactions on Pattern Analysis and Machine Intelligence 1 Rapid Biologically-inspired Scene Classification Using Features Shared with Visual Attention , 2022 .

[46]  Benjamin Kuipers,et al.  An Intellectual History of the Spatial Semantic Hierarchy , 2008, Robotics and Cognitive Approaches to Spatial Mapping.

[47]  Adriana Tapus,et al.  Mobile robot localization using panoramic vision and combinations of feature region detectors , 2008, 2008 IEEE International Conference on Robotics and Automation.

[48]  Achim J. Lilienthal,et al.  Incremental spectral clustering and seasons: Appearance-based localization in outdoor environments , 2008, 2008 IEEE International Conference on Robotics and Automation.

[49]  Laurent Itti,et al.  Storing and recalling information for vision localization , 2008, 2008 IEEE International Conference on Robotics and Automation.

[50]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[51]  Laurent Itti,et al.  Comparison of gist models in rapid scene categorization tasks , 2010 .

[52]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .