Paper Information - Biologically Inspired Mobile Robot Vision Localization

Biologically Inspired Mobile Robot Vision Localization

We present a robot localization system using biologically inspired vision. Our system models two extensively studied human visual capabilities: (1) extracting the "gist" of a scene to produce a coarse localization hypothesis and (2) refining it by locating salient landmark points in the scene. Gist is computed here as a holistic statistical signature of the image, thereby yielding abstract scene classification and layout. Saliency is computed as a measure of interest at every image location, which efficiently directs the time-consuming landmark-identification process toward the most likely candidate locations in the image. The gist features and salient regions are then further processed using a Monte Carlo localization algorithm to allow the robot to estimate its position. We test the system in three different outdoor environments: a building complex (38.4 m × 54.86 m area, 13 966 testing images), a vegetation-filled park (82.3 m × 109.73 m area, 26 397 testing images), and an open-field park (137.16 m × 178.31 m area, 34 711 testing images), each with its own challenges. The system is able to localize, on average, within 0.98, 2.63, and 3.46 m, respectively, even with multiple kidnapped-robot instances.
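The Monte Carlo localization step mentioned in the abstract can be illustrated with a minimal particle-filter sketch. This is a toy under stated assumptions, not the authors' implementation: the robot moves along a 1-D track, and a noisy scalar position reading stands in for the gist and salient-landmark observations. The names `mcl_step`, `estimate`, and `run_demo`, and all noise parameters, are hypothetical and chosen only for illustration.

```python
import math
import random


def mcl_step(particles, motion, measurement,
             motion_noise=0.5, sensor_noise=1.0, rng=random):
    """One Monte Carlo localization update on a 1-D track (toy example)."""
    # 1. Motion update: shift every particle by the odometry reading plus noise.
    moved = [p + motion + rng.gauss(0.0, motion_noise) for p in particles]

    # 2. Measurement update: weight each particle with a Gaussian likelihood.
    #    Assumption: the sensor directly reports a noisy position. The paper
    #    instead derives its observations from gist features and landmarks.
    weights = [
        math.exp(-0.5 * ((measurement - p) / sensor_noise) ** 2)
        for p in moved
    ]
    if sum(weights) == 0.0:
        # Every particle is far from the measurement: fall back to uniform.
        weights = [1.0] * len(moved)

    # 3. Resampling: draw a new particle set in proportion to the weights.
    return rng.choices(moved, weights=weights, k=len(moved))


def estimate(particles):
    """Position estimate as the mean of the particle set."""
    return sum(particles) / len(particles)


def run_demo(seed=0, n_particles=500, n_steps=20):
    """Track a robot that starts at 10.0 and advances 1.0 per step."""
    rng = random.Random(seed)
    true_pos = 10.0
    # Global uncertainty at start: particles spread over the whole track.
    particles = [rng.uniform(0.0, 100.0) for _ in range(n_particles)]
    for _ in range(n_steps):
        true_pos += 1.0
        reading = true_pos + rng.gauss(0.0, 1.0)  # simulated noisy sensor
        particles = mcl_step(particles, 1.0, reading, rng=rng)
    return estimate(particles), true_pos


if __name__ == "__main__":
    est, truth = run_demo()
    print(f"estimated position: {est:.2f}, true position: {truth:.2f}")
```

Starting from particles spread uniformly over the track mirrors the global-localization setting, where the robot has no prior knowledge of its position. Recovering from a kidnapped-robot event would typically need additional handling, such as injecting fresh random particles, which this sketch omits.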

Laurent Itti | Christian Siagian
