Vision-inertial SLAM using natural features in outdoor environments

Simultaneous Localization and Mapping (SLAM) is a recursive probabilistic inference process used for robot navigation when the Global Positioning System (GPS) is unavailable. SLAM builds a map of the robot's environment while concurrently localizing the robot within that map. The ultimate goal of SLAM is to operate anywhere, using the environment's natural features as landmarks. This goal is difficult to achieve for several reasons. Firstly, different environments contain different types of natural features, each exhibiting large variance in shape and appearance. Secondly, objects look different from different viewpoints, which makes them difficult to recognize reliably. Thirdly, in most outdoor environments the motion of a vehicle cannot be predicted from wheel encoders because of errors caused by slippage. Finally, designing a SLAM system to operate in a large-scale outdoor setting is a challenge in itself.

These issues are addressed as follows. Firstly, a camera is used to recognize the environmental context (e.g., indoor office, outdoor park) by analyzing the holistic spectral content of images of the robot's surroundings. A type of feature that is likely to be observable in the recognized setting (e.g., trees for a park) is then chosen for SLAM. A novel tree detection system is introduced that perceptually organizes image content into quasi-vertical structures and marks those structures that intersect ground level as tree trunks. Secondly, a new tree recognition system is proposed that extracts Scale Invariant Feature Transform (SIFT) features from each tree trunk region and matches trees in feature space. Thirdly, dead reckoning is performed by an Inertial Navigation System (INS), bounded by non-holonomic constraints; an INS is insensitive to slippage and varying ground conditions. Finally, the developed computer vision and inertial systems are integrated within an Extended Kalman Filter framework into a working Vision-INS SLAM system named VisSLAM.

VisSLAM is tested on data collected during a real test run in an unstructured outdoor environment. Three test scenarios are proposed, ranging from semi-automatic detection, recognition, and initialization to a fully automated SLAM system. The first two scenarios verify the presented inertial and computer vision algorithms in the context of localization; the results indicate accurate vehicle pose estimation for the majority of the journey. The final scenario evaluates the proposed systems for SLAM; the results indicate successful operation for a long portion of the journey. Although the scope of this thesis is operation in an outdoor park setting using tree trunks as landmarks, the developed techniques lend themselves to other environments with different natural objects as landmarks.
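
As an illustration of how a non-holonomic constraint can bound inertial dead reckoning, the sketch below treats "the vehicle does not slide sideways" as a pseudo-measurement of zero lateral velocity and applies a standard Kalman update. This is a simplified planar example and not the thesis's implementation: the function name, the two-dimensional velocity state, the assumption that the heading is known exactly, and the noise variance `r` are all hypothetical choices made for illustration.

```python
import math


def apply_lateral_velocity_constraint(v, P, heading, r=1e-4):
    """Kalman update with the pseudo-measurement 'lateral body velocity = 0'.

    v       -- (vx, vy) velocity estimate in the navigation frame
    P       -- 2x2 symmetric covariance of v, as nested lists
    heading -- vehicle heading in radians (assumed known here)
    r       -- assumed variance of the pseudo-measurement
    """
    # Measurement row H: projects navigation-frame velocity onto the
    # body-frame lateral axis.
    h = (-math.sin(heading), math.cos(heading))

    # Innovation: the constraint says the lateral velocity should be zero.
    innovation = 0.0 - (h[0] * v[0] + h[1] * v[1])

    # P H^T, innovation variance S, and Kalman gain K.
    ph = (P[0][0] * h[0] + P[0][1] * h[1],
          P[1][0] * h[0] + P[1][1] * h[1])
    s = h[0] * ph[0] + h[1] * ph[1] + r
    k = (ph[0] / s, ph[1] / s)

    # State and covariance update. For symmetric P, (I - K H) P = P - K (P H^T)^T.
    v_new = (v[0] + k[0] * innovation, v[1] + k[1] * innovation)
    P_new = [[P[i][j] - k[i] * ph[j] for j in range(2)] for i in range(2)]
    return v_new, P_new


# Example: vehicle pointing along +x with a spurious sideways velocity of 0.3.
v_est, P_est = apply_lateral_velocity_constraint(
    (1.0, 0.3), [[0.5, 0.0], [0.0, 0.5]], heading=0.0)
print(v_est, P_est)
```

The update removes most of the sideways velocity and shrinks the uncertainty along the lateral axis while leaving the forward component untouched. In a full system the heading would itself be part of the estimated state, so the measurement model would be nonlinear and would require the linearization used by an Extended Kalman Filter.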
