Improved visual SLAM: a novel approach to mapping and localization using visual landmarks in consecutive frames

Pathfinding is becoming more and more common in autonomous vehicle navigation, robot localization, and other computer vision applications. In this paper, a novel approach to mapping and localization is presented that extracts visual landmarks from a robot dataset acquired by a Kinect sensor. The visual landmarks are detected and recognized using the improved scale-invariant feature transform (I-SIFT) method. The methodology is based on detecting stable and invariant landmarks in consecutive (red-green-blue depth) RGB-D frames of the robot dataset. These landmarks are then used to determine the robot path, and a map is constructed by using the visual landmarks. A number of experiments were performed on various datasets in an indoor environment. The proposed method performs efficient landmark detection in various environments, which includes changes in rotation and illumination. The experimental results show that the proposed method can solve the simultaneous localization and mapping (SLAM) problem using stable visual landmarks, but with less computation time.

[1]  Luc Van Gool,et al.  From images to 3D models , 2002, CACM.

[2]  Sebastian Thrun,et al.  FastSLAM: a factored solution to the simultaneous localization and mapping problem , 2002, AAAI/IAAI.

[3]  Joachim Hertzberg,et al.  6D SLAM—3D mapping outdoor environments , 2007, J. Field Robotics.

[4]  Robert C. Bolles,et al.  Outdoor Mapping and Navigation Using Stereo Vision , 2006, ISER.

[5]  Daniel Cremers,et al.  Real-Time Dense Geometry from a Handheld Camera , 2010, DAGM-Symposium.

[6]  Andreas Nüchter,et al.  Automatic appearance‐based loop detection from three‐dimensional laser data using the normal distributions transform , 2009, J. Field Robotics.

[7]  Andreas Nüchter,et al.  Automatic Appearance-Based Loop Detection from 3 D Laser Data Using the Normal Distributions Transform , 2009 .

[8]  Shengyong Chen,et al.  Vision sensor planning for 3-D model acquisition , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[9]  Hanseok Ko,et al.  Robust local feature extraction algorithm with visual cortex for object recognition , 2011 .

[10]  James J. Little,et al.  Mobile Robot Localization and Mapping with Uncertainty using Scale-Invariant Visual Landmarks , 2002, Int. J. Robotics Res..

[11]  Wolfram Burgard,et al.  An evaluation of the RGB-D SLAM system , 2012, 2012 IEEE International Conference on Robotics and Automation.

[12]  Wolfram Burgard,et al.  Improved Techniques for Grid Mapping With Rao-Blackwellized Particle Filters , 2007, IEEE Transactions on Robotics.

[13]  Inkyu Moon,et al.  Depth estimation of features in video frames with improved feature matching technique using Kinect sensor , 2012 .

[14]  Frank Dellaert,et al.  iSAM: Incremental Smoothing and Mapping , 2008, IEEE Transactions on Robotics.

[15]  Inkyu Moon,et al.  Improved scale-invariant feature transform feature-matching technique-based object tracking in video sequences via a neural network and Kinect sensor , 2013, J. Electronic Imaging.

[16]  Inkyu Moon,et al.  Extraction of Visual Landmarks Using Improved Feature Matching Technique for Stereo Vision Applications , 2012 .

[17]  Joachim Hertzberg,et al.  Evolving interface design for robot search tasks: Research Articles , 2007 .

[18]  Youdan Kim,et al.  Collision avoidance for quadrotor using stereo vision depth maps , 2015, IEEE Transactions on Aerospace and Electronic Systems.

[19]  Ling Shao,et al.  Computer vision for RGB-D sensors: Kinect and its applications [special issue intro.] , 2013, IEEE Transactions on Cybernetics.

[20]  Patrick Rives,et al.  Real-time Quadrifocal Visual Odometry , 2010, Int. J. Robotics Res..

[21]  A. Aydin Alatan,et al.  3-D Rigid Body Tracking Using Vision and Depth Sensors , 2013, IEEE Transactions on Cybernetics.

[22]  Wolfram Burgard,et al.  Nonlinear Constraint Network Optimization for Efficient Map Learning , 2009, IEEE Transactions on Intelligent Transportation Systems.

[23]  Shai Avidan,et al.  Support vector tracking , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Yohan Fougerolle,et al.  Static-Map and Dynamic Object Reconstruction in Outdoor Scenes Using 3-D Motion Segmentation , 2016, IEEE Robotics and Automation Letters.

[25]  Dieter Fox,et al.  RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments , 2010, ISER.

[26]  Jörg Stückler,et al.  Integrating depth and color cues for dense multi-resolution scene mapping using RGB-D cameras , 2012, 2012 IEEE International Conference on Multisensor Fusion and Integration for Intelligent Systems (MFI).

[27]  Reinhard Koch,et al.  An Analysis-by-Synthesis Camera Tracking Approach Based on Free-Form Surfaces , 2007, DAGM-Symposium.

[28]  Hauke Strasdat,et al.  Scale Drift-Aware Large Scale Monocular SLAM , 2010, Robotics: Science and Systems.

[29]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[30]  Cordelia Schmid,et al.  Local Grayvalue Invariants for Image Retrieval , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[31]  Andrew W. Fitzgibbon,et al.  KinectFusion: Real-time dense surface mapping and tracking , 2011, 2011 10th IEEE International Symposium on Mixed and Augmented Reality.

[32]  Cordelia Schmid,et al.  Scale & Affine Invariant Interest Point Detectors , 2004, International Journal of Computer Vision.

[33]  Wolfram Burgard,et al.  Towards a benchmark for RGB-D SLAM evaluation , 2011, RSS 2011.

[34]  Esa Alhoniemi,et al.  Clustering of the self-organizing map , 2000, IEEE Trans. Neural Networks Learn. Syst..

[35]  David Nistér,et al.  Preemptive RANSAC for live structure and motion estimation , 2005, Machine Vision and Applications.

[36]  Chih-Hsien Hsia,et al.  A Stereo Vision-Based Self-Localization System , 2013, IEEE Sensors Journal.

[37]  Hongbin Zha,et al.  Coarse-to-fine vision-based localization by indexing scale-Invariant features , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[38]  Kurt Konolige,et al.  Towards lifelong visual maps , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[39]  Jose Luis Blanco Claraco Development of Scientific Applications with the Mobile Robot Programming Toolkit , 2010 .

[40]  Lynne Outhred,et al.  An evaluation of the , 2001 .

[41]  Roberto Manduchi,et al.  Obstacle Detection and Terrain Classification for Autonomous Off-Road Navigation , 2005, Auton. Robots.

[42]  Yan Ke,et al.  PCA-SIFT: a more distinctive representation for local image descriptors , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[43]  Andreas Nüchter,et al.  Automatic appearance-based loop detection from three-dimensional laser data using the normal distributions transform , 2009 .

[44]  Wen Yu,et al.  Recent advances on simultaneous localization and mapping for mobile robots , 2013 .