A fusion algorithm for building three-dimensional maps

Recently various algorithms for building of three-dimensional maps of indoor environments have been proposed. In this work we use a Kinect camera that captures RGB images along with depth information for building three-dimensional dense maps of indoor environments. Commonly mapping systems consist of three components; that is, first, spatial alignment of consecutive data frames; second, detection of loop-closures, and finally, globally consistent alignment of the data sequence. It is known that three-dimensional point clouds are well suited for frame-to-frame alignment and for three-dimensional dense reconstruction without the use of valuable visual RGB information. A new fusion algorithm combining visual features and depth information for loop-closure detection followed by pose optimization to build global consistent maps is proposed. The performance of the proposed system in real indoor environments is presented and discussed.

[1]  Dieter Fox,et al.  RGB-D Mapping: Using Depth Cameras for Dense 3D Modeling of Indoor Environments , 2010, ISER.

[2]  Oliver Birbach,et al.  Experiences in building a visual SLAM system from open source components , 2011, 2011 IEEE International Conference on Robotics and Automation.

[3]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[4]  Wolfram Burgard,et al.  An evaluation of the RGB-D SLAM system , 2012, 2012 IEEE International Conference on Robotics and Automation.

[5]  Luc Van Gool,et al.  SURF: Speeded Up Robust Features , 2006, ECCV.

[6]  Olivier Stasse,et al.  MonoSLAM: Real-Time Single Camera SLAM , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Steven M. Seitz,et al.  Photo tourism: exploring photo collections in 3D , 2006, ACM Trans. Graph..

[8]  David Nistér,et al.  An efficient solution to the five-point relative pose problem , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[9]  Nicola Fioraio,et al.  Realtime Visual and Point Cloud SLAM Nicola Fioraio Willow Garage , 2011 .

[10]  Kurt Konolige,et al.  Large-Scale Visual Odometry for Rough Terrain , 2007, ISRR.

[11]  Gérard G. Medioni,et al.  Object modeling by registration of multiple range images , 1991, Proceedings. 1991 IEEE International Conference on Robotics and Automation.

[12]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[13]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[14]  V. Díaz-Ramírez,et al.  A novel image matching algorithm based on sliding histograms of oriented gradients , 2014 .

[15]  Jan-Michael Frahm,et al.  Detailed Real-Time Urban 3D Reconstruction from Video , 2007, International Journal of Computer Vision.

[16]  Robert C. Bolles,et al.  Outdoor Mapping and Navigation Using Stereo Vision , 2006, ISER.

[17]  Michael A. Greenspan,et al.  The parallel iterative closest point algorithm , 2001, Proceedings Third International Conference on 3-D Digital Imaging and Modeling.

[18]  Paul J. Besl,et al.  A Method for Registration of 3-D Shapes , 1992, IEEE Trans. Pattern Anal. Mach. Intell..