Globally Consistent Indoor Mapping via a Decoupling Rotation and Translation Algorithm Applied to RGB-D Camera Output

This paper presents a novel RGB-D 3D reconstruction algorithm for the indoor environment. The method can produce globally-consistent 3D maps for potential GIS applications. As the consumer RGB-D camera provides a noisy depth image, the proposed algorithm decouples the rotation and translation for a more robust camera pose estimation, which makes full use of the information, but also prevents inaccuracies caused by noisy depth measurements. The uncertainty in the image depth is not only related to the camera device, but also the environment; hence, a novel uncertainty model for depth measurements was developed using Gaussian mixture applied to multi-windows. The plane features in the indoor environment contain valuable information about the global structure, which can guide the convergence of camera pose solutions, and plane and feature point constraints are incorporated in the proposed optimization framework. The proposed method was validated using publicly-available RGB-D benchmarks and obtained good quality trajectory and 3D models, which are difficult for traditional 3D reconstruction algorithms.

[1]  Shi-Min Hu,et al.  3D indoor scene modeling from RGB-D data: a survey , 2015, Computational Visual Media.

[2]  Andrew Owens,et al.  SUN3D: A Database of Big Spaces Reconstructed Using SfM and Object Labels , 2013, 2013 IEEE International Conference on Computer Vision.

[3]  M. Yang,et al.  Global robust image rotation from combined weighted averaging , 2017 .

[4]  Jörg Stückler,et al.  Efficient Multi-resolution Plane Segmentation of 3D Point Clouds , 2011, ICIRA.

[5]  Gary R. Bradski,et al.  ORB: An efficient alternative to SIFT or SURF , 2011, 2011 International Conference on Computer Vision.

[6]  Andreas Birk,et al.  Online three‐dimensional SLAM by registration of large planar surface segments and closed‐form pose‐graph relaxation , 2010, J. Field Robotics.

[7]  Dieter Fox,et al.  RGB-D mapping: Using Kinect-style depth cameras for dense 3D modeling of indoor environments , 2012, Int. J. Robotics Res..

[8]  Liang Zhang,et al.  Modeling and analyzing 3D complex building interiors for effective evacuation simulations , 2012 .

[9]  John J. Leonard,et al.  Kintinuous: Spatially Extended KinectFusion , 2012, AAAI 2012.

[10]  Filip Biljecki,et al.  Applications of 3D City Models: State of the Art Review , 2015, ISPRS Int. J. Geo Inf..

[11]  Wolfram Burgard,et al.  An evaluation of the RGB-D SLAM system , 2012, 2012 IEEE International Conference on Robotics and Automation.

[12]  Marc Pollefeys,et al.  Dense Semantic 3D Reconstruction , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Stefan Leutenegger,et al.  ElasticFusion: Real-time dense SLAM and light source estimation , 2016, Int. J. Robotics Res..

[14]  Hae-Kyong Kang,et al.  A Standard Indoor Spatial Data Model - OGC IndoorGML and Implementation Approaches , 2017, ISPRS Int. J. Geo Inf..

[15]  Wolfram Burgard,et al.  A benchmark for the evaluation of RGB-D SLAM systems , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[16]  Juan D. Tardós,et al.  ORB-SLAM2: An Open-Source SLAM System for Monocular, Stereo, and RGB-D Cameras , 2016, IEEE Transactions on Robotics.

[17]  Daniel Cremers,et al.  Dense visual SLAM for RGB-D cameras , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[18]  Matthias Nießner,et al.  BundleFusion , 2016, TOGS.

[19]  J. D. Wegner,et al.  Globally consistent registration of terrestrial laser scans via graph optimization , 2015 .

[20]  Friedrich Keller,et al.  Precise indoor mapping as a basis for coarse indoor navigation , 2013 .

[21]  Sander Oude Elberink,et al.  Accuracy and Resolution of Kinect Depth Data for Indoor Mapping Applications , 2012, Sensors.

[22]  Javier Civera,et al.  RGBDTAM: A cost-effective and accurate RGB-D tracking and mapping system , 2017, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[23]  Krystof Litomisky Consumer RGB-D Cameras and their Applications , 2012 .

[24]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[25]  Shilin Zhou,et al.  Convolutional neural network-based image representation for visual loop closure detection , 2015, 2015 IEEE International Conference on Information and Automation.

[26]  Jörg Stückler,et al.  Model Learning and Real-Time Tracking Using Multi-Resolution Surfel Maps , 2012, AAAI.

[27]  G. Vosselman,et al.  GENERATION AND WEIGHTING OF 3D POINT CORRESPONDENCES FOR IMPROVED REGISTRATION OF RGB-D DATA , 2013 .

[28]  Nico Blodow,et al.  Close-range scene segmentation and reconstruction of 3D point cloud maps for mobile manipulation in domestic environments , 2009, 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[29]  M. L. Yeh,et al.  The Evaluation of GPS techniques for UAV-based Photogrammetry in Urban Area , 2016 .

[30]  Pascal Monasse,et al.  Global Fusion of Relative Motions for Robust, Accurate and Scalable Structure from Motion , 2013, ICCV.

[31]  Andrew W. Fitzgibbon,et al.  KinectFusion: Real-time dense surface mapping and tracking , 2011, 2011 10th IEEE International Symposium on Mixed and Augmented Reality.

[32]  Sisi Zlatanova,et al.  NIBU: a new approach to representing and analysing interior utility networks within 3D geo-information systems , 2012, Int. J. Digit. Earth.

[33]  John J. Leonard,et al.  Real-time large-scale dense RGB-D SLAM with volumetric fusion , 2014, Int. J. Robotics Res..

[34]  Evangelos E. Milios,et al.  Globally Consistent Range Scan Alignment for Environment Mapping , 1997, Auton. Robots.

[35]  Joachim Hertzberg,et al.  Globally consistent 3D mapping with scan matching , 2008, Robotics Auton. Syst..

[36]  Lutz Plümer,et al.  Estimation of 3D Indoor Models with Constraint Propagation and Stochastic Reasoning in the Absence of Indoor Measurements , 2017, ISPRS Int. J. Geo Inf..

[37]  Jean-Arcady Meyer,et al.  Fast and Incremental Method for Loop-Closure Detection Using Bags of Visual Words , 2008, IEEE Transactions on Robotics.

[38]  Andrew J. Davison,et al.  A benchmark for RGB-D visual odometry, 3D reconstruction and SLAM , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[39]  Nadisson Luis Pavan,et al.  A Global Closed-Form Refinement for Consistent TLS Data Registration , 2017, IEEE Geoscience and Remote Sensing Letters.

[40]  Derek D. Lichti,et al.  IMU and Multiple RGB-D Camera Fusion for Assisting Indoor Stop-and-Go 3D Terrestrial Laser Scanning , 2014, Robotics.

[41]  Christopher Hunt,et al.  Notes on the OpenSURF Library , 2009 .

[42]  George Vosselman,et al.  Mapping Indoor Spaces by Adaptive Coarse-to-Fine Registration of RGB-D Data , 2016, IEEE Geoscience and Remote Sensing Letters.

[43]  Jean-Claude Thill,et al.  Traveling in the three-dimensional city: applications in route planning, accessibility assessment, location analysis and beyond , 2011 .

[44]  Thomas A. Funkhouser,et al.  Fine-to-Coarse Global Registration of RGB-D Scans , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[45]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[46]  K. Khoshelham,et al.  A WEIGHTED CLOSED-FORM SOLUTION FOR RGB-D DATA REGISTRATION , 2016 .

[47]  Alias Abdul Rahman,et al.  Automatic Generation of 3D Networks in CityGML and Design of an Intelligent Individual Evacuation Model for Building Fires Within the Scope of 3D GIS , 2014 .