论文信息 - DPPTAM: Dense piecewise planar tracking and mapping from a monocular sequence

DPPTAM: Dense piecewise planar tracking and mapping from a monocular sequence

This paper proposes a direct monocular SLAM algorithm that estimates a dense reconstruction of a scene in real-time on a CPU. Highly textured image areas are mapped using standard direct mapping techniques [1], that minimize the photometric error across different views. We make the assumption that homogeneous-color regions belong to approximately planar areas. Our contribution is a new algorithm for the estimation of such planar areas, based on the information of a superpixel segmentation and the semidense map from highly textured areas. We compare our approach against several alternatives using the public TUM dataset [2] and additional live experiments with a hand-held camera. We demonstrate that our proposal for piecewise planar monocular SLAM is faster, more accurate and more robust than the piecewise planar baseline [3]. In addition, our experimental results show how the depth regularization of monocular maps can damage its accuracy, being the piecewise planar assumption a reasonable option in indoor scenarios.

Javier Civera | Alejo Concha | Javier Civera | Alejo Concha

[1] Vijay Kumar,et al. Autonomous multi-floor indoor navigation with a computationally constrained MAV , 2011, 2011 IEEE International Conference on Robotics and Automation.

[2] Andrew J. Davison,et al. DTAM: Dense tracking and mapping in real-time , 2011, 2011 International Conference on Computer Vision.

[3] Davide Scaramuzza,et al. REMODE: Probabilistic, monocular dense reconstruction in real time , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[4] Davide Scaramuzza,et al. SVO: Fast semi-direct monocular visual odometry , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[5] Carlos Hernández,et al. Video-based, real-time multi-view stereo , 2011, Image Vis. Comput..

[6] Olivier Stasse,et al. MonoSLAM: Real-Time Single Camera SLAM , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7] Javier Civera,et al. Dense multi-planar scene estimation from a sparse set of images , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[8] Daniel Cremers,et al. CopyMe3D: Scanning and Printing Persons in 3D , 2013, GCPR.

[9] Derek Hoiem,et al. Recovering the spatial layout of cluttered rooms , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[10] G. Klein,et al. Parallel Tracking and Mapping for Small AR Workspaces , 2007, 2007 6th IEEE and ACM International Symposium on Mixed and Augmented Reality.

[11] Javier Civera,et al. Real-time localization and dense mapping in underwater environments from a monocular sequence , 2015, OCEANS 2015 - Genova.

[12] Simon Baker,et al. Lucas-Kanade 20 Years On: A Unifying Framework , 2004, International Journal of Computer Vision.

[13] Jan-Michael Frahm,et al. Piecewise planar and non-planar stereo for urban scene reconstruction , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[14] Changhai Xu,et al. Real-time indoor scene understanding using Bayesian filtering with motion cues , 2011, 2011 International Conference on Computer Vision.

[15] Juan D. Tardós,et al. Probabilistic Semi-Dense Mapping from Highly Accurate Feature-Based Monocular SLAM , 2015, Robotics: Science and Systems.

[16] Wolfram Burgard,et al. A benchmark for the evaluation of RGB-D SLAM systems , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[17] Javier Civera,et al. Using superpixels in monocular SLAM , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[18] Daniel Cremers,et al. Real-Time Dense Geometry from a Handheld Camera , 2010, DAGM-Symposium.

[19] Javier Civera,et al. Manhattan and Piecewise-Planar Constraints for Dense Monocular Mapping , 2014, Robotics: Science and Systems.

[20] Daniel Cremers,et al. LSD-SLAM: Large-Scale Direct Monocular SLAM , 2014, ECCV.

[21] Paul Newman,et al. Distraction suppression for vision-based pose estimation at city scales , 2013, 2013 IEEE International Conference on Robotics and Automation.

[22] 智一吉田,et al. Efficient Graph-Based Image Segmentationを用いた圃場図自動作成手法の検討 , 2014 .

[23] Ian D. Reid,et al. Manhattan scene understanding using monocular, stereo, and 3D features , 2011, 2011 International Conference on Computer Vision.

[24] Jan-Michael Frahm,et al. A Comparative Analysis of RANSAC Techniques Leading to Adaptive Real-Time Random Sample Consensus , 2008, ECCV.