Structure-aware SLAM with planes and lines in man-made environment

Abstract This paper describes a robust approach to monocular SLAM by incorporating structural elements in man-made environments. Indoor and outdoor scenes often contain a large number of planes and lines, while the classic point-based methods tend to ignore the geometric clues in these common structures. The proposed method exploits not only strong structural constraints on points. More importantly, planes and lines are also treated as supplement for long-distance tracking. Drifting error is reduced via detecting large planes spanning over views and reinforcing coplanarity of triangulated points. Lines are integrated across views with an inverse depth representation, providing extra one-dimensional features and directional information. The structural constraints and additional features are incorporated under a unified minimization framework, and the planes and lines are actively extended as the SLAM system goes on, making our algorithm suitable for exploration-style applications. Real world monocular sequences have demonstrated that the proposed SLAM system outperforms the state-of-the-art and produces accurate results in both indoor and outdoor scenes.

[1]  Shichao Yang,et al.  Pop-up SLAM: Semantic monocular plane SLAM for low-texture environments , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[2]  Vladlen Koltun,et al.  Dense Monocular Depth Estimation in Complex Dynamic Scenes , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Hongbin Zha,et al.  Scalable Monocular SLAM by Fusing and Connecting Line Segments with Inverse Depth Filter , 2018, 2018 24th International Conference on Pattern Recognition (ICPR).

[4]  David W. Murray,et al.  Improving the Agility of Keyframe-Based SLAM , 2008, ECCV.

[5]  Zihan Zhou,et al.  Robust plane-based structure from motion , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[6]  Hauke Strasdat,et al.  Real-time monocular SLAM: Why filter? , 2010, 2010 IEEE International Conference on Robotics and Automation.

[7]  Hongbin Zha,et al.  Structure-Aware SLAM with Planes in Man-Made Environment , 2017, CCCV.

[8]  Wolfram Burgard,et al.  A benchmark for the evaluation of RGB-D SLAM systems , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[9]  Javier González,et al.  PL-SVO: Semi-direct Monocular Visual Odometry by combining points and line segments , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[10]  Chen Feng,et al.  Point-plane SLAM for hand-held 3D sensors , 2013, 2013 IEEE International Conference on Robotics and Automation.

[11]  Akihiro Yamamoto,et al.  Visual Odometry by Multi-frame Feature Integration , 2013, 2013 IEEE International Conference on Computer Vision Workshops.

[12]  Daniel Cremers,et al.  A Photometrically Calibrated Benchmark For Monocular Visual Odometry , 2016, ArXiv.

[13]  Francesc Moreno-Noguer,et al.  PL-SLAM: Real-time monocular visual SLAM with points and lines , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[14]  Andrew J. Davison,et al.  A benchmark for RGB-D visual odometry, 3D reconstruction and SLAM , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[15]  Walterio W. Mayol-Cuevas,et al.  Ninja on a Plane: Automatic Discovery of Physical Planes for Augmented Reality Using Visual SLAM , 2007, 2007 6th IEEE and ACM International Symposium on Mixed and Augmented Reality.

[16]  Shichao Yang,et al.  Direct monocular odometry using points and lines , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[17]  Andrea Fusiello,et al.  Robust Multiple Structures Estimation with J-Linkage , 2008, ECCV.

[18]  Andrew Zisserman,et al.  MLESAC: A New Robust Estimator with Application to Estimating Image Geometry , 2000, Comput. Vis. Image Underst..

[19]  Davide Scaramuzza,et al.  REMODE: Probabilistic, monocular dense reconstruction in real time , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[20]  Davide Scaramuzza,et al.  SVO: Fast semi-direct monocular visual odometry , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[21]  Daniel Cremers,et al.  Semi-dense Visual Odometry for a Monocular Camera , 2013, 2013 IEEE International Conference on Computer Vision.

[22]  Rafael Grompone von Gioi,et al.  LSD: A Fast Line Segment Detector with a False Detection Control , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Adrien Bartoli,et al.  Structure-from-motion using lines: Representation, triangulation, and bundle adjustment , 2005, Comput. Vis. Image Underst..

[24]  Dezhen Song,et al.  Visual Navigation Using Heterogeneous Landmarks and Unsupervised Geometric Constraints , 2015, IEEE Transactions on Robotics.

[25]  Daniel Cremers,et al.  Direct Sparse Odometry , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Daniel P. Huttenlocher,et al.  Efficient Graph-Based Image Segmentation , 2004, International Journal of Computer Vision.

[27]  Tom Drummond,et al.  Machine Learning for High-Speed Corner Detection , 2006, ECCV.

[28]  J. M. M. Montiel,et al.  ORB-SLAM: A Versatile and Accurate Monocular SLAM System , 2015, IEEE Transactions on Robotics.

[29]  Javier Civera,et al.  DPPTAM: Dense piecewise planar tracking and mapping from a monocular sequence , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[30]  Michael Kaess,et al.  Simultaneous localization and mapping with infinite planes , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[31]  Paul H. J. Kelly,et al.  SLAM++: Simultaneous Localisation and Mapping at the Level of Objects , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[32]  G. Klein,et al.  Parallel Tracking and Mapping for Small AR Workspaces , 2007, 2007 6th IEEE and ACM International Symposium on Mixed and Augmented Reality.

[33]  Javier Ibanez Guzman,et al.  Accurate visual odometry from a rear parking camera , 2011, 2011 IEEE Intelligent Vehicles Symposium (IV).

[34]  Carlos Hernández,et al.  Video-based, real-time multi-view stereo , 2011, Image Vis. Comput..

[35]  Andrew Calway,et al.  Unifying Planar and Point Mapping in Monocular SLAM , 2010, BMVC.

[36]  Guofeng Zhang,et al.  Keyframe-based dense planar SLAM , 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[37]  Andrew J. Davison,et al.  DTAM: Dense tracking and mapping in real-time , 2011, 2011 International Conference on Computer Vision.

[38]  Walterio W. Mayol-Cuevas,et al.  Discovering Planes and Collapsing the State Space in Visual SLAM , 2007, BMVC.

[39]  Paul H. J. Kelly,et al.  Dense planar SLAM , 2014, 2014 IEEE International Symposium on Mixed and Augmented Reality (ISMAR).