RGBD-Sphere SLAM

This article proposes a SLAM algorithm referred to as RGBD-Sphere SLAM. The key innovation of this work is the prototypical system that demonstrates how formal models of 3D geometric shape and appearance can be transformed into generative classification models that detect and recognize these shapes. Object models are specified as shape programs in PSML; a custom-built procedural language for 3D object modeling. Classifiers for each PSML shape are created by simulating how instances of each shape manifest in real-world sensor data, e.g., color images and range images. The proposed RGBD-Sphere SLAM algorithm demonstrates a prototypical example of the system. Here, the PSML program specifies spherical 3D objects having diffuse surface albedos and distinct color appearances. A recognizer uses PSML models of each object's geometry and appearance to detect instances of these objects within streaming RGBD sensor data. The detected model parameters are then integrated into an RGBD SLAM algorithm; hence the name RGBD-Sphere SLAM. This article describes the PSML programs, the spherical detection and recognition algorithms used, and describes the impact this approach has for improving the performance of RGBD SLAM approaches by incorporating detected objects as landmarks. This is the first example of a prototypical system that externalizes the geometric and appearance modeling to a programming language from which a recognizer is created, and marks an important step towards enabling users to “program” their problem space and allow computers to transform the formal object models, as expressed in PSML, into customized classifiers suited for specific sensor suites, e.g., color imagery and depth imagery.

[1]  Daniel Cremers,et al.  Dense visual SLAM for RGB-D cameras , 2013, 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[2]  Wolfram Burgard,et al.  A benchmark for the evaluation of RGB-D SLAM systems , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[3]  Peter Wonka,et al.  Generating 3D Building Models from Architectural Drawings: A Survey , 2009, IEEE Computer Graphics and Applications.

[4]  Dong-Ming Yan,et al.  Generating and exploring good building layouts , 2013, ACM Trans. Graph..

[5]  Julien Weissenberg,et al.  Inverse Procedural Modelling and Applications , 2014 .

[6]  Daniel Cremers,et al.  Robust odometry estimation for RGB-D cameras , 2013, 2013 IEEE International Conference on Robotics and Automation.

[7]  Luc Van Gool,et al.  Procedural 3D Building Reconstruction Using Shape Grammars and Detectors , 2011, 2011 International Conference on 3D Imaging, Modeling, Processing, Visualization and Transmission.

[8]  Alexei A. Efros,et al.  Blocks World Revisited: Image Understanding Using Qualitative Geometry and Mechanics , 2010, ECCV.

[9]  Paul H. J. Kelly,et al.  SLAM++: Simultaneous Localisation and Mapping at the Level of Objects , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Adrian Hilton,et al.  Block world reconstruction from spherical stereo image pairs , 2015, Comput. Vis. Image Underst..

[11]  Marc Levoy,et al.  Efficient variants of the ICP algorithm , 2001, Proceedings Third International Conference on 3-D Digital Imaging and Modeling.

[12]  Christian Ah-Soon,et al.  A complete system for the analysis of architectural drawings , 2000, International Journal on Document Analysis and Recognition.

[13]  John J. Leonard,et al.  Kintinuous: Spatially Extended KinectFusion , 2012, AAAI 2012.

[14]  Nikos Paragios,et al.  Segmentation of building facades using procedural shape priors , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[15]  Andrew W. Fitzgibbon,et al.  KinectFusion: Real-time dense surface mapping and tracking , 2011, 2011 10th IEEE International Symposium on Mixed and Augmented Reality.

[16]  Bernhard Hohmann,et al.  Cityfit - High-Quality Urban Reconstruction by Fitting Shape Grammars to Image and derived Textured Point Clouds , 2008 .

[17]  Luc Van Gool,et al.  Procedural modeling of buildings , 2006, ACM Trans. Graph..

[18]  Luc Van Gool,et al.  Image-based procedural modeling of facades , 2007, SIGGRAPH 2007.

[19]  Wolfram Burgard,et al.  A Tutorial on Graph-Based SLAM , 2010, IEEE Intelligent Transportation Systems Magazine.

[20]  Nassir Navab,et al.  Model globally, match locally: Efficient and robust 3D object recognition , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[21]  Michael Wimmer,et al.  Instant architecture , 2003, ACM Trans. Graph..

[22]  Wolfram Burgard,et al.  G2o: A general framework for graph optimization , 2011, 2011 IEEE International Conference on Robotics and Automation.

[23]  Bui Tuong Phong Illumination for computer generated pictures , 1975, Commun. ACM.