Fast Probabilistic Fusion of 3D Point Clouds via Occupancy Grids for Scene Classification

Abstract. High resolution consumer cameras on Unmanned Aerial Vehicles (UAVs) allow for cheap acquisition of highly detailed images, e.g., of urban regions. Via image registration by means of Structure from Motion (SfM) and Multi View Stereo (MVS) the automatic generation of huge amounts of 3D points with a relative accuracy in the centimeter range is possible. Applications such as semantic classification have a need for accurate 3D point clouds, but do not benefit from an extremely high resolution/density. In this paper, we, therefore, propose a fast fusion of high resolution 3D point clouds based on occupancy grids. The result is used for semantic classification. In contrast to state-of-the-art classification methods, we accept a certain percentage of outliers, arguing that they can be considered in the classification process when a per point belief is determined in the fusion process. To this end, we employ an octree-based fusion which allows for the derivation of outlier probabilities. The probabilities give a belief for every 3D point, which is essential for the semantic classification to consider measurement noise. For an example point cloud with half a billion 3D points (cf. Figure 1), we show that our method can reduce runtime as well as improve classification accuracy and offers high scalability for large datasets.

[1]  M. Goesele,et al.  Floating scale surface reconstruction , 2014, ACM Trans. Graph..

[2]  Claus Brenner,et al.  Object-level Segmentation of RGBD Data , 2014 .

[3]  Umar Mohammed,et al.  Superpixel lattices , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Wolfram Burgard,et al.  Probabilistic Robotics (Intelligent Robotics and Autonomous Agents) , 2005 .

[5]  Marc Levoy,et al.  A volumetric method for building complex models from range images , 1996, SIGGRAPH.

[6]  Christian Heipke,et al.  Conditional Random Fields for Multitemporal and Multiscale Classification of Optical Satellite Imagery , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[7]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[8]  Jitendra Malik,et al.  Learning a classification model for segmentation , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[9]  Helmut Mayer,et al.  Incremental Division of Very Large Point Clouds for Scalable 3D Surface Reconstruction , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[10]  Neubiberg Helmut Mayer From Orientation to Functional Modeling for Terrestrial and UAV Images , 2015 .

[11]  Thomas Brox,et al.  Global, Dense Multiscale Reconstruction for a Billion Points , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[12]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[13]  Hans P. Moravec,et al.  High resolution maps from wide angle sonar , 1985, Proceedings. 1985 IEEE International Conference on Robotics and Automation.

[14]  Steven M. Seitz,et al.  Multicore bundle adjustment , 2011, CVPR 2011.

[15]  Heiko Hirschmüller,et al.  Stereo Processing by Semiglobal Matching and Mutual Information , 2008, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  Ines Ernst,et al.  Mutual Information Based Semi-Global Stereo Matching on the GPU , 2008, ISVC.

[17]  Changchang Wu,et al.  SiftGPU : A GPU Implementation of Scale Invariant Feature Transform (SIFT) , 2007 .

[18]  F. Rottensteiner,et al.  Classification of urban LiDAR data using conditional random field and random forests , 2013, Joint Urban Remote Sensing Event 2013.

[19]  Xi Zhang,et al.  Learning from synthetic models for roof style classification in point clouds , 2014, SIGSPATIAL/GIS.

[20]  Heiko Hirschmüller,et al.  Multi-Resolution Range Data Fusion for Multi-View Stereo Reconstruction , 2013, GCPR.

[21]  Heiko Hirschmüller,et al.  A TV Prior for High-Quality Local Multi-view Stereo Reconstruction , 2014, 2014 2nd International Conference on 3D Vision.

[22]  G. Vosselman Point cloud segmentation for urban scene classification , 2013 .

[23]  Heiko Hirschmüller,et al.  Dense 3D Reconstruction from Wide Baseline Image Sets , 2011, Theoretical Foundations of Computer Vision.

[24]  Uwe Soergel,et al.  Contextual Classification of Full Waveform Lidar Data in the Wadden Sea , 2014, IEEE Geoscience and Remote Sensing Letters.

[25]  Heiko Hirschmüller,et al.  Semi-Global Matching-Motivation, Developments and Applications , 2011 .

[26]  Daniel P. Huttenlocher,et al.  Efficient Graph-Based Image Segmentation , 2004, International Journal of Computer Vision.

[27]  Heiko Hirschmüller,et al.  Evaluation of Stereo Matching Costs on Images with Radiometric Differences , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  M. Rothermel,et al.  SURE : PHOTOGRAMMETRIC SURFACE RECONSTRUCTION FROM IMAGER Y , 2013 .

[29]  Hai Huang,et al.  Robust and efficient urban scene classification using relative features , 2015, SIGSPATIAL/GIS.

[30]  Konrad Schindler,et al.  An Overview and Comparison of Smooth Labeling Methods for Land-Cover Classification , 2012, IEEE Transactions on Geoscience and Remote Sensing.

[31]  M. Goesele,et al.  Fusion of depth maps with multiple scales , 2011, ACM Trans. Graph..

[32]  Changchang Wu,et al.  Towards Linear-Time Incremental Structure from Motion , 2013, 2013 International Conference on 3D Vision.

[33]  Samia Boukir,et al.  Relevance of airborne lidar and multispectral image data for urban scene classification using Random Forests , 2011 .