Deep point embedding for urban classification using ALS point clouds: A new perspective from local to global

Abstract Semantic interpretation of the 3D scene is one of the most challenging problems in point cloud processing, which also deems as an essential task in a wide variety of point cloud applications. The core task of semantic interpretation is semantic labeling, namely, obtaining a unique semantic label for each point in the point cloud. Despite several reported approaches, semantic labeling continues to be a challenge owing to the complexity of scenes, objects of various scales, and the non-homogeneity of unevenly distributed points. In this paper, we propose a novel method for obtaining semantic labels of airborne laser scanning (ALS) point clouds involving the embedding of local context information for each point with multi-scale deep learning, nonlinear manifold learning for feature dimension reduction, and global graph-based optimization for refining the classification results. Specifically, we address the tasks of learning discriminative features and global labeling smoothing. The key contribution of our study is threefold. First, a hierarchical data augmentation strategy is applied to enhance the learning of deep features based on the PointNet++ network and simultaneously eliminate the artifacts caused by division and sampling while dealing with large-scale datasets. Subsequently, the learned hierarchical deep features are globally optimized and embedded into a low-dimensional space with a nonlinear manifold-based joint learning method with the removal of redundant and disturbing information. Finally, a graph-structured optimization based on the Markov random fields algorithm is performed to achieve global optimization of the initial classification results that are obtained using the embedded deep features by constructing a weighted indirect graph and solving the optimization problem with graph-cuts. We conducted thorough experiments on ALS point cloud datasets to assess the performance of our framework. Results indicate that compared to other commonly used advanced classification methods, our method can achieve high classification accuracy. The overall accuracy (OA) of our approach on the ISPRS benchmark dataset can scale up to 83.2% for classifying nine semantic classes, thereby outperforming other compared point-based strategies. Additionally, we evaluated our framework on a selected portion of the AHN3 dataset, which provided OA up to 91.2%.

[1]  Shaohui Sun,et al.  Aerial 3D Building Detection and Modeling From Airborne LiDAR Point Clouds , 2013, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[2]  Jonathan Cheung-Wai Chan,et al.  Evaluation of random forest and adaboost tree-based ensemble classification and spectral band selection for ecotope mapping using airborne hyperspectral imagery , 2008 .

[3]  J. Reitberger,et al.  Analysis of full waveform LIDAR data for the classification of deciduous and coniferous trees , 2008 .

[4]  Yusheng Xu,et al.  Multi-Scale Local Context Embedding for LiDAR Point Cloud Classification , 2020, IEEE Geoscience and Remote Sensing Letters.

[5]  Fan Zhang,et al.  Classification of airborne laser scanning data using JointBoost , 2015 .

[6]  Matthias Nießner,et al.  ScanNet: Richly-Annotated 3D Reconstructions of Indoor Scenes , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Dong Chen,et al.  LiDAR Point Clouds to 3-D Urban Models$:$ A Review , 2018, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[8]  Zhen Wang,et al.  A Multilevel Point-Cluster-Based Discriminative Feature for ALS Point Cloud Classification , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[9]  Uwe Stilla,et al.  Detection of fallen trees in ALS point clouds using a Normalized Cut approach trained by simulation , 2015 .

[10]  Thomas L. Ainsworth,et al.  Exploiting manifold geometry in hyperspectral imagery , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[11]  Uwe Stilla,et al.  Change detection in urban areas by object-based analysis and on-the-fly comparison of multi-view ALS data , 2013 .

[12]  Boris Jutzi,et al.  Investigations on surface reflection models for intensity normalization in airborne laser scanning (ALS) data , 2010 .

[13]  Norbert Pfeifer,et al.  Improving LiDAR classification accuracy by contextual label smoothing in post-processing , 2019, ISPRS Journal of Photogrammetry and Remote Sensing.

[14]  Nassir Navab,et al.  Fully-Convolutional Point Networks for Large-Scale Point Clouds , 2018, ECCV.

[15]  Markus Gerke,et al.  The ISPRS benchmark on urban object classification and 3D building reconstruction , 2012 .

[16]  Silvio Savarese,et al.  SEGCloud: Semantic Segmentation of 3D Point Clouds , 2017, 2017 International Conference on 3D Vision (3DV).

[17]  T. M. Lillesand,et al.  Remote Sensing and Image Interpretation , 1980 .

[18]  Konrad Schindler,et al.  An Overview and Comparison of Smooth Labeling Methods for Land-Cover Classification , 2012, IEEE Transactions on Geoscience and Remote Sensing.

[19]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[20]  Leonidas J. Guibas,et al.  Volumetric and Multi-view CNNs for Object Classification on 3D Data , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Gunho Sohn,et al.  Full Waveform-based Analysis for Forest Type Information Derivation from Large Footprint Spaceborne Lidar Data , 2011 .

[22]  Cheng Wang,et al.  3-D Deep Feature Construction for Mobile Laser Scanning Point Cloud Registration , 2019, IEEE Geoscience and Remote Sensing Letters.

[23]  Yuliang Wang,et al.  Higher-Order Conditional Random Fields-Based 3D Semantic Labeling of Airborne Laser-Scanning Point Clouds , 2019, Remote. Sens..

[24]  Falko Kuester,et al.  Terrestrial Laser Scanning-Based Structural Damage Assessment , 2010, J. Comput. Civ. Eng..

[25]  Stefan Hinz,et al.  Semantic point cloud interpretation based on optimal neighborhoods, relevant features and efficient classifiers , 2015 .

[26]  Leonidas J. Guibas,et al.  Frustum PointNets for 3D Object Detection from RGB-D Data , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[27]  George Vosselman,et al.  Contextual segment-based classification of airborne laser scanner data , 2017 .

[28]  Yin Zhou,et al.  VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[29]  Nikos Komodakis,et al.  Dynamic Edge-Conditioned Filters in Convolutional Neural Networks on Graphs , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  Frédéric Bosché,et al.  The value of integrating Scan-to-BIM and Scan-vs-BIM techniques for construction monitoring using laser scanning and BIM: The case of cylindrical MEP components , 2015 .

[31]  Pedram Ghamisi,et al.  LiDAR Data Classification Using Extinction Profiles and a Composite Kernel Support Vector Machine , 2017, IEEE Geoscience and Remote Sensing Letters.

[32]  Steffen Urban,et al.  Distinctive 2D and 3D features for automated large-scale scene analysis in urban areas , 2015, Comput. Graph..

[33]  Mikhail Belkin,et al.  Laplacian Eigenmaps for Dimensionality Reduction and Data Representation , 2003, Neural Computation.

[34]  Ruibin Zhao,et al.  Classifying airborne LiDAR point clouds via deep features learned by a multi-scale convolutional neural network , 2018, Int. J. Geogr. Inf. Sci..

[35]  Silvio Savarese,et al.  3D Semantic Parsing of Large-Scale Indoor Spaces , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[36]  Alexandre Boulch,et al.  SnapNet: 3D point cloud semantic labeling with 2D deep segmentation networks , 2017, Comput. Graph..

[37]  Manuela M. Veloso,et al.  Depth camera based indoor mobile robot localization and navigation , 2012, 2012 IEEE International Conference on Robotics and Automation.

[38]  Ji Wan,et al.  Multi-view 3D Object Detection Network for Autonomous Driving , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[39]  Yue Wang,et al.  Dynamic Graph CNN for Learning on Point Clouds , 2018, ACM Trans. Graph..

[40]  Yan Lu,et al.  Simplified markov random fields for efficient semantic labeling of 3D point clouds , 2012, 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[41]  J. Niemeyer,et al.  Contextual classification of lidar data and building object detection in urban areas , 2014 .

[42]  Hilmar Ingensand,et al.  TERRESTRIAL LASER SCANNING FOR DEFORMATION MONITORING-LOAD TESTS ON THE FELSENAU VIADUCT ( CH ) , 2008 .

[43]  Uwe Stilla,et al.  A Voxel- and Graph-Based Strategy for Segmenting Man-Made Infrastructures Using Perceptual Grouping Laws: Comparison and Evaluation , 2018, Photogrammetric Engineering and Remote Sensing.

[44]  Ayman Habib,et al.  Planar-Based Adaptive Down-Sampling of Point Clouds , 2016 .

[45]  Li Ma,et al.  Local Manifold Learning-Based $k$ -Nearest-Neighbor for Hyperspectral Image Classification , 2010, IEEE Transactions on Geoscience and Remote Sensing.

[46]  Bisheng Yang,et al.  Automated Extraction of Building Outlines From Airborne Laser Scanning Point Clouds , 2013, IEEE Geoscience and Remote Sensing Letters.

[47]  Vladimir Kolmogorov,et al.  An Experimental Comparison of Min-Cut/Max-Flow Algorithms for Energy Minimization in Vision , 2004, IEEE Trans. Pattern Anal. Mach. Intell..

[48]  Subhransu Maji,et al.  Multi-view Convolutional Neural Networks for 3D Shape Recognition , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[49]  J. Reitberger,et al.  3D segmentation of single trees exploiting full waveform LIDAR data , 2009 .

[50]  Martial Hebert,et al.  Contextual classification with functional Max-Margin Markov Networks , 2009, CVPR.

[51]  C. Mallet,et al.  A structured regularization framework for spatially smoothing semantic labelings of 3D point clouds , 2017 .

[52]  Wai Yeung Yan,et al.  Urban land cover classification using airborne LiDAR data: A review , 2015 .

[53]  S. J. Oude Elberink,et al.  Multiple-entity based classification of airborne laser scanning data in urban areas , 2014 .

[54]  Bo Du,et al.  A Three-Step Approach for TLS Point Cloud Classification , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[55]  Airong Chen,et al.  Three-Dimensional Reconstruction of Structural Surface Model of Heritage Bridges Using UAV-Based Photogrammetric Point Clouds , 2019, Remote. Sens..

[56]  Uwe Stilla,et al.  Simultaneous Calibration of ALS Systems and Alignment of Multiview LiDAR Scans of Urban Areas , 2012, IEEE Transactions on Geoscience and Remote Sensing.

[57]  Uwe Soergel,et al.  HIERARCHICAL HIGHER ORDER CRF FOR THE CLASSIFICATION OF AIRBORNE LIDAR POINT CLOUDS IN URBAN AREAS , 2016 .

[58]  George Vosselman,et al.  Airborne and terrestrial laser scanning , 2011, Int. J. Digit. Earth.

[59]  Yang Liu,et al.  O-CNN , 2017, ACM Trans. Graph..

[60]  Olga Veksler,et al.  Fast Approximate Energy Minimization via Graph Cuts , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[61]  Juntao Yang,et al.  A probabilistic graphical model for the classification of mobile LiDAR point clouds , 2018, ISPRS Journal of Photogrammetry and Remote Sensing.

[62]  Carl Salvaggio,et al.  A Fully Convolutional Network for Semantic Labeling of 3D Point Clouds , 2017, ISPRS Journal of Photogrammetry and Remote Sensing.

[63]  Martin Weinmann,et al.  USING MULTI-SCALE FEATURES FOR THE 3D SEMANTIC LABELING OFAIRBORNE LASER SCANNING DATA , 2017 .

[64]  C. Mallet,et al.  AIRBORNE LIDAR FEATURE SELECTION FOR URBAN CLASSIFICATION USING RANDOM FORESTS , 2009 .

[65]  Dushyant Rao,et al.  Vote3Deep: Fast object detection in 3D point clouds using efficient convolutional neural networks , 2016, 2017 IEEE International Conference on Robotics and Automation (ICRA).

[66]  J. Demantké,et al.  DIMENSIONALITY BASED SCALE SELECTION IN 3D LIDAR POINT CLOUDS , 2012 .

[67]  Luiz Carlos Estraviz Rodriguez,et al.  A Method for Optimizing Height Threshold When Computing Airborne Laser Scanning Metrics , 2017 .

[68]  Martin Simonovsky,et al.  Large-Scale Point Cloud Semantic Segmentation with Superpoint Graphs , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[69]  Bisheng Yang,et al.  Automated Extraction of Road Markings from Mobile Lidar Point Clouds , 2012 .

[70]  Florent Lafarge,et al.  Creating Large-Scale City Models from 3D-Point Clouds: A Robust Approach with Hybrid Representation , 2012, International Journal of Computer Vision.

[71]  Naoto Yokoya,et al.  Learning a Robust Local Manifold Representation for Hyperspectral Dimensionality Reduction , 2017, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[72]  Xiaohua Tong,et al.  Classification of LiDAR Point Clouds Using Supervoxel-Based Detrended Feature and Perception-Weighted Graphical Model , 2020, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[73]  Wei Huang,et al.  A Convolutional Neural Network-Based 3D Semantic Labeling Method for ALS Point Clouds , 2017, Remote. Sens..

[74]  Uwe Soergel,et al.  Relevance assessment of full-waveform lidar data for urban area classification , 2011 .