MVF-CNN: Fusion of Multilevel Features for Large-Scale Point Cloud Classification

Due to the disorder, sparsity, and irregularity of point clouds, the accurate classification of large-scale point clouds is a challenging problem. Voxelization-based deep learning methods have been applied to point cloud classification and have achieved good performances. However, there are some problems in the methods, such as voxels lacking color information, single receptive fields being considered only at the same voxel scale, and only the global features of voxels being considered in the classification. This paper proposes a deep learning-based algorithm for large-scale point cloud classification through the fusion of multiscale voxels and features (MVF-CNN). First, the point cloud is transformed into voxels of two different sizes. Then, the voxels are input into the 3D convolutional neural network (3D CNN) with three different scale receptive fields for the feature extraction. Next, the output features of the 3D CNN are input into the proposed global and local feature fusion classification network (GLNet), which fuses the global features of the voxels at the main branch and the local features of each voxel at the auxiliary branch. Finally, the multiscale features of the main branch are fused, and its classification results are obtained. We have conducted the experiments on six-point cloud scenes. The experimental results show that the proposed algorithm accurately classifies large-scale point clouds. Compared with several semisupervised/supervised learning methods, the proposed algorithm obtains better classification results. In addition, the experimental results also demonstrate that the proposed algorithm has a strong generalization ability and obviously has a better classification performance than the compared algorithms.

[1]  Liqiang Zhang,et al.  Large-scale urban point cloud labeling and reconstruction , 2018 .

[2]  Lixin Fan,et al.  Comprehensive Automated 3D Urban Environment Modelling Using Terrestrial Laser Scanning Point Cloud , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[3]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[4]  Ivor W. Tsang,et al.  Flexible Manifold Embedding: A Framework for Semi-Supervised and Unsupervised Dimension Reduction , 2010, IEEE Transactions on Image Processing.

[5]  Leonidas J. Guibas,et al.  Volumetric and Multi-view CNNs for Object Classification on 3D Data , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Jiaxin Li,et al.  SO-Net: Self-Organizing Network for Point Cloud Analysis , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[7]  Andrew E. Johnson,et al.  Using Spin Images for Efficient Object Recognition in Cluttered 3D Scenes , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Jing Huang,et al.  Point cloud labeling using 3D Convolutional Neural Network , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[9]  Michael Felsberg,et al.  Deep Projective 3D Semantic Segmentation , 2017, CAIP.

[10]  Xiang Li,et al.  LightNet: A Lightweight 3D Convolutional Neural Network for Real-Time 3D Object Recognition , 2017, 3DOR@Eurographics.

[11]  Silvio Savarese,et al.  SEGCloud: Semantic Segmentation of 3D Point Clouds , 2017, 2017 International Conference on 3D Vision (3DV).

[12]  Xuelong Li,et al.  Learning a Nonnegative Sparse Graph for Linear Regression , 2015, IEEE Transactions on Image Processing.

[13]  Martin Simonovsky,et al.  Large-Scale Point Cloud Semantic Segmentation with Superpoint Graphs , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[14]  Mikhail Belkin,et al.  Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples , 2006, J. Mach. Learn. Res..

[15]  François Goulette,et al.  Classification of Point Cloud Scenes with Multiscale Voxel Deep Network , 2018, ArXiv.

[16]  Anh Nguyen,et al.  3D point cloud segmentation: A survey , 2013, 2013 6th IEEE Conference on Robotics, Automation and Mechatronics (RAM).

[17]  Sebastian Scherer,et al.  VoxNet: A 3D Convolutional Neural Network for real-time object recognition , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[18]  Konrad Schindler,et al.  FAST SEMANTIC SEGMENTATION OF 3D POINT CLOUDS WITH STRONGLY VARYING DENSITY , 2016 .

[19]  Liqiang Zhang,et al.  Joint Margin, Cograph, and Label Constraints for Semisupervised Scene Parsing From Point Clouds , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[20]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[21]  Antonio Torralba,et al.  SIFT Flow: Dense Correspondence across Scenes and Its Applications , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Huang He,et al.  Precision Verification of 3D SLAM Backpacked Mobile Mapping Robot , 2016 .

[23]  Nicu Sebe,et al.  Discriminating Joint Feature Analysis for Multimedia Data Understanding , 2012, IEEE Transactions on Multimedia.

[24]  Wang Wei Outdoor scene comprehension of mobile robot based on multi-sensor information fusion , 2011 .

[25]  Lei Wang,et al.  MSNet: Multi-Scale Convolutional Network for Point Cloud Classification , 2018, Remote. Sens..

[26]  Wei Wu,et al.  PointCNN: Convolution On X-Transformed Points , 2018, NeurIPS.

[27]  Wei An,et al.  Binary Volumetric Convolutional Neural Networks for 3-D Object Recognition , 2019, IEEE Transactions on Instrumentation and Measurement.

[28]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[29]  Marc Pollefeys,et al.  Semantic3D.net: A new Large-scale Point Cloud Classification Benchmark , 2017, ArXiv.

[30]  Hao Liu,et al.  A hierarchical methodology for urban facade parsing from TLS point clouds , 2017 .

[31]  Sebastian Scherer,et al.  3D Convolutional Neural Networks for landing zone detection from LiDAR , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[32]  Leonidas J. Guibas,et al.  PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[33]  Leonidas J. Guibas,et al.  PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space , 2017, NIPS.

[34]  Subhransu Maji,et al.  Multi-view Convolutional Neural Networks for 3D Shape Recognition , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[35]  Alexandre Boulch,et al.  Unstructured Point Cloud Semantic Labeling Using Deep Segmentation Networks , 2017, 3DOR@Eurographics.

[36]  Jan Dirk Wegner,et al.  Mind the Gap: Modeling Local and Global Context in (Road) Networks , 2014, GCPR.

[37]  François Goulette,et al.  Semantic Classification of 3D Point Clouds with Multiscale Spherical Neighborhoods , 2018, 2018 International Conference on 3D Vision (3DV).