论文信息 - Semantic Segmentation of Large-Scale Outdoor Point Clouds by Encoder-Decoder Shared MLPs with Multiple Losses

Semantic Segmentation of Large-Scale Outdoor Point Clouds by Encoder-Decoder Shared MLPs with Multiple Losses

Semantic segmentation of large-scale outdoor 3D LiDAR point clouds becomes essential to understand the scene environment in various applications, such as geometry mapping, autonomous driving, and more. With an advantage of being a 3D metric space, 3D LiDAR point clouds, on the other hand, pose a challenge for a deep learning approach, due to their unstructured, unorder, irregular, and large-scale characteristics. Therefore, this paper presents an encoder–decoder shared multi-layer perceptron (MLP) with multiple losses, to address an issue of this semantic segmentation. The challenge rises a trade-off between efficiency and effectiveness in performance. To balance this trade-off, we proposed common mechanisms, which is simple and yet effective, by defining a random point sampling layer, an attention-based pooling layer, and a summation of multiple losses integrated with the encoder–decoder shared MLPs method for the large-scale outdoor point clouds semantic segmentation. We conducted our experiments on the following two large-scale benchmark datasets: Toronto-3D and DALES dataset. Our experimental results achieved an overall accuracy (OA) and a mean intersection over union (mIoU) of both the Toronto-3D dataset, with 83.60% and 71.03%, and the DALES dataset, with 76.43% and 59.52%, respectively. Additionally, our proposed method performed a few numbers of parameters of the model, and faster than PointNet++ by about three times during inferencing.

[1] Leonidas J. Guibas,et al. PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2] Ming-Ming Cheng,et al. LSANet: Feature Learning on Point Sets by Local Spatial Aware Layer , 2019 .

[3] Xiaoqin Zeng,et al. A Review of Deep Learning Research , 2019, KSII Trans. Internet Inf. Syst..

[4] Mohammed Bennamoun,et al. Deep Learning for 3D Point Clouds: A Survey , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5] Michael Felsberg,et al. Deep Projective 3D Semantic Segmentation , 2017, CAIP.

[6] Jeonghwan Gwak,et al. A Review of Intelligent Self-Driving Vehicle Software Research , 2019, KSII Trans. Internet Inf. Syst..

[7] Wei Wu,et al. PointCNN: Convolution On X-Transformed Points , 2018, NeurIPS.

[8] Naveed Akhtar,et al. Spherical Convolutional Neural Network for 3D Point Clouds , 2018, ArXiv.

[9] Andrew Markham,et al. Robust Attentional Aggregation of Deep Feature Sets for Multi-view 3D Reconstruction , 2018, International Journal of Computer Vision.

[10] Junru Yin,et al. Road Damage Detection and Classification based on Multi-level Feature Pyramids , 2021, KSII Transactions on Internet and Information Systems.

[11] Cewu Lu,et al. PointSIFT: A SIFT-like Network Module for 3D Point Cloud Semantic Segmentation , 2018, ArXiv.

[12] Yue Wang,et al. Dynamic Graph CNN for Learning on Point Clouds , 2018, ACM Trans. Graph..

[13] Saifullahi Aminu Bello,et al. Review: deep learning on 3D point clouds , 2020, Remote. Sens..

[14] Cheol Mun,et al. Intelligent Hybrid Fusion Algorithm with Vision Patterns for Generation of Precise Digital Road Maps in Self-driving Vehicles , 2020, KSII Trans. Internet Inf. Syst..