论文信息 - JSPNet: Learning joint semantic & instance segmentation of point clouds via feature self-similarity and cross-task probability

JSPNet: Learning joint semantic & instance segmentation of point clouds via feature self-similarity and cross-task probability

Abstract In this paper, we propose a novel method named JSPNet, to segment 3D point cloud in semantic and instance simultaneously. First, we analyze the problem in addressing joint semantic and instance segmentation, including the common ground of cooperation of two tasks, conflict of two tasks, quadruplet relation between semantic and instance distributions, and ignorance of existing works. Then we introduce our method to reinforce mutual cooperation and alleviate the essential conflict. Our method has a shared encoder and two decoders to address two tasks. Specifically, to maintain discriminative features and characterize inconspicuous content, a similarity-based feature fusion module is designed to locate the inconspicuous area in the feature of current branch and then select related features from the other branch to compensate for the unclear content. Furthermore, given the salient semantic feature and the salient instance feature, a cross-task probability-based feature fusion module is developed to establish the probabilistic correlation between semantic and instance features. This module could transform features from one branch and further fuse them with the other branch by multiplying probabilistic matrix. Experimental results on a large-scale 3D indoor point cloud dataset S3DIS and a part-segmentation dataset ShapeNet have demonstrated the superiority of our method over existing state-of-the-arts in both semantic and instance segmentation. The proposed method outperforms PointNet with 12% and 26% improvements and outperforms ASIS with 2.7% and 4.3% improvements in terms of mIoU and mPre. Code of this work has been made available at https://github.com/Chenfeng1271/JSPNet .

[1] Huimin Lu,et al. Chinese Image Captioning via Fuzzy Attention-based DenseNet-BiLSTM , 2021, ACM Trans. Multim. Comput. Commun. Appl..

[2] Qiang Zhang,et al. Automatic Comic Generation with Stylistic Multi-page Layouts and Emotion-driven Text Balloon Generation , 2021 .

[3] Sami Sieranoja,et al. How much can k-means be improved by using better initialization and repeats? , 2019, Pattern Recognit..

[4] Dorin Comaniciu,et al. Mean Shift: A Robust Approach Toward Feature Space Analysis , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[5] Pascal Vincent,et al. Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6] Lei Zhang,et al. Deep CNNs Meet Global Covariance Pooling: Better Representation and Generalization , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7] Jian Zhang,et al. Constructing multilayer locality-constrained matrix regression framework for noise robust face super-resolution , 2021, Pattern Recognit..

[8] Luc Van Gool,et al. Semantic Instance Segmentation for Autonomous Driving , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[9] Huimin Lu,et al. Underwater image dehazing using joint trilateral filter , 2014, Comput. Electr. Eng..

[10] Huimin Lu,et al. Brain Intelligence: Go beyond Artificial Intelligence , 2017, Mobile Networks and Applications.

[11] Luc Van Gool,et al. Semantic Instance Segmentation with a Discriminative Loss Function , 2017, ArXiv.

[12] Wang Peng,et al. Numerical and experimental study on the maneuverability of an active propeller control based wave glider , 2020 .

[13] Roberto Cipolla,et al. SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14] Charles A. Micchelli,et al. Learning Multiple Tasks with Kernel Methods , 2005, J. Mach. Learn. Res..

[15] Lizhong Xu,et al. Construction of a Hierarchical Feature Enhancement Network and Its Application in Fault Recognition , 2021, IEEE Transactions on Industrial Informatics.

[16] Jian Yang,et al. Learning robust and discriminative low-rank representations for face recognition with occlusion , 2017, Pattern Recognit..