SCANet: A Spatial and Channel Attention based Network for Partial-to-Partial Point Cloud Registration

ABSTRACT Point cloud registration plays an essential role in many areas, such as computer vision and robotics. However, traditional feature-based registration requires handcrafted descriptors for various scenarios, which is of low efficiency and flexibility; ICP and its locally optimal variants are sensitive to initialization, while globally optimal methods are of high computational time to overcome noise, outliers, and partial overlap. Learning-based registration can automatically and flexibly learn shape representation for different objects, but existing methods are of either low efficiency or low precision, and poorly perform in partial-to-partial point cloud registration. Thus, we present a simple spatial and channel attention based network, named SCANet, for partial-to-partial point cloud registration. A spatial self-attention aggregation (SSA) module is applied in a feature extraction sub-network to efficiently make use of the inter and global information of each point cloud in different levels, while a channel cross-attention regression (CCR) module is adopted in a pose estimation sub-network for information interaction between two input global feature vectors, enhancing relevant information and suppressing redundant information. Experimental results show that our SCANet achieves state-of-the-art performances in both accuracy and efficiency compared to existing non-deep learning and learning-based methods on partial visibility with Gaussian noise. Our source code is available at the project website https://github.com/zhouruqin/SCANet .

[1]  Vladlen Koltun,et al.  Fast Global Registration , 2016, ECCV.

[2]  Thierry Monteil,et al.  Mixed integer linear programming for quality of service optimization in Clouds , 2017, Future Gener. Comput. Syst..

[3]  Jan Kautz,et al.  HGMR: Hierarchical Gaussian Mixtures for Adaptive 3D Registration , 2018, ECCV.

[4]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[5]  Rama Chellappa,et al.  3DRegNet: A Deep Neural Network for 3D Point Registration , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[6]  Zi Jian Yew,et al.  RPM-Net: Robust Point Matching Using Learned Features , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Simon Lucey,et al.  Deterministic PointNetLK for Generalized Registration , 2020, ArXiv.

[8]  Jitendra Malik,et al.  Recognizing Objects in Range Data Using Regional Point Descriptors , 2004, ECCV.

[9]  Shiyu Song,et al.  DeepVCP: An End-to-End Deep Neural Network for Point Cloud Registration , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[10]  Liang Wang,et al.  Beyond Joints: Learning Representations From Primitive Geometries for Skeleton-Based Action Recognition and Detection , 2018, IEEE Transactions on Image Processing.

[11]  Jie Ma,et al.  Local voxelized structure for 3D binary feature representation and robust registration of point clouds from low-cost sensors , 2018, Inf. Sci..

[12]  Hideo Saito,et al.  CorsNet: 3D Point Cloud Registration by Deep Neural Network , 2020, IEEE Robotics and Automation Letters.

[13]  Federico Tombari,et al.  Unique Signatures of Histograms for Local Surface Description , 2010, ECCV.

[14]  Yasuhiro Aoki,et al.  PointNetLK: Robust & Efficient Point Cloud Registration Using PointNet , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Yue Wang,et al.  Deep Closest Point: Learning Representations for Point Cloud Registration , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[16]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[17]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[18]  Wei Chen,et al.  Stereo Matching Using Multi-Level Cost Volume and Multi-Scale Feature Constancy , 2021, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Xilin Chen,et al.  Cross Attention Network for Few-shot Classification , 2019, NeurIPS.

[20]  Weisi Lin,et al.  3DHoPD: A Fast Low-Dimensional 3-D Descriptor , 2017, IEEE Robotics and Automation Letters.

[21]  Jiaolong Yang,et al.  Go-ICP: Solving 3D Registration Efficiently and Globally Optimally , 2013, 2013 IEEE International Conference on Computer Vision.

[22]  Jianxiong Xiao,et al.  3D ShapeNets: A deep representation for volumetric shapes , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Lingjing Wang,et al.  Deep-3DAligner: Unsupervised 3D Point Set Registration Network With Optimizable Latent Vector , 2020, ArXiv.

[24]  Nico Blodow,et al.  Fast Point Feature Histograms (FPFH) for 3D registration , 2009, 2009 IEEE International Conference on Robotics and Automation.

[25]  Paul J. Besl,et al.  A Method for Registration of 3-D Shapes , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[26]  Yaron Lipman,et al.  Point registration via efficient convex relaxation , 2016, ACM Trans. Graph..

[27]  Mohammed Bennamoun,et al.  Rotational Projection Statistics for 3D Local Surface Description and Object Recognition , 2013, International Journal of Computer Vision.

[28]  Andrew E. Johnson,et al.  Using Spin Images for Efficient Object Recognition in Cluttered 3D Scenes , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[29]  Yue Wang,et al.  PRNet: Self-Supervised Learning for Partial-to-Partial Registration , 2019, NeurIPS.

[30]  Leonidas J. Guibas,et al.  PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[31]  Mohammed Bennamoun,et al.  Deep Learning for 3D Point Clouds: A Survey , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32]  Leonidas J. Guibas,et al.  PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space , 2017, NIPS.