论文信息 - PL-Net3d: Robust 3D Object Class Recognition Using Geometric Models

PL-Net3d: Robust 3D Object Class Recognition Using Geometric Models

Three-dimensional point clouds produced by 3D scanners are often noisy and contain outliers. Such data inaccuracies can significantly affect current deep learning-based methods and reduce their ability to classify objects. Most deep neural networks-based object classification methods were targeted to achieve high classification accuracy without considering classification robustness. Thus, despite their great success, they still fail to achieve good classification accuracy with low levels of noise and outliers. This work is carried out to develop a robust network structure that can solidly identify objects. The proposed method uses patches of planar segments, which can robustly capture object appearance. The planar segments information are then fed into a deep neural network for classification. We base our approach on the PointNet deep learning architecture. Our method was tested against several kinds of data inaccuracies such as scattered outliers, clustered outliers, noise and missing points. The proposed method shows excellent performance in the presence of these inaccuracies compared to state-of-the-art techniques. By decomposing objects into planes, the suggested method is simple, fast, provides good classification accuracy and can handle different kinds of point cloud data inaccuracies. The code can be found at https://github.com/AymanMukh/Pl-Net3D

[1] Hsi-Yung Feng,et al. Effects of scanning orientation on outlier formation in 3D laser scanning of reflective surfaces , 2016 .

[2] Jiri Matas,et al. Matching with PROSAC - progressive sample consensus , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[3] Slawomir J. Nasuto,et al. NAPSAC: High Noise, High Dimensional Robust Estimation - it's in the Bag , 2002, BMVC.

[4] Tony DeRose,et al. Surface reconstruction from unorganized points , 1992, SIGGRAPH.

[5] Yin Zhou,et al. VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[6] Sebastian Scherer,et al. VoxNet: A 3D Convolutional Neural Network for real-time object recognition , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[7] Jianxiong Xiao,et al. 3D ShapeNets: A deep representation for volumetric shapes , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[8] Dorin Comaniciu,et al. Mean Shift: A Robust Approach Toward Feature Space Analysis , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[9] Andrea Fusiello,et al. Robust Multiple Structures Estimation with J-Linkage , 2008, ECCV.

[10] Victor S. Lempitsky,et al. Escape from Cells: Deep Kd-Networks for the Recognition of 3D Point Cloud Models , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[11] Andrew Zisserman,et al. MLESAC: A New Robust Estimator with Application to Estimating Image Geometry , 2000, Comput. Vis. Image Underst..

[12] Junsong Yuan,et al. Multi-view Harmonized Bilinear Network for 3D Object Recognition , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[13] Ioannis Pratikakis,et al. Ensemble of PANORAMA-based convolutional neural networks for 3D model classification and retrieval , 2017, Comput. Graph..

[14] Chao Chen,et al. ClusterNet: Deep Hierarchical Cluster Network With Rigorously Rotation-Invariant Representation for Point Cloud Analysis , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[15] Ludovico Minto,et al. Deep learning for 3D shape classification from multiple depth maps , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[16] Andrew Zisserman,et al. Robust computation and parametrization of multiple view relations , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[17] José García Rodríguez,et al. LonchaNet: A sliced-based CNN architecture for real-time 3D object recognition , 2017, 2017 International Joint Conference on Neural Networks (IJCNN).

[18] Yue Gao,et al. PVNet: A Joint Convolutional Network of Point Cloud and Multi-View for 3D Shape Recognition , 2018, ACM Multimedia.

[19] Reza Bosagh Zadeh,et al. FusionNet: 3D Object Classification Using Multiple Data Representations , 2016, ArXiv.

[20] Alireza Bab-Hadiashar,et al. A comparative study of model selection criteria for computer vision applications , 2008, Image Vis. Comput..

[21] Tat-Jun Chin,et al. Simultaneously Fitting and Segmenting Multiple-Structure Data with Outliers , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22] Didier Stricker,et al. Robust Outlier Removal from Point Clouds Acquired with Structured Light , 2012, Eurographics.

[23] Subhransu Maji,et al. A Deeper Look at 3D Shape Classifiers , 2018, ECCV Workshops.

[24] Ioannis Pratikakis,et al. Exploiting the PANORAMA Representation for Convolutional Neural Network Classification and Retrieval , 2017, 3DOR@Eurographics.

[25] Robert C. Bolles,et al. Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[26] Tat-Jun Chin,et al. Robust fitting of multiple structures: The statistical learning approach , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[27] Dana H. Ballard,et al. Generalizing the Hough transform to detect arbitrary shapes , 1981, Pattern Recognit..

[28] Gernot Riegler,et al. OctNet: Learning Deep 3D Representations at High Resolutions , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[29] David Suter,et al. Robust segmentation of visual data using ranked unbiased scale estimate , 1999, Robotica.

[30] Zhenwei Cao,et al. Robust Model Fitting Using Higher Than Minimal Subset Sampling , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31] Subhransu Maji,et al. Multi-view Convolutional Neural Networks for 3D Shape Recognition , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[32] Yang Liu,et al. O-CNN , 2017, ACM Trans. Graph..

[33] Leonidas J. Guibas,et al. PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[34] Leonidas J. Guibas,et al. PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space , 2017, NIPS.

[35] Ersin Yumer,et al. 3D-PRNN: Generating Shape Primitives with Recurrent Neural Networks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[36] Zhichao Zhou,et al. DeepPano: Deep Panoramic Representation for 3-D Shape Recognition , 2015, IEEE Signal Processing Letters.

[37] Ruwan Tennakoon,et al. Effective Sampling: Fast Segmentation Using Robust Geometric Model Fitting , 2017, IEEE Transactions on Image Processing.

[38] Yasuyuki Matsushita,et al. RotationNet: Joint Object Categorization and Pose Estimation Using Multiviews from Unsupervised Viewpoints , 2016, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[39] Li Li,et al. A Novel Octree-Based 3-D Fully Convolutional Neural Network for Point Cloud Classification in Road Environment , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[40] Leonidas J. Guibas,et al. Volumetric and Multi-view CNNs for Object Classification on 3D Data , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[41] Jiaxin Li,et al. SO-Net: Self-Organizing Network for Point Cloud Analysis , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[42] Jiajun Wu,et al. Synthesizing 3D Shapes via Modeling Multi-view Depth Maps and Silhouettes with Deep Generative Networks , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[43] Stefan Leutenegger,et al. Pairwise Decomposition of Image Sequences for Active Multi-view Recognition , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[44] Jana Kosecka,et al. Nonparametric Estimation of Multiple Structures with Outliers , 2006, WDV.

[45] Ruwan Tennakoon,et al. Comparative Analysis of 3D Shape Recognition in the Presence of Data Inaccuracies , 2019, 2019 IEEE International Conference on Image Processing (ICIP).

[46] David W. Rosen,et al. Rotation Invariant Convolutions for 3D Point Clouds Deep Learning , 2019, 2019 International Conference on 3D Vision (3DV).

[47] Longin Jan Latecki,et al. GIFT: A Real-Time and Scalable 3D Shape Search Engine , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).