论文信息 - 3D Point Cloud Recognition Based on a Multi-View Convolutional Neural Network

3D Point Cloud Recognition Based on a Multi-View Convolutional Neural Network

The recognition of three-dimensional (3D) lidar (light detection and ranging) point clouds remains a significant issue in point cloud processing. Traditional point cloud recognition employs the 3D point clouds from the whole object. Nevertheless, the lidar data is a collection of two-and-a-half-dimensional (2.5D) point clouds (each 2.5D point cloud comes from a single view) obtained by scanning the object within a certain field angle by lidar. To deal with this problem, we initially propose a novel representation which expresses 3D point clouds using 2.5D point clouds from multiple views and then we generate multi-view 2.5D point cloud data based on the Point Cloud Library (PCL). Subsequently, we design an effective recognition model based on a multi-view convolutional neural network. The model directly acts on the raw 2.5D point clouds from all views and learns to get a global feature descriptor by fusing the features from all views by the view fusion network. It has been proved that our approach can achieve an excellent recognition performance without any requirement for three-dimensional reconstruction and the preprocessing of point clouds. In conclusion, this paper can effectively solve the recognition problem of lidar point clouds and provide vital practical value.

[1] John E. Wessel,et al. Aerospace applications of optical sensing with lidar , 1996, 1996 IEEE Aerospace Applications Conference. Proceedings.

[2] P. McManamon. Aerospace Applications Of Lidar For DOD , 1991, LEOS 1991 Summer Topical Meetings on Spaceborne Photonics: Aerospace Applications of Lasers and Electro-Optics, Optical Millimeter-Wave Interactions: Measurements, Generation, Transmission and Control.

[3] Guigang Zhang,et al. Deep Learning , 2016, Int. J. Semantic Comput..

[4] Jian Liu,et al. A Framework for Applying Point Clouds Grabbed by Multi-Beam LIDAR in Perceiving the Driving Environment , 2015, Sensors.

[5] Szymon Rusinkiewicz,et al. Rotation Invariant Spherical Harmonic Representation of 3D Shape Descriptors , 2003, Symposium on Geometry Processing.

[6] Ryan Halterman,et al. Velodyne HDL-64E lidar for unmanned surface vehicle obstacle detection , 2010, Defense + Commercial Sensing.

[7] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[8] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[9] Danil V. Prokhorov,et al. A Convolutional Learning System for Object Classification in 3-D Lidar Data , 2010, IEEE Transactions on Neural Networks.

[10] Zhichao Zhou,et al. DeepPano: Deep Panoramic Representation for 3-D Shape Recognition , 2015, IEEE Signal Processing Letters.

[11] Geoffrey E. Hinton,et al. Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[12] Marwan Hussein,et al. 3D imaging lidar for lunar robotic exploration , 2009, Defense + Commercial Sensing.

[13] Sebastian Scherer,et al. VoxNet: A 3D Convolutional Neural Network for real-time object recognition , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[14] Ulrich Neumann,et al. Fast and Robust Multi-view 3D Object Recognition in Point Clouds , 2015, 2015 International Conference on 3D Vision.

[15] C. Qi. Deep Learning on Point Sets for 3 D Classification and Segmentation , 2016 .

[16] Tara N. Sainath,et al. FUNDAMENTAL TECHNOLOGIES IN MODERN SPEECH RECOGNITION Digital Object Identifier 10.1109/MSP.2012.2205597 , 2012 .

[17] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[18] Jianxiong Xiao,et al. 3D ShapeNets: A deep representation for volumetric shapes , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[19] Andrew W. Senior,et al. Fast and accurate recurrent neural network acoustic models for speech recognition , 2015, INTERSPEECH.

[20] Ioannis Pratikakis,et al. Ensemble of PANORAMA-based convolutional neural networks for 3D model classification and retrieval , 2017, Comput. Graph..

[21] Quoc V. Le,et al. Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[22] Samy Bengio,et al. Order Matters: Sequence to sequence for sets , 2015, ICLR.

[23] Ming Ouhyoung,et al. On Visual Similarity Based 3D Model Retrieval , 2003, Comput. Graph. Forum.

[24] Xinmin Wang,et al. Research and application on improved BP neural network algorithm , 2010, 2010 5th IEEE Conference on Industrial Electronics and Applications.

[25] Ji Wan,et al. Multi-view 3D Object Detection Network for Autonomous Driving , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26] Ulrich Neumann,et al. 3D point cloud object detection with multi-view convolutional neural network , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[27] Subhransu Maji,et al. Multi-view Convolutional Neural Networks for 3D Shape Recognition , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[28] Leonidas J. Guibas,et al. PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).