A Two-Stage Pillar Feature-Encoding Network for Pillar-Based 3D Object Detection

Three-dimensional object detection plays a vital role in the field of environment perception in autonomous driving, and its results are crucial for the subsequent processes. Pillar-based 3D object detection is a method to detect objects in 3D by dividing point cloud data into pillars and extracting features from each pillar. However, the current pillar-based 3D object-detection methods suffer from problems such as “under-segmentation” and false detections in overlapping and occluded scenes. To address these challenges, we propose an improved pillar-based 3D object-detection network with a two-stage pillar feature-encoding (Ts-PFE) module that considers both inter- and intra-relational features among and in the pillars. This novel approach enhances the model’s ability to identify the local structure and global distribution of the data, which improves the distinction between objects in occluded and overlapping scenes and ultimately reduces under-segmentation and false detection problems. Furthermore, we use the attention mechanism to improve the backbone and make it focus on important features. The proposed approach is evaluated on the KITTI dataset. The experimental results show that the detection accuracy of the proposed approach are significantly improved on the benchmarks of BEV and 3D. The improvement of AP for car, pedestrian, and cyclist 3D detection are 1.1%, 3.78%, and 2.23% over PointPillars.

[1]  Yuyang Peng,et al.  Pillar-Based 3D Object Detection from Point Cloud with Multiattention Mechanism , 2023, Wireless Communications and Mobile Computing.

[2]  Lei Zhou,et al.  High-Throughput Instance Segmentation and Shape Restoration of Overlapping Vegetable Seeds Based on Sim2real Method , 2022, SSRN Electronic Journal.

[3]  Simegnew Yihunie Alaba,et al.  A Survey on Deep-Learning-Based LiDAR 3D Object Detection for Autonomous Driving , 2022, Sensors.

[4]  Guoxin Zhang,et al.  SAT-GCN: Self-attention graph convolutional network-based 3D object detection for autonomous driving , 2022, Knowl. Based Syst..

[5]  Zongxu Pan,et al.  Muti-Frame Point Cloud Feature Fusion Based on Attention Mechanisms for 3D Object Detection , 2022, Sensors.

[6]  Yanyan Zhang,et al.  Point-attention Net: a graph attention convolution network for point cloudsegmentation , 2022, Applied Intelligence.

[7]  Liang Du,et al.  Modify Self-Attention via Skeleton Decomposition for Effective Point Cloud Transformer , 2022, AAAI.

[8]  Gaihua Wang,et al.  Cross self-attention network for 3D point cloud , 2022, Knowl. Based Syst..

[9]  M. Barth,et al.  PillarGrid: Deep Learning-Based Cooperative Perception for 3D Object Detection from Onboard-Roadside LiDAR , 2022, 2022 IEEE 25th International Conference on Intelligent Transportation Systems (ITSC).

[10]  Ralph R. Martin,et al.  Attention mechanisms in computer vision: A survey , 2021, Computational Visual Media.

[11]  Chongyi Li,et al.  Investigating Attention Mechanism in 3D Point Cloud Object Detection , 2021, 2021 International Conference on 3D Vision (3DV).

[12]  Feng Gao,et al.  A New Density-Based Clustering Method Considering Spatial Distribution of Lidar Point Cloud for Object Detection of Autonomous Driving , 2021, Electronics.

[13]  Jian Cheng,et al.  3D-CenterNet: 3D object detection network for point clouds with center estimation priority , 2021, Pattern Recognit..

[14]  Shiliang Pu,et al.  RangeIoUDet: Range Image based Real-Time 3D Object Detector Optimized by Intersection over Union , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  João L. Monteiro,et al.  Point-cloud based 3D object detection and classification methods for self-driving applications: A survey and taxonomy , 2021, Inf. Fusion.

[16]  Xuan Xiong,et al.  RangeDet: In Defense of Range View for LiDAR-based 3D Object Detection , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[17]  Jenq-Neng Hwang,et al.  RODNet: A Real-Time Radar Object Detection Network Cross-Supervised by Camera-Radar Fused Object 3D Localization , 2021, IEEE Journal of Selected Topics in Signal Processing.

[18]  Krzysztof Czarnecki,et al.  SA-Det3D: Self-Attention Based Context-Aware 3D Object Detection , 2021, 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW).

[19]  Hao Jiang,et al.  PSANet: Pyramid Splitting and Aggregation Network for 3D Object Detection in Point Cloud , 2020, Sensors.

[20]  Ying Nian Wu,et al.  Generative VoxelNet: Learning Energy-Based Models for 3D Shape Synthesis and Analysis , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Philipp Krähenbühl,et al.  Center-based 3D Object Detection and Tracking , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[22]  Lei Zhang,et al.  Structure Aware Single-Stage 3D Object Detection From Point Cloud , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Saifullahi Aminu Bello,et al.  Review: deep learning on 3D point clouds , 2020, Remote. Sens..

[24]  Xiaogang Wang,et al.  PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Xin Zhao,et al.  TANet: Robust 3D Object Detection from Point Clouds with Triple Attention , 2019, AAAI.

[26]  Gang Wang,et al.  Multi-View Frustum Pointnet for Object Detection in Autonomous Driving , 2019, 2019 IEEE International Conference on Image Processing (ICIP).

[27]  Frank Lindseth,et al.  Multimodal 3D Object Detection from Simulated Pretraining , 2019, NAIS.

[28]  Xin Zhao,et al.  3D Object Detection Using Scale Invariant and Feature Reweighting Networks , 2019, AAAI.

[29]  Jiong Yang,et al.  PointPillars: Fast Encoders for Object Detection From Point Clouds , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  Bin Yang,et al.  HDNET: Exploiting HD Maps for 3D Object Detection , 2018, CoRL.

[31]  Bo Li,et al.  SECOND: Sparsely Embedded Convolutional Detection , 2018, Sensors.

[32]  Bin Yang,et al.  PIXOR: Real-time 3D Object Detection from Point Clouds , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[33]  Jiaxin Li,et al.  SO-Net: Self-Organizing Network for Point Cloud Analysis , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[34]  Leonidas J. Guibas,et al.  Frustum PointNets for 3D Object Detection from RGB-D Data , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[35]  Yin Zhou,et al.  VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[36]  Enhua Wu,et al.  Squeeze-and-Excitation Networks , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[37]  Ross B. Girshick,et al.  Focal Loss for Dense Object Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[38]  Leonidas J. Guibas,et al.  PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space , 2017, NIPS.

[39]  Leonidas J. Guibas,et al.  PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[40]  Wei Liu,et al.  SSD: Single Shot MultiBox Detector , 2015, ECCV.

[41]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[42]  Andreas Geiger,et al.  Are we ready for autonomous driving? The KITTI vision benchmark suite , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[43]  Yang Zhao,et al.  DA-Net: Density-Aware 3D Object Detection Network for Point Clouds , 2023, IEEE Transactions on Multimedia.

[44]  Bo Wang,et al.  Real-Time 3D Object Detection From Point Cloud Through Foreground Segmentation , 2021, IEEE Access.

[45]  Zhenhong Du,et al.  Building Damage Detection Using U-Net with Attention Mechanism from Pre- and Post-Disaster Remote Sensing Datasets , 2021, Remote. Sens..

[46]  Bin Dai,et al.  SCNet: Subdivision Coding Network for Object Detection Based on 3D Point Cloud , 2019, IEEE Access.

[47]  Р Ю Чуйков,et al.  Обнаружение транспортных средств на изображениях загородных шоссе на основе метода Single shot multibox Detector , 2017 .

[48]  C. Qi Deep Learning on Point Sets for 3 D Classification and Segmentation , 2016 .