Robo3D: Towards Robust and Reliable 3D Perception against Corruptions

The robustness of 3D perception systems under natural corruptions from environments and sensors is pivotal for safety-critical applications. Existing large-scale 3D perception datasets often contain data that are meticulously cleaned. Such configurations, however, cannot reflect the reliability of perception models during the deployment stage. In this work, we present Robo3D, the first comprehensive benchmark heading toward probing the robustness of 3D detectors and segmentors under out-of-distribution scenarios against natural corruptions that occur in real-world environments. Specifically, we consider eight corruption types stemming from severe weather conditions, external disturbances, and internal sensor failure. We uncover that, although promising results have been progressively achieved on standard benchmarks, state-of-the-art 3D perception models are at risk of being vulnerable to corruptions. We draw key observations on the use of data representations, augmentation schemes, and training strategies, that could severely affect the model's performance. To pursue better robustness, we propose a density-insensitive training framework along with a simple flexible voxelization strategy to enhance the model resiliency. We hope our benchmark and approach could inspire future research in designing more robust and reliable 3D perception models. Our robustness benchmark suite is publicly available.

[1]  Tengyu Ma,et al.  DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds , 2023, 2023 IEEE/CVF International Conference on Computer Vision (ICCV).

[2]  Jun Zhu,et al.  Benchmarking Robustness of 3D Object Detection to Common Corruptions in Autonomous Driving , 2023, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  Y. Qiao,et al.  Rethinking Range View Representation for LiDAR Segmentation , 2023, 2023 IEEE/CVF International Conference on Computer Vision (ICCV).

[4]  Tengyu Ma,et al.  LoGoNet: Towards Accurate 3D Object Detection with Local-to-Global Cross- Modal Fusion , 2023, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Cihang Xie,et al.  On the Adversarial Robustness of Camera-based 3D Object Detection , 2023, ArXiv.

[6]  G. Puy,et al.  Using a Waffle Iron for Automotive Point Cloud Semantic Segmentation , 2023, ArXiv.

[7]  Dengxin Dai,et al.  Benchmarking the Robustness of LiDAR Semantic Segmentation Models , 2023, ArXiv.

[8]  Yiran Chen,et al.  : Joint Point Interaction-Dimension Search for 3D Point Cloud , 2022, 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV).

[9]  Cheng Zheng,et al.  Point-Voxel Adaptive Feature Abstraction for Robust Point Cloud Classification , 2022, ArXiv.

[10]  Xingjiao Wu,et al.  Homogeneous Multi-modal Feature Fusion and Interaction for 3D Object Detection , 2022, ECCV.

[11]  Qing Guo,et al.  Common Corruption Robustness of Point Cloud Detectors: Benchmark and Enhancement , 2022, ArXiv.

[12]  Yu Wang,et al.  CenterFormer: Center-based Transformer for 3D Object Detection , 2022, ECCV.

[13]  Fang Wen,et al.  MaskCLIP: Masked Self-Distillation Advances Contrastive Language-Image Pretraining , 2022, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Shijian Lu,et al.  PolarMix: A General Data Augmentation Technique for LiDAR Point Clouds , 2022, NeurIPS.

[15]  F. Khan,et al.  On the Robustness of 3D Object Detectors , 2022, MMAsia.

[16]  Xian-Feng Han,et al.  Cenet: Toward Concise and Efficient Lidar Semantic Segmentation for Autonomous Driving , 2022, 2022 IEEE International Conference on Multimedia and Expo (ICME).

[17]  Shenghui Cui,et al.  2DPASS: 2D Priors Assisted Semantic Segmentation on LiDAR Point Clouds , 2022, ECCV.

[18]  Haibo Qiu,et al.  GFNet: Geometric Flow Network for 3D Point Cloud Semantic Segmentation , 2022, Trans. Mach. Learn. Res..

[19]  Liang Pan,et al.  LaserMix for Semi-Supervised LiDAR Semantic Segmentation , 2022, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  Kaicheng Yu,et al.  Benchmarking the Robustness of LiDAR-Camera Fusion for 3D Object Detection , 2022, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[21]  Zhenhua Wang,et al.  CPGNet: Cascade Point-Grid Fusion Network for Real-Time LiDAR Semantic Segmentation , 2022, 2022 International Conference on Robotics and Automation (ICRA).

[22]  Haotian Tang,et al.  TorchSparse: Efficient Point Cloud Inference Engine , 2022, MLSys.

[23]  L. Gool,et al.  LiDAR Snowfall Simulation for Robust 3D Object Detection , 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[24]  Jiwen Lu,et al.  LiDAR Distillation: Bridging the Beam-Induced Domain Gap for 3D Object Detection , 2022, ECCV.

[25]  Oğuzhan Fatih Kar,et al.  3D Common Corruptions and Data Augmentation , 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Ziwei Liu,et al.  Benchmarking and Analyzing Point Cloud Classification under Corruptions , 2022, ICML.

[27]  Siyuan Huang,et al.  Multi-modal Sensor Fusion for Auto Driving Perception: A Survey , 2022, ArXiv.

[28]  Zhiding Yu,et al.  Benchmarking Robustness of 3D Point Cloud Recognition Against Common Corruptions , 2022, ArXiv.

[29]  Saasha Nair,et al.  On the Real-World Adversarial Robustness of Real-Time Semantic Segmentation Models for Autonomous Driving , 2022, IEEE transactions on neural networks and learning systems.

[30]  Venice Erin Liong,et al.  ConDA: Unsupervised Domain Adaptation for LiDAR Segmentation via Regularized Domain Concatenation , 2021, 2023 IEEE International Conference on Robotics and Automation (ICRA).

[31]  Ross B. Girshick,et al.  Masked Autoencoders Are Scalable Vision Learners , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[32]  Cem Karaoguz,et al.  A methodology for analyzing the impact of crosstalk on LIDAR measurements , 2021, 2021 IEEE Sensors.

[33]  Yap-Peng Tan,et al.  Benchmarking the Robustness of Spatial-Temporal Models Against Corruptions , 2021, NeurIPS Datasets and Benchmarks.

[34]  Alexey Nekrasov,et al.  Mix3D: Out-of-Context Data Augmentation for 3D Scenes , 2021, 2021 International Conference on 3D Vision (3DV).

[35]  Qi Alfred Chen,et al.  Sensor Adversarial Traits: Analyzing Robustness of 3D Object Detection Sensor Fusion Models , 2021, 2021 IEEE International Conference on Image Processing (ICIP).

[36]  Rohit Mohan,et al.  Panoptic nuScenes: A Large-Scale Benchmark for LiDAR Panoptic Segmentation and Tracking , 2021, IEEE Robotics and Automation Letters.

[37]  Yiming Zhao,et al.  FIDNet: LiDAR Point Cloud Semantic Segmentation with Fully Interpolation Decoding , 2021, 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[38]  Minzhe Niu,et al.  Voxel Transformer for 3D Object Detection , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[39]  Bing Deng,et al.  Improving 3D Object Detection with Channel-wise Transformer , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[40]  Luc Van Gool,et al.  Fog Simulation on Real LiDAR Point Clouds for 3D Object Detection in Adverse Weather , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[41]  Shijian Lu,et al.  Transfer Learning from Synthetic to Real LiDAR Point Cloud for Semantic Segmentation , 2021, AAAI.

[42]  Roozbeh Mottaghi,et al.  RobustNav: Towards Benchmarking Robustness in Embodied Navigation , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[43]  Ping Luo,et al.  When Human Pose Estimation Meets Robustness: Adversarial Algorithms and Benchmarks , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[44]  Ruigang Yang,et al.  Invisible for both Camera and LiDAR: Security of Multi-Sensor Fusion based Perception in Autonomous Driving Under Physical-World Attacks , 2021, 2021 IEEE Symposium on Security and Privacy (SP).

[45]  Philippe Giguere,et al.  Lidar Scan Registration Robust to Extreme Motions , 2021, 2021 18th Conference on Robots and Vision (CRV).

[46]  Zhichao Li,et al.  LiDAR R-CNN: An Efficient and Universal 3D Object Detector , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[47]  Shiliang Pu,et al.  RPVNet: A Deep and Efficient Range-Point-Voxel Fusion Network for LiDAR Point Cloud Segmentation , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[48]  Jianping Shi,et al.  PV-RCNN++: Point-Voxel Feature Set Abstraction With Local Vector Representation for 3D Object Detection , 2021, International Journal of Computer Vision.

[49]  Wengang Zhou,et al.  Voxel R-CNN: Towards High Performance Voxel-based 3D Object Detection , 2020, AAAI.

[50]  Venice Erin Liong,et al.  AMVNet: Assertion-based Multi-View Fusion Network for LiDAR Semantic Segmentation , 2020, ArXiv.

[51]  Xinge Zhu,et al.  Cylindrical and Asymmetrical 3D Convolution Networks for LiDAR Segmentation , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[52]  Song Han,et al.  Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution , 2020, ECCV.

[53]  Philipp Krähenbühl,et al.  Center-based 3D Object Detection and Tracking , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[54]  Raquel Urtasun,et al.  Physically Realizable Adversarial Examples for LiDAR Object Detection , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[55]  Philip David,et al.  PolarNet: An Improved Grid Representation for Online LiDAR Point Clouds Semantic Segmentation , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[56]  Eren Erdal Aksoy,et al.  SalsaNext: Fast, Uncertainty-Aware Semantic Segmentation of LiDAR Point Clouds , 2020, ISVC.

[57]  Weijing Shi,et al.  Point-GNN: Graph Neural Network for 3D Object Detection in a Point Cloud , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[58]  Yanan Sun,et al.  3DSSD: Point-Based 3D Single Stage Object Detector , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[59]  Xiaogang Wang,et al.  PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[60]  Mohammed Bennamoun,et al.  Deep Learning for 3D Point Clouds: A Survey , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[61]  Dragomir Anguelov,et al.  Scalability in Perception for Autonomous Driving: Waymo Open Dataset , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[62]  Raoul de Charette,et al.  xMUDA: Cross-Modal Unsupervised Domain Adaptation for 3D Semantic Segmentation , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[63]  Cyrill Stachniss,et al.  RangeNet ++: Fast and Accurate LiDAR Semantic Segmentation , 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[64]  Carsten Rother,et al.  Benchmarking the Robustness of Semantic Segmentation Models with Respect to Common Corruptions , 2019, International Journal of Computer Vision.

[65]  Xiaoyong Shen,et al.  STD: Sparse-to-Dense 3D Object Detector for Point Cloud , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[66]  Alexander S. Ecker,et al.  Benchmarking Robustness in Object Detection: Autonomous Driving when Winter is Coming , 2019, ArXiv.

[67]  Xiaogang Wang,et al.  From Points to Parts: 3D Object Detection From Point Cloud With Part-Aware and Part-Aggregation Network , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[68]  Silvio Savarese,et al.  4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[69]  Leonidas J. Guibas,et al.  KPConv: Flexible and Deformable Convolution for Point Clouds , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[70]  Taejung Kim,et al.  Characteristics of Laser Backscattering Intensity to Detect Frozen and Wet Surfaces on Roads , 2019, J. Sensors.

[71]  Cyrill Stachniss,et al.  SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[72]  Thomas G. Dietterich,et al.  Benchmarking Neural Network Robustness to Common Corruptions and Perturbations , 2019, ICLR.

[73]  Qiang Xu,et al.  nuScenes: A Multimodal Dataset for Autonomous Driving , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[74]  Felix Heide,et al.  Seeing Through Fog Without Seeing Fog: Deep Multimodal Sensor Fusion in Unseen Adverse Weather , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[75]  Omar Y. Al-Jarrah,et al.  A Survey on 3D Object Detection Methods for Autonomous Driving Applications , 2019, IEEE Transactions on Intelligent Transportation Systems.

[76]  Jiong Yang,et al.  PointPillars: Fast Encoders for Object Detection From Point Clouds , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[77]  Xiaogang Wang,et al.  PointRCNN: 3D Object Proposal Generation and Detection From Point Cloud , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[78]  Michael Arens,et al.  Mitigation of crosstalk effects in multi-LiDAR configurations , 2018, Security + Defence.

[79]  Bo Li,et al.  SECOND: Sparsely Embedded Convolutional Detection , 2018, Sensors.

[80]  Kurt Keutzer,et al.  SqueezeSegV2: Improved Model Structure and Unsupervised Domain Adaptation for Road-Object Segmentation from a LiDAR Point Cloud , 2018, 2019 International Conference on Robotics and Automation (ICRA).

[81]  Werner Ritter,et al.  A Benchmark for Lidar Sensors in Fog: Is Detection Breaking Down? , 2018, 2018 IEEE Intelligent Vehicles Symposium (IV).

[82]  Yin Zhou,et al.  VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[83]  Kurt Keutzer,et al.  SqueezeSeg: Convolutional Neural Nets with Recurrent CRF for Real-Time Road-Object Segmentation from 3D LiDAR Point Cloud , 2017, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[84]  Nicky Guenther,et al.  When the Dust Settles: The Four Behaviors of LiDAR in the Presence of Fine Airborne Particulates , 2017, J. Field Robotics.

[85]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[86]  Andreas Geiger,et al.  Are we ready for autonomous driving? The KITTI vision benchmark suite , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[87]  Ralph Helmar Rasshofer,et al.  Influences of weather phenomena on automotive laser radar systems , 2011 .

[88]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[89]  Qifeng Chen,et al.  Efficient Point Cloud Segmentation with Geometry-Aware Sparse Networks , 2022, ECCV.

[90]  Christoffer Petersson,et al.  Masked Autoencoders for Self-Supervised Learning on Automotive Point Clouds , 2022, ArXiv.

[91]  Francesco Cappio Borlino,et al.  Towards Open Set 3D Learning: A Benchmark on Object Point Clouds , 2022, ArXiv.

[92]  B. Dai,et al.  Voxel-MAE: Masked Autoencoders for Pre-training Large-scale Point Clouds , 2022, ArXiv.

[93]  Adverse Weather Denoising From Adjacent Point Clouds , 2022 .