PointCaM: Cut-and-Mix for Open-Set Point Cloud Learning

Point cloud learning is receiving increasing attention, however, most existing point cloud models lack the practical ability to deal with the unavoidable presence of unknown objects. This paper mainly discusses point cloud learning under open-set settings, where we train the model without data from unknown classes and identify them in the inference stage. Basically, we propose to solve open-set point cloud learning using a novel Point Cut-and-Mix mechanism consisting of Unknown-Point Simulator and Unknown-Point Estimator modules. Specifically, we use the Unknown-Point Simulator to simulate out-of-distribution data in the training stage by manipulating the geometric context of partial known data. Based on this, the Unknown-Point Estimator module learns to exploit the point cloud's feature context for discriminating the known and unknown data. Extensive experiments show the plausibility of open-set point cloud learning and the effectiveness of our proposed solutions. Our code is available at \url{https://github.com/ShiQiu0419/pointcam}.

[1]  Matias Valdenegro-Toro,et al.  A Benchmark for Out of Distribution Detection in Point Cloud 3D Semantic Segmentation , 2022, ArXiv.

[2]  Xiaolong Wang,et al.  Visual Reinforcement Learning With Self-Supervised 3D Representations , 2022, IEEE Robotics and Automation Letters.

[3]  Francesco Cappio Borlino,et al.  3DOS: Towards 3D Open Set Learning - Benchmarking and Understanding Semantic Novelty Detection on Point Clouds , 2022, NeurIPS.

[4]  Peng Yun,et al.  Open-world Semantic Segmentation for LIDAR Point Clouds , 2022, ECCV.

[5]  I. Reid,et al.  CropMix: Sampling a Rich Input Distribution via Multi-Scale Cropping , 2022, ArXiv.

[6]  Yutong Xie,et al.  Region-Aware Metric Learning for Open World Semantic Segmentation via Meta-Channel Aggregation , 2022, IJCAI.

[7]  N. Barnes,et al.  Towards Open-Set Object Detection and Discovery , 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[8]  Mehrtash Harandi,et al.  GOSS: Towards Generalized Open-set Semantic Segmentation , 2022, Vis. Comput..

[9]  Peng Yun,et al.  Open-set 3D Object Detection , 2021, 2021 International Conference on 3D Vision (3DV).

[10]  N. Barnes,et al.  PU-Transformer: Point Cloud Upsampling Transformer , 2021, ACCV.

[11]  Yixuan Li,et al.  A Unified Survey on Anomaly, Novelty, Open-Set, and Out-of-Distribution Detection: Solutions and Future Challenges , 2021, Trans. Mach. Learn. Res..

[12]  A. Vedaldi,et al.  Open-Set Recognition: A Good Closed-Set Classifier is All You Need , 2021, ICLR.

[13]  Yixuan Li,et al.  On the Importance of Gradients for Detecting Distributional Shifts in the Wild , 2021, NeurIPS.

[14]  Mingqiang Wei,et al.  Adaptive Graph Convolution for Point Cloud Analysis , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[15]  Nick Barnes,et al.  PnP-3D: A Plug-and-Play for 3D Point Clouds , 2021, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Peng Yun,et al.  Deep Metric Learning for Open World Semantic Segmentation , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[17]  Chongyi Li,et al.  Investigating Attention Mechanism in 3D Point Cloud Object Detection , 2021, 2021 International Conference on 3D Vision (3DV).

[18]  Tomas Pfister,et al.  CutPaste: Self-Supervised Learning for Anomaly Detection and Localization , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[19]  Shu Kong,et al.  OpenGAN: Open-Set Recognition via Open Data Generation , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[20]  N. Barnes,et al.  Semantic Segmentation for Real Point Cloud Scenes via Bilateral Augmentation and Adaptive Fusion , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  K. J. Joseph,et al.  Towards Open World Object Detection , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[22]  Yonghong Tian,et al.  Adversarial Reciprocal Points Learning for Open Set Recognition , 2021, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Ilya Sutskever,et al.  Learning Transferable Visual Models From Natural Language Supervision , 2021, ICML.

[24]  Sangyoun Lee,et al.  Regularization Strategy for Point Cloud via Rigidly Mixed Sample , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Jihong Zhu,et al.  PointCutMix: Regularization Strategy for Point Cloud Classification , 2021, Neurocomputing.

[26]  Ralph R. Martin,et al.  PCT: Point cloud transformer , 2020, Computational Visual Media.

[27]  Quoc V. Le,et al.  Simple Copy-Paste is a Strong Data Augmentation Method for Instance Segmentation , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Klaus Dietmayer,et al.  Point Transformer , 2020, IEEE Access.

[29]  Biplab Banerjee,et al.  Empowering Knowledge Distillation via Open Set Recognition for Robust 3D Point Cloud Classification , 2020, Pattern Recognit. Lett..

[30]  Tat-Seng Chua,et al.  Few-shot 3D Point Cloud Semantic Segmentation , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[31]  Songcan Chen,et al.  Guided CNN for generalized zero-shot and open-set recognition using visual and semantic prototypes , 2020, Pattern Recognition.

[32]  Ang Li,et al.  Hybrid Models for Open Set Recognition , 2020, ECCV.

[33]  Zhuguo Li,et al.  PointASNL: Robust Point Clouds Processing Using Nonlocal Neural Networks With Adaptive Sampling , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[34]  Terrance E. Boult,et al.  The Overlooked Elephant of Object Detection: Open Set , 2020, 2020 IEEE Winter Conference on Applications of Computer Vision (WACV).

[35]  Jonathon S. Hare,et al.  FMix: Enhancing Mixed Sample Data Augmentation , 2020 .

[36]  Mohammed Bennamoun,et al.  Deep Learning for 3D Point Clouds: A Survey , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[37]  Nick Barnes,et al.  Geometric Back-Projection Network for Point Cloud Classification , 2019, IEEE Transactions on Multimedia.

[38]  A. Markham,et al.  RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[39]  Raquel Urtasun,et al.  Identifying Unknown Instances for Autonomous Driving , 2019, CoRL.

[40]  Duc Thanh Nguyen,et al.  Revisiting Point Cloud Classification: A New Benchmark Dataset and Classification Model on Real-World Data , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[41]  Hui Tang,et al.  An overview of augmented reality technology , 2019, Journal of Physics: Conference Series.

[42]  Seong Joon Oh,et al.  CutMix: Regularization Strategy to Train Strong Classifiers With Localizable Features , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[43]  Leonidas J. Guibas,et al.  Deep Hough Voting for 3D Object Detection in Point Clouds , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[44]  Silvio Savarese,et al.  4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[45]  Shiming Xiang,et al.  Relation-Shape Convolutional Neural Network for Point Cloud Analysis , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[46]  Cyrill Stachniss,et al.  SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[47]  Chunping Hou,et al.  Open-set human activity recognition based on micro-Doppler signatures , 2019, Pattern Recognit..

[48]  Weng-Keen Wong,et al.  Open Set Learning with Counterfactual Images , 2018, ECCV.

[49]  Stanislav Pidhorskyi,et al.  Generative Probabilistic Novelty Detection with Adversarial Autoencoders , 2018, NeurIPS.

[50]  Lei Shi,et al.  Towards open-set semantic labeling in 3D point clouds : Analysis on the unknown class , 2018, Neurocomputing.

[51]  Yue Wang,et al.  Dynamic Graph CNN for Learning on Point Clouds , 2018, ACM Trans. Graph..

[52]  Ulrich Neumann,et al.  SGPN: Similarity Group Proposal Network for 3D Point Cloud Instance Segmentation , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[53]  Martial Hebert,et al.  Cut, Paste and Learn: Surprisingly Easy Synthesis for Instance Detection , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[54]  R. Srikant,et al.  Enhancing The Reliability of Out-of-distribution Image Detection in Neural Networks , 2017, ICLR.

[55]  Leonidas J. Guibas,et al.  PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space , 2017, NIPS.

[56]  Silvio Savarese,et al.  Joint 2D-3D-Semantic Data for Indoor Scene Understanding , 2017, ArXiv.

[57]  Leonidas J. Guibas,et al.  PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[58]  Kevin Gimpel,et al.  A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks , 2016, ICLR.

[59]  Leonidas J. Guibas,et al.  ShapeNet: An Information-Rich 3D Model Repository , 2015, ArXiv.

[60]  Terrance E. Boult,et al.  Towards Open Set Deep Networks , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[61]  Sebastian Scherer,et al.  VoxNet: A 3D Convolutional Neural Network for real-time object recognition , 2015, 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[62]  Roland Siegwart,et al.  A Review of Point Cloud Registration Algorithms for Mobile Robotics , 2015, Found. Trends Robotics.

[63]  Zoubin Ghahramani,et al.  Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning , 2015, ICML.

[64]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[65]  Subhransu Maji,et al.  Multi-view Convolutional Neural Networks for 3D Shape Recognition , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[66]  Takaya Saito,et al.  The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets , 2015, PloS one.

[67]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[68]  Trevor Darrell,et al.  Fully convolutional networks for semantic segmentation , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[69]  Zhuowen Tu,et al.  Deeply-Supervised Nets , 2014, AISTATS.

[70]  Jianxiong Xiao,et al.  3D ShapeNets: A deep representation for volumetric shapes , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[71]  Wolfram Burgard,et al.  3-D Mapping With an RGB-D Camera , 2014, IEEE Transactions on Robotics.

[72]  Anderson Rocha,et al.  Toward Open Set Recognition , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[73]  Mark Goadrich,et al.  The relationship between Precision-Recall and ROC curves , 2006, ICML.

[74]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[75]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[76]  Dawn Song,et al.  Scaling Out-of-Distribution Detection for Real-World Settings , 2022, ICML.

[77]  Quoc V. Le,et al.  Combined Scaling for Open-Vocabulary Image Classification , 2022 .

[78]  Maggi Kelly,et al.  A New Method for Segmenting Individual Trees from the Lidar Point Cloud , 2012 .

[79]  M. Jaboyedoff,et al.  Use of LIDAR in landslide investigations: a review , 2012, Natural Hazards.