H2RBox-v2: Boosting HBox-supervised Oriented Object Detection via Symmetric Learning

With the increasing demand for oriented object detection e.g. in autonomous driving and remote sensing, the oriented annotation has become a labor-intensive work. To make full use of existing horizontally annotated datasets and reduce the annotation cost, a weakly-supervised detector H2RBox for learning the rotated box (RBox) from the horizontal box (HBox) has been proposed and received great attention. This paper presents a new version, H2RBox-v2, to further bridge the gap between HBox-supervised and RBox-supervised oriented object detection. While exploiting axisymmetry via flipping and rotating consistencies is available through our theoretical analysis, H2RBox-v2, using a weakly-supervised branch similar to H2RBox, is embedded with a novel self-supervised branch that learns orientations from the symmetry inherent in the image of objects. Com-plemented by modules to cope with peripheral issues, e.g. angular periodicity, a stable and effective solution is achieved. To our knowledge, H2RBox-v2 is the first symmetry-supervised paradigm for oriented object detection. Compared to H2RBox, our method is less susceptible to low annotation quality and insufficient training data, which in such cases is expected to give a competitive performance much closer to fully-supervised oriented object detectors. Specifically, the performance comparison between H2RBox-v2 and Rotated FCOS on DOTA-v1.0/1.5/2.0

[1]  T. Drummond,et al.  Knowledge Combination to Learn Rotated Detection without Rotated Annotation , 2023, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Long Wen,et al.  A comprehensive survey of oriented object detection in remote sensing images , 2023, Expert Syst. Appl..

[3]  Junchi Yan,et al.  Detecting Rotated Objects as Gaussian Distributions and its 3-D Generalization , 2022, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Xue Yang,et al.  G-Rep: Gaussian Representation for Arbitrary-Oriented Object Detection , 2022, Remote. Sens..

[5]  Zhi-guo Jiang,et al.  WSODet: A Weakly Supervised Oriented Detector for Aerial Object Detection , 2023, IEEE Transactions on Geoscience and Remote Sensing.

[6]  Yi Yu,et al.  Phase-Shifting Coder: Predicting Accurate Orientation in Oriented Object Detection , 2022, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Junchi Yan,et al.  H2RBox: Horizontal Box Annotation is All You Need for Oriented Object Detection , 2022, ICLR.

[8]  Jianke Zhu,et al.  Box-supervised Instance Segmentation with Level Set Evolution , 2022, ECCV.

[9]  Jian Xue,et al.  Shape-Adaptive Selection and Measurement for Oriented Object Detection , 2022, AAAI Conference on Artificial Intelligence.

[10]  Junchi Yan,et al.  MMRotate: A Rotated Object Detection Benchmark using PyTorch , 2022, ACM Multimedia.

[11]  Junchi Yan,et al.  The KFIoU Loss for Rotated Object Detection , 2022, ICLR.

[12]  Junwei Han,et al.  Anchor-Free Oriented Proposal Generator for Object Detection , 2021, IEEE Transactions on Geoscience and Remote Sensing.

[13]  Jianke Zhu,et al.  Oriented RepPoints for Aerial Object Detection , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Stefan Hinz,et al.  FAIR1M: A Benchmark Dataset for Fine-grained Object Recognition in High-Resolution Remote Sensing Imagery , 2021, ISPRS Journal of Photogrammetry and Remote Sensing.

[15]  Gui-Song Xia,et al.  Align Deep Features for Oriented Object Detection , 2020, IEEE Transactions on Geoscience and Remote Sensing.

[16]  Junchi Yan,et al.  On the Arbitrary-Oriented Object Detection: Classification Based Approaches Revisited , 2020, International Journal of Computer Vision.

[17]  Hongjin Wu,et al.  PCBNet: A Lightweight Convolutional Neural Network for Defect Inspection in Surface Mount Technology , 2022, IEEE Transactions on Instrumentation and Measurement.

[18]  Junwei Han,et al.  Oriented R-CNN for Object Detection , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[19]  Junchi Yan,et al.  Learning High-Precision Bounding Box for Rotated Object Detection via Kullback-Leibler Divergence , 2021, NeurIPS.

[20]  Jianbin Jiao,et al.  Beyond Bounding-Box: Convex-hull Feature Adaptation for Oriented and Densely Packed Object Detection , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Arif Mahmood,et al.  Leveraging orientation for weakly supervised object detection with application to firearm localization , 2021, Neurocomputing.

[22]  Gui-Song Xia,et al.  ReDet: A Rotation-equivariant Detector for Aerial Object Detection , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Junchi Yan,et al.  Rethinking Rotated Object Detection with Gaussian Wasserstein Distance Loss , 2021, ICML.

[24]  Zhi Tian,et al.  BoxInst: High-Performance Instance Segmentation with Box Annotations , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Junchi Yan,et al.  Dense Label Encoding for Boundary Discontinuity Free Rotation Detection , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  S. Gelly,et al.  An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale , 2020, ICLR.

[27]  Xue Yang,et al.  Learning Modulated Loss for Rotated Object Detection , 2019, AAAI.

[28]  Junchi Yan,et al.  R3Det: Refined Single-Stage Detector with Feature Refinement for Rotating Object , 2019, AAAI.

[29]  Stephen Lin,et al.  Swin Transformer: Hierarchical Vision Transformer using Shifted Windows , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[30]  Shi-Min Hu,et al.  Jittor: a novel deep learning framework with meta-operators and unified graph execution , 2020, Science China Information Sciences.

[31]  Hongli Gao,et al.  A Data-Flow Oriented Deep Ensemble Learning Method for Real-Time Surface Defect Inspection , 2020, IEEE Transactions on Instrumentation and Measurement.

[32]  Weiming Dong,et al.  Dynamic Refinement Network for Oriented and Densely Packed Object Detection , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[33]  Junchi Yan,et al.  Arbitrary-Oriented Object Detection with Circular Smooth Label , 2020, ECCV.

[34]  Hao Chen,et al.  Conditional Convolutions for Instance Segmentation , 2020, ECCV.

[35]  Yue Zhang,et al.  Rotation-aware and multi-scale convolutional neural network for object detection in remote sensing images , 2020 .

[36]  Junwei Han,et al.  Object Detection in Optical Remote Sensing Images: A Survey and A New Benchmark , 2019, ISPRS Journal of Photogrammetry and Remote Sensing.

[37]  Ross B. Girshick,et al.  Focal Loss for Dense Object Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[38]  Ross B. Girshick,et al.  Mask R-CNN , 2017, 1703.06870.

[39]  Natalia Gimelshein,et al.  PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[40]  Yang Long,et al.  Learning RoI Transformer for Oriented Object Detection in Aerial Images , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  Stephen Lin,et al.  RepPoints: Point Set Representation for Object Detection , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[42]  Hao Chen,et al.  FCOS: Fully Convolutional One-Stage Object Detection , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[43]  Tal Hassner,et al.  Precise Detection in Densely Packed Scenes , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[44]  Yue Zhang,et al.  SCRDet: Towards More Robust Detection for Small, Cluttered and Rotated Objects , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[45]  Matti Pietikäinen,et al.  Deep Learning for Generic Object Detection: A Survey , 2018, International Journal of Computer Vision.

[46]  Xindong Wu,et al.  Object Detection With Deep Learning: A Review , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[47]  Frank Hutter,et al.  Decoupled Weight Decay Regularization , 2017, ICLR.

[48]  Yung-Yu Chuang,et al.  Weakly Supervised Instance Segmentation using the Bounding Box Tightness Prior , 2019, NeurIPS.

[49]  Gui-Song Xia,et al.  Rotation-Sensitive Regression for Oriented Scene Text Detection , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[50]  Menglong Yan,et al.  Automatic Ship Detection in Remote Sensing Images from Google Earth of Complex Scenes Based on Multiscale Rotation Dense Feature Pyramid Networks , 2018, Remote. Sens..

[51]  Junjie Yan,et al.  FOTS: Fast Oriented Text Spotting with a Unified Network , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[52]  Jiebo Luo,et al.  DOTA: A Large-Scale Dataset for Object Detection in Aerial Images , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[53]  Xiangyang Xue,et al.  Arbitrary-Oriented Scene Text Detection via Rotation Proposals , 2017, IEEE Transactions on Multimedia.

[54]  Wafa Khlif,et al.  ICDAR2017 Robust Reading Challenge on Multi-Lingual Scene Text Detection and Script Identification - RRC-MLT , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).

[55]  Kaiming He,et al.  Focal Loss for Dense Object Detection , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[56]  Shuchang Zhou,et al.  EAST: An Efficient and Accurate Scene Text Detector , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[57]  Yiping Yang,et al.  A High Resolution Optical Satellite Image Dataset for Ship Recognition and Some New Baselines , 2017, ICPRAM.

[58]  Kaiming He,et al.  Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[59]  Bernt Schiele,et al.  Simple Does It: Weakly Supervised Instance and Semantic Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[60]  Yuning Jiang,et al.  UnitBox: An Advanced Object Detection Network , 2016, ACM Multimedia.

[61]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).