论文信息 - Hallucination Improves Few-Shot Object Detection

Hallucination Improves Few-Shot Object Detection

Learning to detect novel objects from few annotated examples is of great practical importance. A particularly challenging yet common regime occurs when there are extremely limited examples (less than three). One critical factor in improving few-shot detection is to address the lack of variation in training data. We propose to build a better model of variation for novel classes by transferring the shared within-class variation from base classes. To this end, we introduce a hallucinator network that learns to generate additional, useful training examples in the region of interest (RoI) feature space, and incorporate it into a modern object detection model. Our approach yields significant performance improvements on two state-of-the-art few-shot detectors with different proposal generation procedures. In particular, we achieve new state of the art in the extremely-few-shot regime on the challenging COCO benchmark.

Yu-Xiong Wang | Weilin Zhang | Yu-Xiong Wang | Weilin Zhang

[1] Yu-Wing Tai,et al. Few-Shot Object Detection With Attention-RPN and Multi-Relation Detector , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[2] Michael S. Bernstein,et al. ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[3] Joshua B. Tenenbaum,et al. One-Shot Learning with a Hierarchical Nonparametric Bayesian Model , 2011, ICML Unsupervised and Transfer Learning.

[4] A. Osokin,et al. OS2D: One-Stage One-Shot Object Detection by Matching Anchor Features , 2020, ECCV.

[5] Luc Van Gool,et al. The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[6] Kaiming He,et al. Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7] Ali Farhadi,et al. YOLOv3: An Incremental Improvement , 2018, ArXiv.

[8] Derek Hoiem,et al. Category Independent Object Proposals , 2010, ECCV.

[9] Xiaodan Liang,et al. Meta R-CNN: Towards General Solver for Instance-Level Low-Shot Learning , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[10] Hwann-Tzong Chen,et al. One-Shot Object Detection with Co-Attention and Co-Excitation , 2019, NeurIPS.

[11] Martial Hebert,et al. Model recommendation: Generating object detectors from few samples , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12] Hong-Yuan Mark Liao,et al. YOLOv4: Optimal Speed and Accuracy of Object Detection , 2020, ArXiv.

[13] Martial Hebert,et al. Learning to Learn: Model Regression Networks for Easy Small Sample Learning , 2016, ECCV.

[14] Xingyi Zhou,et al. Bottom-Up Object Detection by Grouping Extreme and Center Points , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[15] Kaiming He,et al. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16] Koen E. A. van de Sande,et al. Segmentation as selective search for object recognition , 2011, 2011 International Conference on Computer Vision.

[17] Chi Zhang,et al. FSCE: Few-Shot Object Detection via Contrastive Proposal Encoding , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[18] Trevor Darrell,et al. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[19] David A. Forsyth,et al. Cooperating RPN's Improve Few-Shot Object Detection , 2020, ArXiv.

[20] Trevor Darrell,et al. Frustratingly Simple Few-Shot Object Detection , 2020, ICML.

[21] Yi Li,et al. Deformable Convolutional Networks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[22] Yoshua Bengio,et al. MetaGAN: An Adversarial Approach to Few-Shot Learning , 2018, NeurIPS.

[23] Oriol Vinyals,et al. Matching Networks for One Shot Learning , 2016, NIPS.

[24] Zhiqiang Shen,et al. Semantic Relation Reasoning for Shot-Stable Few-Shot Object Detection , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[25] Xin Wang,et al. Few-Shot Object Detection via Feature Reweighting , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[26] Sharath Pankanti,et al. RepMet: Representative-Based Metric Learning for Classification and Few-Shot Object Detection , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[27] Richard S. Zemel,et al. Prototypical Networks for Few-shot Learning , 2017, NIPS.

[28] Pietro Perona,et al. One-shot learning of object categories , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29] Wei Liu,et al. SSD: Single Shot MultiBox Detector , 2015, ECCV.

[30] Martial Hebert,et al. Low-Shot Learning from Imaginary Data , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[31] Renaud Marlet,et al. Few-Shot Object Detection and Viewpoint Estimation for Objects in the Wild , 2020, ECCV.

[32] Deva Ramanan,et al. Meta-Learning to Detect Rare Objects , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[33] Lauren A. Schmidt. Meaning and compositionality as statistical induction of categories and constraints , 2009 .

[34] Hei Law,et al. CornerNet: Detecting Objects as Paired Keypoints , 2018, ECCV.

[35] Bharath Hariharan,et al. Low-Shot Visual Recognition by Shrinking and Hallucinating Features , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[36] Rogério Schmidt Feris,et al. Delta-encoder: an effective sample synthesis method for few-shot object recognition , 2018, NeurIPS.

[37] Shih-Fu Chang,et al. Low-shot Learning via Covariance-Preserving Adversarial Augmentation Networks , 2018, NeurIPS.

[38] Sergey Levine,et al. Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.

[39] Ali Farhadi,et al. YOLO9000: Better, Faster, Stronger , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[40] Dragomir Anguelov,et al. Capturing Long-Tail Distributions of Object Subcategories , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[41] Di Huang,et al. Multi-Scale Positive Sample Refinement for Few-Shot Object Detection , 2020, ECCV.

[42] Hao Chen,et al. LSTD: A Low-Shot Transfer Detector for Object Detection , 2018, AAAI.

[43] Ali Farhadi,et al. You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[44] Pietro Perona,et al. Microsoft COCO: Common Objects in Context , 2014, ECCV.

[45] Qixiang Ye,et al. Beyond Max-Margin: Class Margin Equilibrium for Few-shot Object Detection , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[46] Miaojing Shi,et al. Restoring Negative Information in Few-Shot Object Detection , 2020, NeurIPS.

[47] Deyu Meng,et al. Few-Example Object Detection with Model Communication , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[48] Ross B. Girshick,et al. Fast R-CNN , 2015, 1504.08083.

[49] Jian Sun,et al. Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.