Paper Information - FAPIS: A Few-shot Anchor-free Part-based Instance Segmenter


This paper is about few-shot instance segmentation, where training and test image sets do not share the same object classes. We specify and evaluate a new few-shot anchor-free part-based instance segmenter (FAPIS). Our key novelty is in explicit modeling of latent object parts shared across training object classes, which is expected to facilitate our few-shot learning on new classes in testing. We specify a new anchor-free object detector aimed at scoring and regressing locations of foreground bounding boxes, as well as estimating relative importance of latent parts within each box. Also, we specify a new network for delineating and weighting latent parts for the final instance segmentation within every detected bounding box. Our evaluation on the benchmark COCO-20i dataset demonstrates that we significantly outperform the state of the art.
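The part-weighting step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the segmenter produces one activation map per latent part and the detector produces one importance weight per part for each box, and that the mask is obtained as a weighted sum followed by a sigmoid and a threshold. The function name `assemble_instance_mask`, the array layout, the sigmoid, and the 0.5 threshold are all hypothetical choices made for illustration.

```python
import numpy as np

def assemble_instance_mask(part_maps, part_weights, box, threshold=0.5):
    """Combine latent part maps into one instance mask inside a box.

    part_maps:    (J, H, W) array, one activation map per latent part
                  (hypothetical layout, not taken from the paper).
    part_weights: (J,) array, relative importance of each part for this box.
    box:          (x0, y0, x1, y1) integer pixel coordinates, x1/y1 exclusive.
    """
    part_maps = np.asarray(part_maps, dtype=float)
    part_weights = np.asarray(part_weights, dtype=float)
    # Weighted sum over the part axis gives (H, W) logits.
    logits = np.tensordot(part_weights, part_maps, axes=1)
    # Assumed squashing step: sigmoid turns logits into probabilities.
    probs = 1.0 / (1.0 + np.exp(-logits))
    # Keep only pixels inside the detected bounding box.
    x0, y0, x1, y1 = box
    mask = np.zeros(logits.shape, dtype=bool)
    mask[y0:y1, x0:x1] = probs[y0:y1, x0:x1] > threshold
    return mask

# Toy usage: two parts on a 4x4 grid, only the first part matters for this box.
toy_parts = np.zeros((2, 4, 4))
toy_parts[0, 1:3, 1:3] = 5.0
toy_mask = assemble_instance_mask(toy_parts, [1.0, 0.0], box=(0, 0, 4, 4))
```

Restricting the mask to the box reflects the abstract's statement that segmentation happens within every detected bounding box; how the actual network combines and normalizes parts may differ.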

Sinisa Todorovic | Khoi Nguyen
