论文信息 - Learning to Decode Contextual Information for Efficient Contour Detection

Learning to Decode Contextual Information for Efficient Contour Detection

Contour detection plays an important role in both academic research and real-world applications. As the basic building block of many applications, its accuracy and efficiency highly influence the subsequent stages. In this work, we propose a novel lightweight system for contour detection that achieves state-of-the-art performance while keeps ultra-slim model size. The proposed method is built on an efficient encoder in a bottom-up/top-down fashion. Specially, we propose a novel decoder that compresses side features from an encoder and effectively decodes compact contextual information for high-accurate boundary localization. Besides, we propose a novel loss function that is able to assist a model to produce crisp object boundaries. We conduct extensive experiments to demonstrate the effectiveness of the proposed system on the widely adopted benchmarks BSDS500 and Multi-Cue. The results show that our system achieves the same best performance, yet only consumes 3.3% computational cost (16.45GFlops VS. 499.15GFlops) and 2.35% model size (1.94M VS. 82.43M) of the SOTA detector RCF-ResNet101. In the meantime, our method outperforms a large portion of the recent top edge detectors by a clear margin.

[1] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[2] C. Lawrence Zitnick,et al. Fast Edge Detection Using Structured Forests , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3] Josef Kittler,et al. On the accuracy of the Sobel edge detector , 1983, Image Vis. Comput..

[4] Ronan Collobert,et al. Learning to Refine Object Segments , 2016, ECCV.

[5] Calvin C. Zhao. Critical Review : Contour Detection and Hierarchical Image Segmentation , 2015 .

[6] Changming Sun,et al. Knowledge Adaptation for Efficient Semantic Segmentation , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[7] Mark Sandler,et al. MobileNetV2: Inverted Residuals and Linear Bottlenecks , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[8] Xiang Bai,et al. Richer Convolutional Features for Edge Detection , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9] Geoffrey E. Hinton,et al. Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[10] Enhua Wu,et al. Squeeze-and-Excitation Networks , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11] Jun-Cheng Chen,et al. An adaptive edge detection based colorization algorithm and its applications , 2005, ACM Multimedia.

[12] Gérard G. Medioni,et al. Human pose estimation from a single view point, real-time range sensor , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[13] Charless C. Fowlkes,et al. Contour Detection and Hierarchical Image Segmentation , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14] Heng Tao Shen,et al. Bottom-up and Top-down: Bidirectional Additive Net for Edge Detection , 2020, IJCAI.

[15] Kaiming He,et al. Focal Loss for Dense Object Detection , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[16] Thomas Serre,et al. A systematic comparison between visual cues for boundary detection , 2016, Vision Research.

[17] Roberto Cipolla,et al. SegNet: A Deep Convolutional Encoder-Decoder Architecture for Robust Semantic Pixel-Wise Labelling , 2015, CVPR 2015.

[18] Ross B. Girshick,et al. Focal Loss for Dense Object Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19] George Papandreou,et al. Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation , 2018, ECCV.

[20] Shengjun Liu,et al. Learning to predict crisp boundaries , 2018, ECCV.

[21] Jianbo Shi,et al. DeepEdge: A multi-scale bifurcated deep network for top-down contour detection , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[22] N. Senthilkumaran,et al. Image Segmentation - A Survey of Soft Computing Approaches , 2009, 2009 International Conference on Advances in Recent Technologies in Communication and Computing.

[23] Forrest N. Iandola,et al. SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <1MB model size , 2016, ArXiv.

[24] Jinhui Tang,et al. Richer Convolutional Features for Edge Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[25] Iasonas Kokkinos,et al. Pushing the Boundaries of Boundary Detection using Deep Learning , 2015, ICLR 2016.

[26] Xin Zhao,et al. Deep Crisp Boundaries , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27] Sai-Keung Wong,et al. Adversarial Colorization of Icons Based on Contour and Color Conditions , 2019, ACM Multimedia.

[28] Jitendra Malik,et al. Scale-Space and Edge Detection Using Anisotropic Diffusion , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[29] Gerald Kühne,et al. Motion-based segmentation and contour-based classification of video objects , 2001, MULTIMEDIA '01.

[30] David G. Lowe,et al. Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[31] Quoc V. Le,et al. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks , 2019, ICML.

[32] Bo Chen,et al. MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications , 2017, ArXiv.

[33] Edward S. Deutsch,et al. On the Quantitative Evaluation of Edge Detection Schemes and their Comparison with Human Performance , 1975, IEEE Transactions on Computers.

[34] Kaiming He,et al. Designing Network Design Spaces , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[35] Shengjun Liu,et al. Deep Structural Contour Detection , 2020, ACM Multimedia.

[36] Yan Wang,et al. DeepContour: A deep convolutional feature learned by positive-sharing loss for contour detection , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[37] G LoweDavid,et al. Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[38] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[39] John F. Canny,et al. A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[40] Gang Sun,et al. Squeeze-and-Excitation Networks , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.