论文信息 - Image Object Extraction Based on Semantic Segmentation and Label Loss

Image Object Extraction Based on Semantic Segmentation and Label Loss

Object extraction refers to the operation of obtaining an object area from an image based on a small amount of mark information given by users, which is a key step in image processing. In order to obtain a complete object profile, current methods usually require a large number of manual annotations, especially for objects with irregular contours. Since traditional algorithms rely on low-level pixel features without semantic information, and are based on obvious mathematical assumptions (ie, strong inductive bias), it is difficult to completely identify objects. At present, in order to improve the integrity of object extraction, semantic segmentation-based methods increase the complexity and latancy by adding more pre-processing and post-processing steps. In this paper, we propose a novel model named IOEBSS, which includes a fast binary plane pre-processing, an improved Deeplab v3+ semantic segmentation model, and an auxiliary loss function named Label Loss. Through the fast binary plane pre-processing, the model can accelerate the transformation of interactive inputs. The improved semantic segmentation model makes the extracted results more semantically complete, and Label Loss is more conducive to gradient flow and accelerates training convergence. For the above reasons, IOEBSS can accurately and quickly identify objects with complex contours and colors. On Pascal VOC and COCO datasets, compared to current methods, IOEBSS has a significant improvement in accuracy, inference speed, and convergence speed.

[1] David Salesin,et al. A Bayesian approach to digital matting , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[2] Quoc V. Le,et al. Neural Architecture Search with Reinforcement Learning , 2016, ICLR.

[3] Harry Shum,et al. Lazy snapping , 2004, ACM Trans. Graph..

[4] Yuanjie Zheng,et al. Learning based digital matting , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[5] Subhransu Maji,et al. Semantic contours from inverse detectors , 2011, 2011 International Conference on Computer Vision.

[6] Iasonas Kokkinos,et al. DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7] Gilles Bertrand,et al. Watershed Cuts: Thinnings, Shortest Path Forests, and Topological Watersheds , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8] In-So Kweon,et al. Natural Image Matting Using Deep Convolutional Neural Networks , 2016, ECCV.

[9] Kaiming He,et al. Rethinking ImageNet Pre-Training , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[10] Pietro Perona,et al. Microsoft COCO: Common Objects in Context , 2014, ECCV.

[11] Ning Xu,et al. Deep GrabCut for Object Selection , 2017, BMVC.

[12] Dani Lischinski,et al. A Closed-Form Solution to Natural Image Matting , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13] Luisa Verdoliva,et al. Marker-Controlled Watershed-Based Segmentation of Multiresolution Remote Sensing Images , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[14] Marcin Ciecholewski,et al. Malignant and Benign Mass Segmentation in Mammograms Using Active Contour Methods , 2017, Symmetry.

[15] Cordelia Schmid,et al. The 2005 PASCAL Visual Object Classes Challenge , 2005, MLCW.

[16] DingKeyan,et al. Active contours driven by region-scalable fitting and optimized Laplacian of Gaussian energy for image segmentation , 2017 .

[17] Vladlen Koltun,et al. Multi-Scale Context Aggregation by Dilated Convolutions , 2015, ICLR.

[18] Pascal Fua,et al. SLIC Superpixels Compared to State-of-the-Art Superpixel Methods , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19] Ning Xu,et al. Deep Image Matting , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[20] Jitendra Malik,et al. Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[21] George Papandreou,et al. Searching for Efficient Multi-Scale Architectures for Dense Image Prediction , 2018, NeurIPS.

[22] Sébastien Ourselin,et al. DeepIGeoS: A Deep Interactive Geodesic Framework for Medical Image Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23] Ning Xu,et al. Deep Interactive Object Selection , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24] Bastian Leibe,et al. Superpixels: An evaluation of the state-of-the-art , 2016, Comput. Vis. Image Underst..

[25] Xiaogang Wang,et al. Pyramid Scene Parsing Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26] Iasonas Kokkinos,et al. Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs , 2014, ICLR.

[27] Vibhav Vineet,et al. Conditional Random Fields as Recurrent Neural Networks , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[28] Chi-Keung Tang,et al. KNN Matting , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[29] Rolf Adams,et al. Seeded Region Growing , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[30] Trevor Darrell,et al. Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31] Marie-Pierre Jolly,et al. Interactive graph cuts for optimal boundary & region segmentation of objects in N-D images , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[32] Aljoscha Smolic,et al. AlphaGAN: Generative adversarial networks for natural image matting , 2018, BMVC.

[33] Patrick Pérez,et al. Poisson image editing , 2003, ACM Trans. Graph..

[34] Dragan Mirkovic,et al. Automatic Performance Tuning in the UHFFT Library , 2001, International Conference on Computational Science.

[35] Sébastien Ourselin,et al. Interactive Medical Image Segmentation Using Deep Learning With Image-Specific Fine Tuning , 2017, IEEE Transactions on Medical Imaging.