Deep Image Matting

Image matting is a fundamental computer vision problem with many applications. Previous algorithms perform poorly when an image has similar foreground and background colors or complicated textures. The main reasons are that prior methods 1) use only low-level features and 2) lack high-level context. In this paper, we propose a novel deep learning based algorithm that tackles both problems. Our deep model has two parts. The first part is a deep convolutional encoder-decoder network that takes an image and the corresponding trimap as inputs and predicts the alpha matte of the image. The second part is a small convolutional network that refines the alpha matte predictions of the first network to produce more accurate alpha values and sharper edges. In addition, we create a large-scale image matting dataset comprising 49,300 training images and 1,000 testing images. We evaluate our algorithm on the image matting benchmark, our testing set, and a wide variety of real images. Experimental results clearly demonstrate the superiority of our algorithm over previous methods.
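
To make the terms "trimap" and "alpha matte" concrete, the following is a minimal NumPy sketch, not the authors' implementation. It uses the standard matting model, in which each pixel is I = alpha * F + (1 - alpha) * B, and assumes that the image and trimap are presented to a network as a single 4-channel array. The helper names (`composite`, `trimap_from_alpha`, `make_network_input`) and the trimap encoding (0 = background, 0.5 = unknown, 1 = foreground) are illustrative assumptions and do not come from the paper.

```python
import numpy as np

def composite(fg, bg, alpha):
    """Standard matting model: I = alpha * F + (1 - alpha) * B, per pixel."""
    a = alpha[..., None]  # (H, W, 1) so it broadcasts over the color channels
    return a * fg + (1.0 - a) * bg

def trimap_from_alpha(alpha):
    """Hypothetical helper: derive a trimap from a known alpha matte.

    Assumed encoding: 0 = definite background, 1 = definite foreground,
    0.5 = unknown region where alpha has to be estimated.
    """
    trimap = np.full(alpha.shape, 0.5)
    trimap[alpha <= 0.0] = 0.0
    trimap[alpha >= 1.0] = 1.0
    return trimap

def make_network_input(image, trimap):
    """Assumed input layout: RGB image and trimap stacked as (H, W, 4)."""
    return np.concatenate([image, trimap[..., None]], axis=-1)

# Toy data: random foreground/background and a smooth alpha ramp.
rng = np.random.default_rng(0)
h, w = 4, 4
fg = rng.random((h, w, 3))
bg = rng.random((h, w, 3))
alpha = np.linspace(0.0, 1.0, h * w).reshape(h, w)

image = composite(fg, bg, alpha)
trimap = trimap_from_alpha(alpha)
x = make_network_input(image, trimap)
```

In this toy setup, `x` is what a trimap-based matting model would consume and `alpha` is the target it would be asked to predict inside the unknown region of the trimap.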
