Targeting Accurate Object Extraction From an Image: A Comprehensive Study of Natural Image Matting

With the development of digital multimedia technologies, image matting has gained increasing interests from both academic and industrial communities. The purpose of image matting is to precisely extract the foreground objects with arbitrary shapes from an image or a video frame for further editing. It is generally known that image matting is inherently an ill-posed problem because we need to output three images out of only one input image. In this paper, we provide a comprehensive survey of the existing image matting algorithms and evaluate their performance. In addition to the blue screen matting, we systematically divide all existing natural image matting methods into four categories: 1) color sampling-based; 2) propagation-based; 3) combination of sampling-based and propagation-based; and 4) learning-based approaches. Sampling-based methods assume that the foreground and background colors of an unknown pixel can be explicitly estimated by examining nearby pixels. Propagation-based methods are instead based on the assumption that foreground and background colors are locally smooth. Learning-based methods treat the matting process as a supervised or semisupervised learning problem. Via the learning process, users can construct a linear or nonlinear model between the alpha mattes and the image colors using a training set to estimate the alpha matte of an unknown pixel without any assumption about the characteristics of the testing image. With three benchmark data sets, the various matting algorithms are evaluated and compared using several metrics to demonstrate the strengths and weaknesses of each method both quantitatively and qualitatively. Finally, we conclude this paper by outlining the research trends and suggesting a number of promising directions for future development.

[1]  King Ngi Ngan,et al.  Unsupervised extraction of visual attention objects in color images , 2006, IEEE Transactions on Circuits and Systems for Video Technology.

[2]  Deepu Rajan,et al.  Using texture to complement color in image matting , 2013, Image Vis. Comput..

[3]  Michael J. Watts,et al.  IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS Publication Information , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[4]  Michael F. Cohen,et al.  An iterative optimization approach for unified image segmentation and matting , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[5]  Jian Sun,et al.  Fast matting using large kernel matting Laplacian matrices , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[6]  Anil C. Kokaram,et al.  Matting with a depth map , 2010, 2010 IEEE International Conference on Image Processing.

[7]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[8]  Lei Wang,et al.  A Novel Nonlinear Regression Approach for Efficient and Accurate Image Matting , 2013, IEEE Signal Processing Letters.

[9]  Michael F. Cohen,et al.  Image and Video Matting: A Survey , 2007, Found. Trends Comput. Graph. Vis..

[10]  Michael S. Brown,et al.  A Learning-Based Approach to Reduce JPEG Artifacts in Image Matting , 2013, 2013 IEEE International Conference on Computer Vision.

[11]  Frédo Durand,et al.  Defocus video matting , 2005, ACM Trans. Graph..

[12]  Kwang Hee Won,et al.  Natural Image Matting Based on Neighbor Embedding , 2007, MIRAGE.

[13]  Andrew Blake,et al.  "GrabCut" , 2004, ACM Trans. Graph..

[14]  David Salesin,et al.  A Bayesian approach to digital matting , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[15]  Jian Sun,et al.  Poisson matting , 2004, ACM Trans. Graph..

[16]  Raúl Rojas,et al.  Image Segmentation by Uniform Color Clustering-Approach and Benchmark Results , 2005 .

[17]  Dacheng Tao,et al.  Slow Feature Analysis for Human Action Recognition , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Walter Beyer,et al.  Traveling-Matte Photography and the Blue-Screen System: A Tutorial Paper , 1965 .

[19]  Andrew W. Fitzgibbon,et al.  Image-based environment matting , 2002, SIGGRAPH '02.

[20]  Dan B. Goldman,et al.  Shadow Matting and Compositing , .

[21]  A. N. Rajagopalan,et al.  Natural Matting for Degraded Pictures , 2011, IEEE Transactions on Image Processing.

[22]  Weixiang Liu,et al.  Natural image matting with non-negative matrix factorization , 2005, IEEE International Conference on Image Processing 2005.

[23]  René Vidal,et al.  Interactive image matting for multiple layers , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[24]  Xiaofei He,et al.  Locality Preserving Projections , 2003, NIPS.

[25]  Renaud Keriven,et al.  Trimap Segmentation for Fast and User-Friendly Alpha Matting , 2005, VLSM.

[26]  Xuelong Li,et al.  Asymmetric bagging and random subspace for support vector machines-based relevance feedback in image retrieval , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[28]  Jianfei Cai,et al.  Fast environment matting extraction using compressive sensing , 2011, 2011 IEEE International Conference on Multimedia and Expo.

[29]  Jean-Yves Guillemaut,et al.  Natural image matting for multiple wide-baseline views , 2010, 2010 IEEE International Conference on Image Processing.

[30]  Fang Liu,et al.  A GPU-based matting Laplacian solver for high resolution image matting , 2010, The Visual Computer.

[31]  William T. Freeman,et al.  On the optimality of solutions of the max-product belief-propagation algorithm in arbitrary graphs , 2001, IEEE Trans. Inf. Theory.

[32]  G. Lu,et al.  GrowMatting: A GPU-based real-time interactive method for image matting , 2010, 2010 25th International Conference of Image and Vision Computing New Zealand.

[33]  Wojciech Matusik,et al.  Natural video matting using camera arrays , 2006, SIGGRAPH '06.

[34]  David Salesin,et al.  Environment matting and compositing , 1999, SIGGRAPH.

[35]  Deepu Rajan,et al.  Weighted color and texture sample selection for image matting , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[36]  Jean-Michel Morel,et al.  A non-local algorithm for image denoising , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[37]  Wei Chen,et al.  Easy Matting ‐ A Stroke Based Approach for Continuous Image Matting , 2006, Comput. Graph. Forum.

[38]  Ying Wu,et al.  Nonlocal matting , 2011, CVPR 2011.

[39]  Guijin Wang,et al.  Iterative transductive learning for alpha matting , 2013, 2013 IEEE International Conference on Image Processing.

[40]  Michael F. Cohen,et al.  Optimized Color Sampling for Robust Matting , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[41]  Weifeng Liu,et al.  Multiview Hessian Regularization for Image Annotation , 2013, IEEE Transactions on Image Processing.

[42]  Maneesh Agrawala,et al.  Soft scissors: an interactive tool for realtime high quality matting , 2007, ACM Trans. Graph..

[43]  Ying Wu,et al.  L1 matting , 2010, 2010 IEEE International Conference on Image Processing.

[44]  H. Sebastian Seung,et al.  Learning the parts of objects by non-negative matrix factorization , 1999, Nature.

[45]  Chi-Keung Tang,et al.  KNN Matting , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[46]  Xuelong Li,et al.  Improving Level Set Method for Fast Auroral Oval Segmentation , 2014, IEEE Transactions on Image Processing.

[47]  Harry Shum,et al.  Flash matting , 2006, ACM Trans. Graph..

[48]  Jean-Yves Guillemaut,et al.  Non-parametric natural image matting , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[49]  Pushmeet Kohli,et al.  A perceptually motivated online benchmark for image matting , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[50]  Kentaro Toyama,et al.  Wallflower: principles and practice of background maintenance , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[51]  Takeshi Naemura,et al.  Real-Time Video Matting Based on Bilayer Segmentation , 2009, ACCV.

[52]  Yuanjie Zheng,et al.  FuzzyMatte: A computationally efficient scheme for interactive matting , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[53]  Gerald J. Hahn,et al.  Applied Regression Analysis (2nd Ed.) , 2012 .

[54]  James F. Blinn,et al.  Blue screen matting , 1996, SIGGRAPH.

[55]  Dani Lischinski,et al.  A Closed-Form Solution to Natural Image Matting , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[56]  Chein-I Chang,et al.  Image segmentation by local entropy methods , 1995, Proceedings., International Conference on Image Processing.

[57]  Wu-Chih Hu,et al.  Automatic and Accurate Image Matting , 2010, ICCCI.

[58]  Sang Min Yoon,et al.  Alpha matting using compressive sensing , 2012 .

[59]  Tom Duff,et al.  Compositing digital images , 1984, SIGGRAPH.

[60]  Zhanpeng Zhang,et al.  Learning based alpha matting using support vector regression , 2012, 2012 19th IEEE International Conference on Image Processing.

[61]  Carlo Tomasi,et al.  Alpha estimation in natural images , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[62]  Carsten Rother,et al.  Improving Color Modeling for Alpha Matting , 2008, BMVC.

[63]  René Vidal,et al.  Estimation of Alpha Mattes for Multiple Image Layers , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[64]  Lei Guo,et al.  An Object-Oriented Visual Saliency Detection Framework Based on Sparse Coding Representations , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[65]  David Salesin,et al.  Video matting of complex scenes , 2002, SIGGRAPH.

[66]  Manuel Menezes de Oliveira Neto,et al.  Shared Sampling for Real‐Time Alpha Matting , 2010, Comput. Graph. Forum.

[67]  Xuelong Li,et al.  Lazy Random Walks for Superpixel Segmentation , 2014, IEEE Transactions on Image Processing.

[68]  Pushmeet Kohli,et al.  A spatially varying PSF-based prior for alpha matting , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[69]  Ruigang Yang,et al.  Automatic Real-Time Video Matting Using Time-of-Flight Camera and Multichannel Poisson Equations , 2012, International Journal of Computer Vision.

[70]  Ling Shao,et al.  Feature Learning for Image Classification Via Multiobjective Genetic Programming , 2014, IEEE Transactions on Neural Networks and Learning Systems.

[71]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[72]  Scott Cohen,et al.  Geodesic graph cut for interactive image segmentation , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[73]  TaoDacheng,et al.  Large-Margin Multi-ViewInformation Bottleneck , 2014 .

[74]  Rüdiger Westermann,et al.  RANDOM WALKS FOR INTERACTIVE ALPHA-MATTING , 2005 .

[75]  Dacheng Tao,et al.  Large-Margin Multi-ViewInformation Bottleneck , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[76]  Kaleem Siddiqi,et al.  Flux Maximizing Geometric Flows , 2001, ICCV.

[77]  Carsten Rother,et al.  New appearance models for natural image matting , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[78]  Jian Sun,et al.  A global sampling method for alpha matting , 2011, CVPR 2011.

[79]  Oscar C. Au,et al.  Adaptive depth map assisted matting in 3D video , 2011, 2011 IEEE International Conference on Multimedia and Expo.

[80]  Dani Lischinski,et al.  Spectral Matting , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[81]  Guillermo Sapiro,et al.  Distancecut: Interactive Segmentation and Matting of Images and Videos , 2007, 2007 IEEE International Conference on Image Processing.

[82]  Hujun Bao,et al.  Oriented Poisson matting , 2005, IEEE International Conference on Image Processing 2005.

[83]  Xiaowu Chen,et al.  Manifold preserving edit propagation , 2012, ACM Trans. Graph..

[84]  Marie-Pierre Jolly,et al.  Interactive graph cuts for optimal boundary & region segmentation of objects in N-D images , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[85]  Deepu Rajan,et al.  Improving Image Matting Using Comprehensive Sampling Sets , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[86]  Takumi Kobayashi,et al.  Image Matting in the Framework of Quantification IV , 2007, 2007 IEEE International Conference on Image Processing.

[87]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[88]  Wojciech Matusik,et al.  Acquisition and Rendering of Transparent and Refractive Objects , 2002, Rendering Techniques.

[89]  Ling Shao,et al.  Spatio-Temporal Laplacian Pyramid Coding for Action Recognition , 2014, IEEE Transactions on Cybernetics.

[90]  Richard F. Gunst,et al.  Applied Regression Analysis , 1999, Technometrics.

[91]  Yuanjie Zheng,et al.  Learning based digital matting , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[92]  Michael S. Brown,et al.  Matting and compositing of transparent and refractive objects , 2011, TOGS.

[93]  Xuelong Li,et al.  A Nonlinear Adaptive Level Set for Image Segmentation , 2014, IEEE Transactions on Cybernetics.

[94]  Adam Finkelstein,et al.  PatchMatch: a randomized correspondence algorithm for structural image editing , 2009, SIGGRAPH 2009.

[95]  Kiichi Urahama,et al.  Image and Video Matting with Membership Propagation , 2007, ACCV.

[96]  Ron Brinkmann,et al.  The Art and Science of Digital Compositing , 1999 .

[97]  Tony F. Chan,et al.  Matting through variational inpainting , 2007, SIP.

[98]  Guillermo Sapiro,et al.  A Geodesic Framework for Fast Interactive Image and Video Segmentation and Matting , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[99]  John M. Hannah,et al.  Alpha channel estimation in high resolution images and image sequences , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[100]  Suk I. Yoo,et al.  A Convex Image Segmentation: Extending Graph Cuts and Closed-Form Matting , 2010, ACCV.

[101]  Harry Shum,et al.  Natural shadow matting , 2007, TOGS.

[102]  Chunhong Pan,et al.  An Iterative Bayesian Approach for Digital Matting , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[103]  Xiang Ji,et al.  Representing and Retrieving Video Shots in Human-Centric Brain Imaging Space , 2013, IEEE Transactions on Image Processing.

[104]  Michael F. Cohen,et al.  Simultaneous Matting and Compositing , 2006, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[105]  M. Ibrahim Sezan,et al.  Video background replacement without a blue screen , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[106]  Takumi Kobayashi,et al.  Image matting based on local color discrimination by SVM , 2009, Pattern Recognit. Lett..

[107]  Qinping Zhao,et al.  Image Matting with Local and Nonlocal Smooth Priors , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[108]  Carsten Rother,et al.  A stereo approach that handles the matting problem via image warping , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[109]  Toby Sharp,et al.  High resolution matting via interactive trimap segmentation , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[110]  Jean Ponce,et al.  Segmentation by transduction , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[111]  Jue Wang Image Matting with Transductive Inference , 2011, MIRAGE.

[112]  Xuelong Li,et al.  General Tensor Discriminant Analysis and Gabor Features for Gait Recognition , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[113]  David Salesin,et al.  Environment matting extensions: towards higher accuracy and real-time capture , 2000, SIGGRAPH.