Temporally coherent and spatially accurate video matting

Image and video matting remain challenging in areas with low foreground‐background contrast. Video matting must additionally produce temporally coherent mattes, because the human visual system is highly sensitive to temporal jitter and flickering. On the other hand, video offers the opportunity to use information from other frames to improve matte accuracy on a given frame. In this paper, we present a new video matting approach that improves temporal coherence while maintaining high spatial accuracy in the computed mattes. We build sample sets of temporal and local samples that cover all the color distributions of the object and background over all previous frames. This supports both spatial accuracy and temporal coherence by ensuring that proper samples are found even when they are distantly located in space or time. An explicit energy term encourages temporal consistency in the mattes derived from the selected samples. In addition, we use localized texture features to improve spatial accuracy in low-contrast regions where color distributions overlap. The proposed method yields better spatial accuracy and temporal coherence than existing video matting methods.
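To make the sampling idea concrete, the following is a minimal sketch of how a sampling-based matting method might score candidate foreground/background sample pairs for one pixel. It relies on the standard compositing model I = αF + (1 − α)B and adds a simple penalty on deviation from the previous frame's alpha. The function names, the additive cost, and the `temporal_weight` parameter are assumptions made for illustration; the abstract does not specify the paper's actual energy function, and the texture features are omitted here.

```python
def estimate_alpha(pixel, fg, bg):
    """Project the pixel color onto the line from bg to fg; clamp to [0, 1]."""
    diff = [f - b for f, b in zip(fg, bg)]
    denom = sum(d * d for d in diff)
    if denom == 0.0:
        # Identical samples carry no information about alpha.
        return 0.0
    num = sum((p - b) * d for p, b, d in zip(pixel, bg, diff))
    return min(1.0, max(0.0, num / denom))


def pair_cost(pixel, fg, bg, prev_alpha, temporal_weight=0.5):
    """Hypothetical cost: color reconstruction error plus a temporal penalty."""
    alpha = estimate_alpha(pixel, fg, bg)
    recon = [alpha * f + (1.0 - alpha) * b for f, b in zip(fg, bg)]
    color_err = sum((p - r) ** 2 for p, r in zip(pixel, recon)) ** 0.5
    temporal_err = abs(alpha - prev_alpha)  # discourages frame-to-frame flicker
    return color_err + temporal_weight * temporal_err, alpha


def select_best_pair(pixel, fg_samples, bg_samples, prev_alpha, temporal_weight=0.5):
    """Exhaustively pick the (fg, bg) pair with the lowest hypothetical cost."""
    best = None
    for fg in fg_samples:
        for bg in bg_samples:
            cost, alpha = pair_cost(pixel, fg, bg, prev_alpha, temporal_weight)
            if best is None or cost < best[0]:
                best = (cost, alpha, fg, bg)
    return best


if __name__ == "__main__":
    pixel = (0.5, 0.5, 0.5)
    fg_samples = [(1.0, 1.0, 1.0), (1.0, 0.0, 0.0)]
    bg_samples = [(0.0, 0.0, 0.0)]
    cost, alpha, fg, bg = select_best_pair(pixel, fg_samples, bg_samples, prev_alpha=0.5)
    print(alpha, fg, bg)
```

In this sketch the candidate lists stand in for sample sets gathered across space and time; a larger `temporal_weight` trades color fidelity for smoother alpha values between frames.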
