Toward an objective benchmark for video completion

Video-completion methods aim to fill selected regions of a video sequence in a natural-looking manner with little to no additional user interaction. Numerous algorithms have been proposed to solve this problem; however, a unified benchmark to quantify progress in the field is still lacking. Video-completion results are usually judged by their plausibility and are not expected to match a single ground-truth result, which complicates the measurement of video-completion performance. In this paper, we address this problem by proposing a set of full-reference quality metrics that outperform naïve approaches, together with an online benchmark for video-completion algorithms. We construct seven test sequences with ground-truth video-completion results by compositing various foreground objects over a set of background videos. Using this dataset, we conduct an extensive comparative study of perceptual video-completion quality involving six algorithms and over 300 human participants. Finally, we show that by relaxing the requirement of complete adherence to ground truth and by taking temporal consistency into account, we can increase the correlation of objective quality metrics with perceptual completion quality on the proposed dataset.
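To make the idea of a full-reference metric that accounts for temporal consistency more concrete, the following is a minimal sketch and not the metric proposed in the paper. It assumes grayscale videos stored as NumPy arrays of shape (T, H, W) and a boolean mask of the same shape marking the completed region; the function names and the weighting parameter `alpha` are hypothetical and chosen only for illustration.

```python
import numpy as np


def spatial_error(result, truth, mask):
    """Mean squared error against ground truth inside the completed region."""
    diff = (result.astype(np.float64) - truth.astype(np.float64)) ** 2
    return float(diff[mask].mean())


def temporal_error(result, truth, mask):
    """MSE between the frame-to-frame differences of result and ground truth.

    A constant offset from the ground truth yields zero temporal error,
    whereas flicker that is absent from the ground truth is penalized.
    """
    d_res = np.diff(result.astype(np.float64), axis=0)
    d_gt = np.diff(truth.astype(np.float64), axis=0)
    # Only compare pixels that are inside the region in both adjacent frames.
    pair_mask = mask[1:] & mask[:-1]
    return float(((d_res - d_gt) ** 2)[pair_mask].mean())


def combined_score(result, truth, mask, alpha=0.5):
    """Illustrative weighted sum of both terms; alpha is a hypothetical weight."""
    return ((1.0 - alpha) * spatial_error(result, truth, mask)
            + alpha * temporal_error(result, truth, mask))


if __name__ == "__main__":
    truth = np.zeros((3, 2, 2))
    mask = np.ones((3, 2, 2), dtype=bool)
    flicker = truth.copy()
    flicker[1] += 1.0  # one frame is brighter than its neighbours
    print(combined_score(flicker, truth, mask))
```

In this sketch, a result that differs from the ground truth by a constant brightness offset is penalized only by the spatial term, while a flickering result is penalized by both terms. Lower scores indicate closer agreement with the ground truth.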
