Videoshop: A new framework for spatio-temporal video editing in gradient domain

This paper proposes a new framework for video editing in gradient domain. The spatio-temporal gradient fields of target videos are modified and/or mixed to generate a new gradient field which is usually not integrable. We compare two methods to solve this ''mixed gradient problem'', i.e., the variational method and loopy belief propagation. We propose a 3D video integration algorithm, which uses the variational method to find the potential function whose gradient field is closest to the mixed gradient field in the sense of least squares. The video is reconstructed by solving a 3D Poisson equation. The main contributions of our framework lie in three aspects: first, we derive a straightforward extension of current 2D gradient technique to 3D space, thus resulting in a novel video editing framework, which is very different from all current video editing software; secondly, we propose using a fast and accurate 3D discrete Poisson solver which uses diagonal multigrids to solve the 3D Poisson equation, which is up to twice as fast as a simple conventional multigrid algorithm; finally, we introduce a set of new applications, such as face replacement and painting, high dynamic range video compression and graphcut based video compositing. A set of gradient operators is also provided to the user for editing purposes. We evaluate our algorithm using a variety of examples for image/video or video/video pairs. The resulting video can be seamlessly reconstructed.

[1]  W. Freeman,et al.  Generalized Belief Propagation , 2000, NIPS.

[2]  Dani Lischinski,et al.  Gradient Domain High Dynamic Range Compression , 2023 .

[3]  David Salesin,et al.  Video matting of complex scenes , 2002, SIGGRAPH.

[4]  Lawrence B. Wolff,et al.  A new visualization paradigm for multispectral imagery and data fusion , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[5]  E. Land,et al.  Lightness and retinex theory. , 1971, Journal of the Optical Society of America.

[6]  Guillermo Sapiro,et al.  Image inpainting , 2000, SIGGRAPH.

[7]  A. J. Roberts Fast and accurate multigrid solution of Poissons equation using diagonally oriented grids , 1999 .

[8]  William A. Barrett,et al.  Object-based image editing , 2002, ACM Trans. Graph..

[9]  Tom Duff,et al.  Compositing digital images , 1984, SIGGRAPH.

[10]  Ramesh Raskar,et al.  Image fusion for context enhancement and video surrealism , 2004, NPAR '04.

[11]  Irfan A. Essa,et al.  Graphcut textures: image and video synthesis using graph cuts , 2003, ACM Trans. Graph..

[12]  Narendra Ahuja,et al.  Selecting objects with freehand sketches , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[13]  Guillermo Sapiro,et al.  Filling-in by joint interpolation of vector fields and gray levels , 2001, IEEE Trans. Image Process..

[14]  Brendan J. Frey,et al.  Very loopy belief propagation for unwrapping phase images , 2001, NIPS.

[15]  Greg Turk,et al.  LCIS: a boundary hierarchy for detail-preserving contrast reduction , 1999, SIGGRAPH.

[16]  Richard Szeliski,et al.  High dynamic range video , 2003, ACM Trans. Graph..

[17]  Edward H. Adelson,et al.  Layered representation for motion analysis , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[18]  Leonard McMillan,et al.  Proscenium: a framework for spatio-temporal video editing , 2003, ACM Multimedia.

[19]  Brendan J. Frey,et al.  Enforcing integrability for surface reconstruction algorithms using belief propagation in graphical models , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[20]  Patrick Pérez,et al.  Poisson image editing , 2003, ACM Trans. Graph..

[21]  Mark S. Drew,et al.  Removing Shadows from Images , 2002, ECCV.

[22]  James H. Elder,et al.  Image Editing in the Contour Domain , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Tianli Yu,et al.  Sparse lumigraph relighting by illumination and reflectance estimation from multi-view images , 2006, EGSR '06.

[24]  Narendra Ahuja,et al.  Split Aperture Imaging for High Dynamic Range , 2004, International Journal of Computer Vision.

[25]  Jitendra Malik,et al.  Recovering high dynamic range radiance maps from photographs , 1997, SIGGRAPH '08.

[26]  Michael Elad,et al.  A Variational Framework for Retinex , 2002, IS&T/SPIE Electronic Imaging.

[27]  Tomaso A. Poggio,et al.  Reanimating Faces in Images and Video , 2003, Comput. Graph. Forum.

[28]  N. Ahuja,et al.  Seamless video editing , 2004, ICPR 2004.

[29]  Stefano Soatto,et al.  Editable dynamic textures , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[30]  Erik Reinhard,et al.  Photographic tone reproduction for digital images , 2002, ACM Trans. Graph..