Cliplets: juxtaposing still and dynamic imagery

We explore creating ""cliplets"", a form of visual media that juxtaposes still image and video segments, both spatially and temporally, to expressively abstract a moment. Much as in ""cinemagraphs"", the tension between static and dynamic elements in a cliplet reinforces both aspects, strongly focusing the viewer's attention. Creating this type of imagery is challenging without professional tools and training. We develop a set of idioms, essentially spatiotemporal mappings, that characterize cliplet elements, and use these idioms in an interactive system to quickly compose a cliplet from ordinary handheld video. One difficulty is to avoid artifacts in the cliplet composition without resorting to extensive manual input. We address this with automatic alignment, looping optimization and feathering, simultaneous matting and compositing, and Laplacian blending. A key user-interface challenge is to provide affordances to define the parameters of the mappings from input time to output time while maintaining a focus on the cliplet being created. We demonstrate the creation of a variety of cliplet types. We also report on informal feedback as well as a more structured survey of users.

[1]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[2]  Edward H. Adelson,et al.  A multiresolution spline with application to image mosaics , 1983, TOGS.

[3]  Edward H. Adelson,et al.  Motion without movement , 1991, SIGGRAPH.

[4]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[5]  Richard Szeliski,et al.  Video textures , 2000, SIGGRAPH.

[6]  David Salesin,et al.  Video matting of complex scenes , 2002, SIGGRAPH.

[7]  Irfan A. Essa,et al.  Controlled animation of video sprites , 2002, SCA '02.

[8]  Irfan A. Essa,et al.  Graphcut textures: image and video synthesis using graph cuts , 2003, ACM Trans. Graph..

[9]  David Salesin,et al.  Interactive digital photomontage , 2004, SIGGRAPH 2004.

[10]  Dani Lischinski,et al.  Spatio-temporal video warping: Copyright restrictions prevent ACM from providing the full text for this work. , 2005, International Conference on Computer Graphics and Interactive Techniques.

[11]  Frédo Durand,et al.  Motion magnification , 2005, ACM Trans. Graph..

[12]  A. Torralba,et al.  Motion magnification , 2005, SIGGRAPH 2005.

[13]  David Salesin,et al.  Panoramic video textures , 2005, ACM Trans. Graph..

[14]  David Salesin,et al.  Animating pictures with stochastic motion textures , 2005, SIGGRAPH 2005.

[15]  Dani Lischinski,et al.  Spatio-temporal video warping. , 2005, SIGGRAPH 2005.

[16]  R. Szeliski Locally adapted hierarchical basis preconditioning , 2006, SIGGRAPH 2006.

[17]  Michael F. Cohen,et al.  Simultaneous Matting and Compositing , 2006, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[18]  Dani Lischinski,et al.  Dynamosaicing: Mosaicing of Dynamic Scenes , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Michael Gleicher,et al.  Re-cinematography: Improving the camerawork of casual video , 2008, TOMCCAP.

[20]  David Salesin,et al.  Video object annotation, navigation, and composition , 2008, UIST '08.

[21]  David Salesin,et al.  Parallax photography: creating 3D cinematic effects from stills , 2009, Graphics Interface.

[22]  Guillermo Sapiro,et al.  Video SnapCut: robust video object cutout using localized classifiers , 2009, ACM Trans. Graph..

[23]  Vincent Lepetit,et al.  BRIEF: Binary Robust Independent Elementary Features , 2010, ECCV.

[24]  Irfan A. Essa,et al.  Auto-directed video stabilization with robust L1 optimal camera paths , 2011, CVPR 2011.

[25]  Richard Szeliski,et al.  Fast Poisson blending using multi-splines , 2011, 2011 IEEE International Conference on Computational Photography (ICCP).

[26]  Jan Kautz,et al.  Towards Moment Imagery: Automatic Cinemagraphs , 2011, 2011 Conference for Visual Media Production.

[27]  Maneesh Agrawala,et al.  Selectively de-animating video , 2012, ACM Trans. Graph..