Responsive Action-based Video Synthesis

We propose technology to enable a new medium of expression, where video elements can be looped, merged, and triggered, interactively. Like audio, video is easy to sample from the real world, but hard to segment into clean reusable elements. Reusing a video clip means non-linear editing, and compositing with novel footage. The new context dictates how carefully a clip must be prepared, so our end-to-end approach enables previewing and easy iteration. We convert static-camera videos into loopable sequences, synthesizing them in response to simple end-user requests. This is hard because a) users want essentially semantic-level control over the synthesized video content, and b) automatic loop-finding is brittle and leaves users limited opportunity to work through problems. We propose a human-in-the-loop system where adding effort gives the user progressively more creative control. Artists help us evaluate how our trigger interfaces can be used for authoring of videos and video-performances.

[1]  Jan Kautz,et al.  Towards Moment Imagery: Automatic Cinemagraphs , 2011, 2011 Conference for Visual Media Production.

[2]  Rui Caseiro,et al.  High-Speed Tracking with Kernelized Correlation Filters , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Rémi Ronfard,et al.  A survey of vision-based methods for action representation, segmentation and recognition , 2011, Comput. Vis. Image Underst..

[4]  David Salesin,et al.  Panoramic video textures , 2005, ACM Trans. Graph..

[5]  Yael Pritch,et al.  This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008 1 Non-Chronological Video , 2022 .

[6]  Vladimir Kolmogorov,et al.  Convergent Tree-Reweighted Message Passing for Energy Minimization , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Maneesh Agrawala,et al.  Selectively de-animating video , 2012, ACM Trans. Graph..

[8]  Jing Liao,et al.  Fast computation of seamless video loops , 2015, ACM Trans. Graph..

[9]  Gunnar Farnebäck,et al.  Two-Frame Motion Estimation Based on Polynomial Expansion , 2003, SCIA.

[10]  Roman P. Pflugfelder,et al.  Clustering of static-adaptive correspondences for deformable object tracking , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Steven M. Drucker,et al.  Cliplets: juxtaposing still and dynamic imagery , 2012, UIST.

[12]  Jan O. Borchers,et al.  DRAGON: a direct manipulation interface for frame-accurate in-scene video navigation , 2008, CHI.

[13]  Michael S. Langer,et al.  Panoramic stereo video textures , 2011, 2011 International Conference on Computer Vision.

[14]  Pierre Dragicevic,et al.  Video browsing by direct manipulation , 2008, CHI.

[15]  Jin Wei,et al.  Timeline Editing of Objects in Video , 2013, IEEE Transactions on Visualization and Computer Graphics.

[16]  Yizhou Yu,et al.  Audeosynth: Music-driven Video Montage , 2015, ACM Trans. Graph..

[17]  Dani Lischinski,et al.  Spatio-temporal video warping: Copyright restrictions prevent ACM from providing the full text for this work. , 2005, International Conference on Computer Graphics and Interactive Techniques.

[18]  P. J. Narayanan,et al.  Interactive Video Manipulation Using Object Trajectories and Scene Backgrounds , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[19]  KolmogorovVladimir Convergent Tree-Reweighted Message Passing for Energy Minimization , 2006 .

[20]  David Salesin,et al.  Image Analogies , 2001, SIGGRAPH.

[21]  Dani Lischinski,et al.  Evolving Time Fronts: Spatio-Temporal Video Warping , 2005 .

[22]  Patrick Pérez,et al.  Poisson image editing , 2003, ACM Trans. Graph..

[23]  Neel Joshi,et al.  Automated video looping with progressive dynamism , 2013, ACM Trans. Graph..

[24]  Richard Szeliski,et al.  Video textures , 2000, SIGGRAPH.

[25]  Jessica K. Hodgins,et al.  Flow-based video synthesis and editing , 2004, SIGGRAPH 2004.

[26]  Irfan A. Essa,et al.  Graphcut textures: image and video synthesis using graph cuts , 2003, ACM Trans. Graph..

[27]  Kalyan Sunkavalli,et al.  Smooth Loops from Unconstrained Video , 2015, Comput. Graph. Forum.

[28]  Aljoscha Smolic,et al.  DuctTake: Spatiotemporal Video Compositing , 2013, Comput. Graph. Forum.

[29]  Atsushi Nakazawa,et al.  Human video textures , 2009, I3D '09.

[30]  Zoubin Ghahramani,et al.  Combining active learning and semi-supervised learning using Gaussian fields and harmonic functions , 2003, ICML 2003.

[31]  David Salesin,et al.  Video object annotation, navigation, and composition , 2008, UIST '08.

[32]  Irfan A. Essa,et al.  Controlled animation of video sprites , 2002, SCA '02.