EXCOL: An EXtract-and-COmplete Layering Approach to Cartoon Animation Reusing

We introduce the EXtract-and-COmplete Layering method (EXCOL)-a novel cartoon animation processing technique to convert a traditional animated cartoon video into multiple semantically meaningful layers. Our technique is inspired by vision-based layering techniques but focuses on shape cues in both the extraction and completion steps to reflect the unique characteristics of cartoon animation. For layer extraction, we define a novel similarity measure incorporating both shape and color of automatically segmented regions within individual frames and propagate a small set of user-specified layer labels among similar regions across frames. By clustering regions with the same labels, each frame is appropriately partitioned into different layers, with each layer containing semantically meaningful content. Then, a warping-based approach is used to fill missing parts caused by occlusion within the extracted layers to achieve a complete representation. EXCOL provides a flexible way to effectively reuse traditional cartoon animations with only a small amount of user interaction. It is demonstrated that our EXCOL method is effective and robust, and the layered representation benefits a variety of applications in cartoon animation processing.

[1]  Bobby Bodenheimer,et al.  Cartoon textures , 2004, SCA '04.

[2]  Vladimir Kolmogorov,et al.  What energy functions can be minimized via graph cuts? , 2002, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Jean-Yves Guillemaut,et al.  Multi-label propagation for coherent video segmentation and artistic stylization , 2010, 2010 IEEE International Conference on Image Processing.

[4]  Jitendra Malik,et al.  Shape matching and object recognition using shape contexts , 2010, 2010 3rd International Conference on Computer Science and Information Technology.

[5]  Jian Sun,et al.  Video object cut and paste , 2005, SIGGRAPH 2005.

[6]  Mubarak Shah,et al.  Motion layer extraction in the presence of occlusion using graph cuts , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Takeo Kanade,et al.  A robust subspace approach to layer extraction , 2002, Workshop on Motion and Video Computing, 2002. Proceedings..

[8]  John Dingliana,et al.  As-rigid-as-possible image registration for hand-drawn cartoon animations , 2009, NPAR '09.

[9]  Daniel Cohen-Or,et al.  Fragment-based image completion , 2003, ACM Trans. Graph..

[10]  Tai-Pang Wu,et al.  Video repairing under variable illumination using cyclic motions , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Harry Shum,et al.  Lazy snapping , 2004, ACM Trans. Graph..

[12]  Tinghuai Wang,et al.  Video stylization for digital ambient displays of home movies , 2010, NPAR.

[13]  Christoph Bregler,et al.  Turning to the masters: motion capturing cartoons , 2002, ACM Trans. Graph..

[14]  Nancy S. Pollard,et al.  Local layering , 2009, SIGGRAPH 2009.

[15]  Ralph R. Martin,et al.  Vectorizing Cartoon Animations , 2009, IEEE Transactions on Visualization and Computer Graphics.

[16]  Bobby Bodenheimer,et al.  Eurographics/ Acm Siggraph Symposium on Computer Animation (2006) Re-using Traditional Animation: Methods for Semi-automatic Segmentation and Inbetweening , 2022 .

[17]  Jirí Zára,et al.  Sketching Cartoons by Example , 2005, SBM.

[18]  John Dingliana,et al.  LazyBrush: Flexible Painting Tool for Hand‐drawn Cartoons , 2009, Comput. Graph. Forum.

[19]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[20]  Guillermo Sapiro,et al.  Video SnapCut: robust video object cutout using localized classifiers , 2009, SIGGRAPH 2009.

[21]  Edward H. Adelson,et al.  Representing moving images with layers , 1994, IEEE Trans. Image Process..

[22]  Jan Flusser,et al.  Image registration methods: a survey , 2003, Image Vis. Comput..

[23]  Ralph R. Martin,et al.  Video completion using tracking and fragment merging , 2005, The Visual Computer.

[24]  Olga Veksler,et al.  Fast approximate energy minimization via graph cuts , 2001, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[25]  Patrick Pérez,et al.  Time-sequential extraction of motion layers , 2008, 2008 15th IEEE International Conference on Image Processing.

[26]  A. Criminisi,et al.  Bilayer Segmentation of Live Video , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[27]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[28]  Eli Shechtman,et al.  Space-time video completion , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[29]  Jirí Zára,et al.  Colorization of black-and-white cartoons , 2005, Image Vis. Comput..

[30]  Maneesh Agrawala,et al.  Interactive video cutout , 2005, ACM Trans. Graph..

[31]  John Dingliana,et al.  Adding Depth to Cartoons Using Sparse Depth (In)equalities , 2010, Comput. Graph. Forum.

[32]  Gareth Funka-Lea,et al.  Graph Cuts and Efficient N-D Image Segmentation , 2006, International Journal of Computer Vision.

[33]  Tai-Pang Wu,et al.  Video repairing: inference of foreground and background under severe occlusion , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[34]  Harry Shum,et al.  Video tooning , 2004, ACM Trans. Graph..

[35]  Paul J. Besl,et al.  A Method for Registration of 3-D Shapes , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[36]  Andrew Blake,et al.  "GrabCut" , 2004, ACM Trans. Graph..