Reference-Based Deep Line Art Video Colorization

Coloring line art images based on the colors of reference images is an important stage in animation production, but it is time-consuming and tedious. In this paper, we propose a deep architecture that automatically colors line art videos in the same color style as given reference images. Our framework consists of a color transform network and a temporal constraint network. The color transform network takes the target line art image, together with the line art and color images of one or more references, as input and generates the corresponding target color image. To cope with large differences between the target line art image and the reference color images, our architecture uses non-local similarity matching to determine region correspondences between the target and the references, which are used to transfer local color information from the references to the target. To ensure global color-style consistency, we further incorporate Adaptive Instance Normalization (AdaIN), whose transformation parameters are obtained from a style embedding vector that describes the global color style of the references, extracted by an embedder network. The temporal constraint network takes the reference images and the target image together in chronological order and learns spatiotemporal features through 3D convolutions to enforce temporal consistency between the target and the references. When handling an animation in a new style, our model achieves even better coloring results after fine-tuning its parameters on only a small number of samples. To evaluate our method, we build a line art coloring dataset. Experiments show that our method outperforms state-of-the-art methods and other baselines on line art video coloring.
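The AdaIN step described above can be sketched as follows. This is a minimal numpy illustration, not the paper's implementation: the content features are normalized per channel, then rescaled with style statistics that here stand in for the affine parameters the paper predicts from its style embedding vector (all names are illustrative).

```python
import numpy as np

def adain(content, style_mean, style_std, eps=1e-5):
    """Adaptive Instance Normalization (Huang & Belongie, 2017).

    content: feature map of shape (C, H, W).
    style_mean, style_std: per-channel statistics of shape (C,),
    standing in for the parameters derived from the style embedding.
    """
    # Per-channel statistics of the content features.
    c_mean = content.mean(axis=(1, 2), keepdims=True)
    c_std = content.std(axis=(1, 2), keepdims=True)
    # Normalize, then re-scale and re-shift with the style statistics.
    normalized = (content - c_mean) / (c_std + eps)
    return style_std[:, None, None] * normalized + style_mean[:, None, None]
```

After this transform, each channel of the output carries the mean and standard deviation supplied by the style path, which is how the global color style of the references is imposed on the target features.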
