MARVEL: Raster Manga Vectorization via Primitive-wise Deep Reinforcement Learning

Manga is a popular Japanese comic form composed of black-and-white strokes and is generally displayed as raster images on digital devices. Typical manga has simple textures, wide lines, and few color gradients, properties that make it well suited to vectorization and to the merits of vector graphics, e.g., adaptive resolution and small file sizes. In this paper, we propose MARVEL (MAnga's Raster to VEctor Learning), a primitive-wise approach for vectorizing raster manga by Deep Reinforcement Learning (DRL). Unlike previous learning-based methods, which predict vector parameters for an entire image, MARVEL regards an entire manga as a collection of basic primitives, namely stroke lines, and designs a DRL model that decomposes the target image into a primitive sequence to achieve accurate vectorization. To improve vectorization accuracy and reduce file size, we further propose a stroke accuracy reward to predict accurate stroke lines, and a pruning mechanism to avoid generating erroneous and repeated strokes. Extensive subjective and objective experiments show that MARVEL generates impressive results and reaches the state-of-the-art level. Our code is open-source at: https://github.com/SwordHolderSH/Mang2Vec.
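The primitive-wise idea can be illustrated with a toy sketch. This is not the MARVEL model: the DRL agent is replaced by a fixed list of candidate strokes, the reward is assumed to be the drop in squared pixel error after drawing a stroke, and pruning is assumed to mean discarding any stroke whose reward does not exceed a threshold. The names `Stroke`, `draw`, `vectorize`, and `min_gain` are hypothetical.

```python
from typing import List, Tuple

Canvas = List[List[float]]          # 1.0 = white paper, 0.0 = black ink
Stroke = Tuple[int, int, int, int]  # hypothetical primitive: (x0, y0, x1, y1)


def blank(h: int, w: int) -> Canvas:
    """Create an all-white canvas of height h and width w."""
    return [[1.0] * w for _ in range(h)]


def draw(canvas: Canvas, stroke: Stroke) -> Canvas:
    """Return a copy of the canvas with a 1-pixel-wide black segment drawn on it."""
    x0, y0, x1, y1 = stroke
    out = [row[:] for row in canvas]
    steps = max(abs(x1 - x0), abs(y1 - y0), 1)
    for i in range(steps + 1):
        t = i / steps
        x = round(x0 + t * (x1 - x0))
        y = round(y0 + t * (y1 - y0))
        out[y][x] = 0.0
    return out


def l2(a: Canvas, b: Canvas) -> float:
    """Sum of squared pixel differences between two canvases."""
    return sum((p - q) ** 2 for ra, rb in zip(a, b) for p, q in zip(ra, rb))


def vectorize(target: Canvas, candidates: List[Stroke],
              min_gain: float = 0.0) -> Tuple[List[Stroke], Canvas]:
    """Keep only candidate strokes that reduce the error against the target."""
    canvas = blank(len(target), len(target[0]))
    kept: List[Stroke] = []
    for s in candidates:
        trial = draw(canvas, s)
        # Assumed reward: how much this stroke reduces the reconstruction error.
        reward = l2(target, canvas) - l2(target, trial)
        if reward > min_gain:       # assumed pruning rule: drop useless strokes
            canvas = trial
            kept.append(s)
    return kept, canvas


# Target: a 5x5 cross made of one horizontal and one vertical stroke.
target = draw(draw(blank(5, 5), (0, 2, 4, 2)), (2, 0, 2, 4))
candidates = [(0, 2, 4, 2), (0, 2, 4, 2), (2, 0, 2, 4), (0, 0, 4, 0)]
kept, result = vectorize(target, candidates)
```

In this example the repeated horizontal stroke earns zero reward and the stroke along the top row earns a negative reward, so both are pruned and the kept sequence reproduces the target with two primitives.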
