ClipGen: A Deep Generative Model for Clipart Vectorization and Synthesis

In this paper, we present a novel deep learning-based approach for automatically vectorizing and synthesizing clipart of man-made objects. Given a raster clipart image and it's corresponding object category (e.g., airplane), our method sequentially generates new layers. Each layer is composed of a new closed path filled with a single color. The final result is obtained by compositing all layers together into a vector clipart that falls into the target category. At the core of our approach is an iterative generative model that decides (i) whether to keep synthesizing a new layer and (ii) the geometry and appearance of the new layer. We formulate a joint loss function for training our generative model, including shape similarity, symmetry, local curve smoothness losses, and a vector graphics rendering accuracy loss to synthesize recognizable clipart. We also introduce a collection of man-made object clipart, ClipNet, composed of layers of a closed path, and we design two preprocessing tasks to clean up and enrich the original raw clipart. To validate our approach, we perform several validation studies and demonstrate the ability to vectorize and synthesize various clipart categories. We envision that our generative model can facilitate efficient and intuitive clipart design for novice users and graphic designers.

[1]  Jaakko Lehtinen,et al.  Analyzing and Improving the Image Quality of StyleGAN , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[3]  Harry Shum,et al.  Image vectorization using optimized gradient meshes , 2007, ACM Trans. Graph..

[4]  Siddhartha Chaudhuri,et al.  SCORES: Shape Composition with Recursive Substructure Priors , 2018, ACM Trans. Graph..

[5]  Christian Wolf,et al.  Interactive example-based terrain authoring with conditional generative adversarial networks , 2017, ACM Trans. Graph..

[6]  Alexei A. Efros,et al.  Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[7]  Marc Alexa,et al.  How do humans sketch objects? , 2012, ACM Trans. Graph..

[8]  Siddhartha Chaudhuri,et al.  A probabilistic model for component-based shape synthesis , 2012, ACM Trans. Graph..

[9]  Pat Hanrahan,et al.  Example-based synthesis of 3D object arrangements , 2012, ACM Trans. Graph..

[10]  Leonidas J. Guibas,et al.  Probabilistic reasoning for assembly-based 3D modeling , 2011, ACM Trans. Graph..

[11]  Li Fei-Fei,et al.  ImageNet: A large-scale hierarchical image database , 2009, CVPR.

[12]  Adrien Bousseau,et al.  Photo2clipart , 2017, ACM Trans. Graph..

[13]  Tien-Tsin Wong,et al.  Deep unsupervised pixelization , 2018, ACM Trans. Graph..

[14]  Hao Su,et al.  A Point Set Generation Network for 3D Object Reconstruction from a Single Image , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Tzu-Mao Li,et al.  Differentiable vector graphics rasterization for editing and learning , 2020, ACM Trans. Graph..

[16]  Xiaogang Wang,et al.  Deep Learning Face Attributes in the Wild , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[17]  Ersin Yumer,et al.  3D-PRNN: Generating Shape Primitives with Recurrent Neural Networks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[18]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[19]  Pascal Barla,et al.  Diffusion curves: a vector representation for smooth-shaded images , 2008, ACM Trans. Graph..

[20]  Aaron C. Courville,et al.  Improved Training of Wasserstein GANs , 2017, NIPS.

[21]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Alexei A. Efros,et al.  Generative Visual Manipulation on the Natural Image Manifold , 2016, ECCV.

[23]  Markus H. Gross,et al.  Semantic Segmentation for Line Drawing Vectorization Using Neural Networks , 2018, Comput. Graph. Forum.

[24]  Rynson W. H. Lau,et al.  Content-aware generative modeling of graphic design layouts , 2019, ACM Trans. Graph..

[25]  Léon Bottou,et al.  Wasserstein GAN , 2017, ArXiv.

[26]  Wenpeng Yin,et al.  Comparative Study of CNN and RNN for Natural Language Processing , 2017, ArXiv.

[27]  Jianxiong Xiao,et al.  3D ShapeNets: A deep representation for volumetric shapes , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Sanja Fidler,et al.  Efficient Interactive Annotation of Segmentation Datasets with Polygon-RNN++ , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[29]  Luc Van Gool,et al.  A Benchmark Dataset and Evaluation Methodology for Video Object Segmentation , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  Geoffrey E. Hinton,et al.  Layer Normalization , 2016, ArXiv.

[31]  Thomas A. Funkhouser,et al.  Semantic Scene Completion from a Single Depth Image , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[32]  Alexei A. Efros,et al.  Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[33]  Daniel Cohen-Or,et al.  GRAINS , 2018, ACM Trans. Graph..

[34]  Adam Finkelstein,et al.  Where do people draw lines? , 2008, ACM Trans. Graph..

[35]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[36]  Takeo Igarashi,et al.  Interactive Subspace Exploration on Generative Image Modelling , 2019, ArXiv.

[37]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[38]  Niloy J. Mitra,et al.  Learning a shared shape space for multimodal garment design , 2018, ACM Trans. Graph..

[39]  Simon Osindero,et al.  Conditional Generative Adversarial Nets , 2014, ArXiv.

[40]  Hui Huang,et al.  RPM-Net , 2019, ACM Trans. Graph..

[41]  Christian Rössl,et al.  Laplacian surface editing , 2004, SGP '04.

[42]  Tingfa Xu,et al.  LayoutGAN: Generating Graphic Layouts with Wireframe Discriminators , 2019, ICLR.

[43]  Luca Antiga,et al.  Automatic differentiation in PyTorch , 2017 .

[44]  Alla Sheffer,et al.  Perception-driven semi-structured boundary vectorization , 2018, ACM Trans. Graph..

[45]  Douglas Eck,et al.  A Neural Representation of Sketch Drawings , 2017, ICLR.

[46]  Leonidas J. Guibas,et al.  GRASS: Generative Recursive Autoencoders for Shape Structures , 2017, ACM Trans. Graph..

[47]  Dani Lischinski,et al.  Depixelizing pixel art , 2011, ACM Trans. Graph..

[48]  Marc Alexa,et al.  ABC: A Big CAD Model Dataset for Geometric Deep Learning , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[49]  Holger Winnemöller,et al.  Interactive Vectorization , 2017, CHI.

[50]  Harshad Rai,et al.  Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks , 2018 .

[51]  Adrien Bousseau,et al.  Depicting stylized materials with vector shade trees , 2013, ACM Trans. Graph..

[52]  Max Welling,et al.  Auto-Encoding Variational Bayes , 2013, ICLR.

[53]  Leonidas J. Guibas,et al.  ShapeNet: An Information-Rich 3D Model Repository , 2015, ArXiv.

[54]  Sanja Fidler,et al.  Creative Flow+ Dataset , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[55]  Mathieu Aubry,et al.  Vector Image Generation by Learning Parametric Layer Decomposition , 2018, ArXiv.

[56]  Douglas Eck,et al.  A Learned Representation for Scalable Vector Graphics , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[57]  Daniel Cohen-Or,et al.  GlobFit: consistently fitting primitives by discovering global relations , 2011, ACM Trans. Graph..

[58]  Adrien Bousseau,et al.  OpenSketch: a richly-annotated dataset of product design sketches , 2019, ACM Trans. Graph..

[59]  Bolei Zhou,et al.  Semantic photo manipulation with a generative image prior , 2019, ACM Trans. Graph..

[60]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[61]  Noah Snavely,et al.  OpenSurfaces , 2013, ACM Trans. Graph..

[62]  Daphna Weinshall,et al.  On The Power of Curriculum Learning in Training Deep Networks , 2019, ICML.

[63]  Angel X. Chang,et al.  Deep convolutional priors for indoor scene synthesis , 2018, ACM Trans. Graph..

[64]  Mikhail Bessmeltsev,et al.  Vectorization of Line Drawings via Polyvector Fields , 2018, ACM Trans. Graph..

[65]  Tomas Pfister,et al.  Learning from Simulated and Unsupervised Images through Adversarial Training , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[66]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[67]  Szymon Rusinkiewicz,et al.  Data-driven iconification , 2016, Expressive.

[68]  Diego Gutierrez,et al.  A similarity measure for illustration style , 2014, ACM Trans. Graph..

[69]  Jason Weston,et al.  Curriculum learning , 2009, ICML '09.

[70]  Leonidas J. Guibas,et al.  PartNet: A Large-Scale Benchmark for Fine-Grained and Hierarchical Part-Level 3D Object Understanding , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[71]  Yizhou Yu,et al.  Patch-based image vectorization with automatic curvilinear feature alignment , 2009, ACM Trans. Graph..