Paper Information - Mastering Sketching

Mastering Sketching

We present an integral framework for training sketch simplification networks that convert challenging rough sketches into clean line drawings. Our approach augments a simplification network with a discriminator network and trains the two jointly: the discriminator learns to discern whether a line drawing is real training data or the output of the simplification network, which in turn tries to fool it. This approach has two major advantages. First, because the discriminator network learns the structure in line drawings, it encourages the output sketches of the simplification network to be more similar in appearance to the training sketches. Second, we can also train the networks with additional unsupervised data: by adding rough sketches and line drawings that do not correspond to each other, we can improve the quality of the sketch simplification. Thanks to a difference in the architecture, our approach has advantages over similar adversarial training approaches in the stability of training and in the aforementioned ability to utilize unsupervised training data. We show how our framework can be used to train models that significantly outperform the state of the art in the sketch simplification task, despite using the same architecture for inference. We also present an approach to optimize for a single image, which improves accuracy at the cost of additional computation time. Finally, we show that, using the same framework, it is possible to train the network to solve the inverse problem, i.e., to convert simple line sketches into pencil drawings, which is not possible using the standard mean squared error loss. We validate our framework with two user tests, in which our approach is preferred to the state of the art in sketch simplification 88.9% of the time.
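
The joint training described in the abstract can be illustrated with a minimal sketch of the loss terms involved. This is not the paper's exact formulation: the function names, the weight `alpha`, and the non-saturating form of the adversarial term are assumptions made for illustration. It only shows how a paired (supervised) loss can combine a mean squared error term with a discriminator term, and how unpaired rough sketches can still contribute through the discriminator term alone.

```python
import numpy as np

EPS = 1e-8  # small constant to keep the logarithms finite


def paired_simplifier_loss(pred, target, d_fake, alpha=1.0):
    """Loss for the simplification network on a paired example.

    pred, target: arrays with pixel values in [0, 1].
    d_fake: discriminator's probability that `pred` is a real line drawing.
    alpha: hypothetical weight balancing the two terms (an assumption).
    """
    mse = np.mean((pred - target) ** 2)    # supervised reconstruction term
    adv = -np.mean(np.log(d_fake + EPS))   # reward for fooling the discriminator
    return mse + alpha * adv


def unpaired_simplifier_loss(d_fake, alpha=1.0):
    """Loss for a rough sketch that has no matching clean drawing.

    With no target there is no MSE term; only the adversarial term remains.
    """
    return -alpha * np.mean(np.log(d_fake + EPS))


def discriminator_loss(d_real, d_fake):
    """Binary cross-entropy: real drawings should score 1, outputs 0."""
    real_term = -np.mean(np.log(d_real + EPS))
    fake_term = -np.mean(np.log(1.0 - d_fake + EPS))
    return real_term + fake_term
```

In this sketch, unpaired clean line drawings would enter only through `d_real` in `discriminator_loss`, and unpaired rough sketches only through `unpaired_simplifier_loss`, which is one way to read the abstract's claim that non-corresponding data can be used during training.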
