SmartPaint: a co-creative drawing system based on generative adversarial networks

Artificial intelligence (AI) has played a significant role in imitating and producing large-scale designs such as e-commerce banners. However, it is less successful at creative and collaborative design outputs. Most humans express their ideas as rough sketches, and lack the professional skills to complete pleasing paintings. Existing AI approaches have failed to convert varied user sketches into artistically beautiful paintings while preserving their semantic concepts. To bridge this gap, we have developed SmartPaint, a co-creative drawing system based on generative adversarial networks (GANs), enabling a machine and a human being to collaborate in cartoon landscape painting. SmartPaint trains a GAN using triples of cartoon images, their corresponding semantic label maps, and edge detection maps. The machine can then simultaneously understand the cartoon style and semantics, along with the spatial relationships among the objects in the landscape images. The trained system receives a sketch as a semantic label map input, and automatically synthesizes its edge map for stable handling of varied sketches. It then outputs a creative and fine painting with the appropriate style corresponding to the human’s sketch. Experiments confirmed that the proposed SmartPaint system successfully generates high-quality cartoon paintings.

[1]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[2]  Alexei A. Efros,et al.  Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3]  K. Sasaki,et al.  Learning to simplify , 2016, ACM Trans. Graph..

[4]  Francisco Vides,et al.  I don't believe my eyes!: geometric sketch recognition for a computer art tutorial , 2012, SBIM '12.

[5]  Brian Magerko,et al.  Empirically Studying Participatory Sense-Making in Abstract Drawing with a Co-Creative Cognitive Agent , 2016, IUI.

[6]  Sungwoo Lee,et al.  I Lead, You Help but Only with Enough Details: Understanding User Experience of Co-Creation with Artificial Intelligence , 2018, CHI.

[7]  Yong Jae Lee,et al.  ShadowDraw: real-time user guidance for freehand drawing , 2011, ACM Trans. Graph..

[8]  Yong-Jin Liu,et al.  CartoonGAN: Generative Adversarial Networks for Photo Cartoonization , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[9]  Nelson Siu-Hang Chu,et al.  Real-time painting with an expressive virtual Chinese brush , 2004, IEEE Computer Graphics and Applications.

[10]  Matthias Zwicker,et al.  Faceshop , 2018, ACM Trans. Graph..

[11]  拓海 杉山,et al.  “Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks”の学習報告 , 2017 .

[12]  Shi-Min Hu,et al.  Sketch2Photo: internet image montage , 2009, ACM Trans. Graph..

[13]  Brian Magerko,et al.  Co-Creative Drawing Agent with Object Recognition , 2016, AIIDE.

[14]  Chen Chen,et al.  Learning to detect salient curves of cartoon images based on composition rules , 2016, 2016 11th International Conference on Computer Science & Education (ICCSE).

[15]  Samy Bengio,et al.  Generating Sentences from a Continuous Space , 2015, CoNLL.

[16]  Chuang Gan,et al.  Sparse, Smart Contours to Represent and Edit Images , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[17]  Marc Alexa,et al.  Photosketcher: Interactive Sketch-Based Image Synthesis , 2011, IEEE Computer Graphics and Applications.

[18]  Alexei A. Efros,et al.  Generative Visual Manipulation on the Natural Image Manifold , 2016, ECCV.

[19]  Simon Osindero,et al.  Conditional Generative Adversarial Nets , 2014, ArXiv.

[20]  Rynson W. H. Lau,et al.  What characterizes personalities of graphic designs? , 2018, ACM Trans. Graph..

[21]  Fisher Yu,et al.  Scribbler: Controlling Deep Image Synthesis with Sketch and Color , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[22]  Pegah Karimi,et al.  Deep Learning for Identifying Potential Conceptual Shifts for Co-creative Drawing , 2018, ArXiv.

[23]  Sylvain Paris,et al.  Deep Photo Style Transfer , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24]  Linda Doyle,et al.  Painting style transfer for head portraits using convolutional neural networks , 2016, ACM Trans. Graph..

[25]  Hao Wang,et al.  Real-Time Neural Style Transfer for Videos , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Holger Winnemöller,et al.  Painting with Bob: assisted creativity for novices , 2014, UIST.

[27]  Keiji Yanai,et al.  Neural Font Style Transfer , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).

[28]  Marcel van Gerven,et al.  Convolutional Sketch Inversion , 2016, ECCV Workshops.

[29]  Wei Liu,et al.  Unsupervised Image-to-Image Translation with Stacked Cycle-Consistent Adversarial Networks , 2018, ECCV.

[30]  Brian Magerko,et al.  Drawing Apprentice: An Enactive Co-Creative Agent for Artistic Collaboration , 2015, Creativity & Cognition.

[31]  Daniel Dixon,et al.  iCanDraw: using sketch recognition and corrective feedback to assist a user in drawing human faces , 2010, CHI.

[32]  Jitendra Malik,et al.  Shape Context: A New Descriptor for Shape Matching and Object Recognition , 2000, NIPS.

[33]  J. Canny A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Nicholas Davis,et al.  Human-Computer Co-Creativity: Blending Human and Computational Creativity , 2013, Proceedings of the AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment.

[35]  Leon A. Gatys,et al.  Image Style Transfer Using Convolutional Neural Networks , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[36]  Haojie Li,et al.  User-Guided Deep Anime Line Art Colorization with Conditional Adversarial Networks , 2018, ACM Multimedia.

[37]  Zengchang Qin,et al.  Auto-painter: Cartoon image generation from sketch by using conditional Wasserstein generative adversarial networks , 2018, Neurocomputing.

[38]  Masayuki Nakajima,et al.  Contour-driven Sumi-e rendering of real photos , 2011, Comput. Graph..

[39]  Douglas Eck,et al.  A Neural Representation of Sketch Drawings , 2017, ICLR.