论文信息 - Logo Synthesis and Manipulation with Clustered Generative Adversarial Networks

Logo Synthesis and Manipulation with Clustered Generative Adversarial Networks

Designing a logo for a new brand is a lengthy and tedious back-and-forth process between a designer and a client. In this paper we explore to what extent machine learning can solve the creative task of the designer. For this, we build a dataset - LLD - of 600k+ logos crawled from the world wide web. Training Generative Adversarial Networks (GANs) for logo synthesis on such multi-modal data is not straightforward and results in mode collapse for some state-of-the-art methods. We propose the use of synthetic labels obtained through clustering to disentangle and stabilize GAN training, and validate this approach on CIFAR-10 and ImageNet-small to demonstrate its generality. We are able to generate a high diversity of plausible logos and demonstrate latent space exploration techniques to ease the logo design task in an interactive manner. GANs can cope with multi-modal data by means of synthetic labels achieved through clustering, and our results show the creative potential of such techniques for logo synthesis and manipulation. Our dataset and models are publicly available at https://data.vision.ee.ethz.ch/sagea/lld/.

[1] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[2] Simon Haykin,et al. GradientBased Learning Applied to Document Recognition , 2001 .

[3] Zhou Wang,et al. Multiscale structural similarity for image quality assessment , 2003, The Thrity-Seventh Asilomar Conference on Signals, Systems & Computers, 2003.

[4] D. Doermann,et al. Automatic Document Logo Detection , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[5] Alex Krizhevsky,et al. Learning Multiple Layers of Features from Tiny Images , 2009 .

[6] Olivier Buisson,et al. Logo retrieval with a contrario visual query expansion , 2009, ACM Multimedia.

[7] Yannis Avrithis,et al. Scalable triangulation-based logo recognition , 2011, ICMR.

[8] Rainer Lienhart,et al. Scalable logo recognition in real-world images , 2011, ICMR.

[9] David S. Doermann,et al. No-Reference Image Quality Assessment Using Visual Codebooks , 2012, IEEE Transactions on Image Processing.

[10] Simon Osindero,et al. Conditional Generative Adversarial Nets , 2014, ArXiv.

[11] Daan Wierstra,et al. Stochastic Backpropagation and Approximate Inference in Deep Generative Models , 2014, ICML.

[12] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.

[13] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.

[14] Max Welling,et al. Semi-supervised Learning with Deep Generative Models , 2014, NIPS.

[15] Yinda Zhang,et al. LSUN: Construction of a Large-scale Image Dataset using Deep Learning with Humans in the Loop , 2015, ArXiv.

[16] Xiaogang Wang,et al. Deep Learning Face Attributes in the Wild , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[17] Thomas Brox,et al. Learning to generate chairs with convolutional neural networks , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[18] M. Student,et al. Context-Dependent Logo Matching and Recognition , 2015 .

[19] Qiang Wu,et al. LOGO-Net: Large-scale Deep Logo Detection and Brand Recognition with Deep Region-based Convolutional Networks , 2015, ArXiv.

[20] Michael S. Bernstein,et al. ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[21] Alex Graves,et al. Conditional Image Generation with PixelCNN Decoders , 2016, NIPS.

[22] Bogdan Raducanu,et al. Invertible Conditional GANs for image editing , 2016, ArXiv.

[23] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24] Honglak Lee,et al. Attribute2Image: Conditional Image Generation from Visual Attributes , 2015, ECCV.

[25] Koray Kavukcuoglu,et al. Pixel Recurrent Neural Networks , 2016, ICML.

[26] Wojciech Zaremba,et al. Improved Techniques for Training GANs , 2016, NIPS.

[27] Bernt Schiele,et al. Generative Adversarial Text to Image Synthesis , 2016, ICML.

[28] Soumith Chintala,et al. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[29] Philip Bachman,et al. Calibrating Energy-based Generative Adversarial Networks , 2017, ICLR.

[30] Andrew Brock,et al. Neural Photo Editing with Introspective Adversarial Networks , 2016, ICLR.

[31] Ian J. Goodfellow,et al. NIPS 2016 Tutorial: Generative Adversarial Networks , 2016, ArXiv.

[32] Yoshua Bengio,et al. Improving Generative Adversarial Networks with Denoising Feature Matching , 2016, ICLR.

[33] Raymond Y. K. Lau,et al. Least Squares Generative Adversarial Networks , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[34] R. Timofte,et al. ABC-GAN : Adaptive Blur and Control for improved training stability of Generative Adversarial Networks , 2017 .

[35] Léon Bottou,et al. Towards Principled Methods for Training Generative Adversarial Networks , 2017, ICLR.

[36] John E. Hopcroft,et al. Stacked Generative Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[37] David Berthelot,et al. BEGAN: Boundary Equilibrium Generative Adversarial Networks , 2017, ArXiv.

[38] Léon Bottou,et al. Wasserstein Generative Adversarial Networks , 2017, ICML.

[39] Sina Honari,et al. Learning to Generate Samples from Noise through Infusion Training , 2017, ICLR.

[40] Shaogang Gong,et al. WebLogo-2M: Scalable Logo Detection by Deep Learning from the Web , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[41] Jonathon Shlens,et al. Conditional Image Synthesis with Auxiliary Classifier GANs , 2016, ICML.

[42] Aaron C. Courville,et al. Improved Training of Wasserstein GANs , 2017, NIPS.

[43] Aaron C. Courville,et al. Adversarially Learned Inference , 2016, ICLR.

[44] Luc Van Gool,et al. Optimal transport maps for distribution preserving operations on latent spaces of Generative Models , 2019, ICLR.