Gating PatternPyramid for diversified image style transfer

Abstract. In diversified image style transfer tasks, the goal is to translate an everyday photograph to diverse stylized images conditioned on the style of an artwork. Although several style transfer methods have achieved certain diversity through noise injection, they still have two unsolved problems: (1) relatively limited diversity and (2) significant degradation of quality. In this work, we introduce an effective Gating PatternPyramid block (GPP) to resolve these issues by employing a well-designed multibranch architecture. The main idea is underpinned by a finding that different network architectures capture different style patterns from the same artwork. In addition, the GPP block is compatible with many feed-forward style transfer models and empowers them with the ability of generating diverse stylization results. To the best of our knowledge, this is the first style transfer method that achieves significant diversity without injecting random noise, which provides an inspiring perspective for diverse translation research. We perform qualitative and quantitative experiments, showing the effectiveness and superiority of our method against the state-of-the-art diversified style transfer methods.

[1]  Lei Zhao,et al.  AesUST: Towards Aesthetic-Enhanced Universal Style Transfer , 2022, ACM Multimedia.

[2]  Zhizhong Wang,et al.  Universal Video Style Transfer via Crystallization, Separation, and Blending , 2022, IJCAI.

[3]  Lei Zhao,et al.  Style Fader Generative Adversarial Networks for Style Degree Controllable Artistic Style Transfer , 2022, IJCAI.

[4]  Lei Zhao,et al.  Texture Reformer: Towards Fast and Universal Interactive Texture Transfer , 2021, AAAI.

[5]  Lei Zhao,et al.  DivSwapper: Towards Diversified Patch-based Arbitrary Style Transfer , 2021, IJCAI.

[6]  Lei Zhao,et al.  Diverse Image Style Transfer via Invertible Cross-Space Mapping , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[7]  Lei Zhao,et al.  Diversified text-to-image generation via deep mutual information estimation , 2021, Comput. Vis. Image Underst..

[8]  Errui Ding,et al.  AdaAttN: Revisit Attention Mechanism in Arbitrary Neural Style Transfer , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[9]  Lei Zhao,et al.  DualAST: Dual Style-Learning Networks for Artistic Style Transfer , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Xinbo Gao,et al.  Drafting and Revision: Laplacian Pyramid Network for Fast High-Quality Artistic Style Transfer , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Arthur Heimbrecht,et al.  Rethinking Style Transfer: From Pixels to Parameterized Brushstrokes , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Siyu Huang,et al.  ArtFlow: Unbiased Image Style Transfer via Reversible Neural Flows , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Dongming Lu,et al.  Evaluate and improve the quality of neural style transfer , 2021, Comput. Vis. Image Underst..

[14]  J. D. Wegner,et al.  In the light of feature distributions: moment matching for Neural Style Transfer , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Minseong Kim,et al.  Uncorrelated Feature Encoding for Faster Image Style Transfer , 2018, Neural Networks.

[16]  Lei Zhao,et al.  Artistic Style Transfer with Internal-external Learning and Contrastive Learning , 2021, NeurIPS.

[17]  Lei Zhao,et al.  GLStyleNet: exquisite style transfer combining global and local pyramid features , 2020, IET Comput. Vis..

[18]  Lei Zhao,et al.  Creative and diverse artwork generation using adversarial networks , 2020, IET Comput. Vis..

[19]  Jie Li,et al.  Image style transfer with collection representation space and semantic-guided reconstruction , 2020, Neural Networks.

[20]  Lei Zhao,et al.  UCTGAN: Diverse Image Inpainting Based on Unsupervised Cross-Space Translation , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Yun Sheng,et al.  Stylization-Based Architecture for Fast Deep Exemplar Colorization , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[22]  Weiming Dong,et al.  Arbitrary Style Transfer via Multi-Adaptation Network , 2020, ACM Multimedia.

[23]  Gregory Shakhnarovich,et al.  Deformable Style Transfer , 2020, ECCV.

[24]  Victor Lempitsky,et al.  High-Resolution Daytime Translation Without Domain Labels , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Ming-Hsuan Yang,et al.  Collaborative Distillation for Ultra-Resolution Universal Style Transfer , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Maneesh Kumar Singh,et al.  DRIT++: Diverse Image-to-Image Translation via Disentangled Representations , 2020, International Journal of Computer Vision.

[27]  Jung-Woo Ha,et al.  StarGAN v2: Diverse Image Synthesis for Multiple Domains , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Shilei Wen,et al.  Dynamic Instance Normalization for Arbitrary Style Transfer , 2019, AAAI.

[29]  Lei Zhao,et al.  Diversified Arbitrary Style Transfer via Deep Feature Perturbation , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  Jonathan Masci,et al.  Two-Stage Peer-Regularized Feature Recombination for Arbitrary Image Style Transfer , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[31]  Minglun Gong,et al.  EFANet: Exchangeable Feature Alignment Network for Arbitrary Style Transfer , 2018, AAAI.

[32]  Björn Ommer,et al.  Content and Style Disentanglement for Artistic Style Transfer , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[33]  Feng Xu,et al.  A Closed-Form Solution to Universal Style Transfer , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[34]  Björn Ommer,et al.  A Content Transformation Block for Image Style Transfer , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[35]  Gregory Shakhnarovich,et al.  Style Transfer by Relaxed Optimal Transport and Self-Similarity , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[36]  Yun Fu,et al.  Multimodal Style Transfer via Graph Cuts , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[37]  Siwei Ma,et al.  Mode Seeking Generative Adversarial Networks for Diverse Image Synthesis , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[38]  Jun Wang,et al.  Attention-Aware Multi-Stroke Style Transfer , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[39]  Neil Smith,et al.  Latent Filter Scaling for Multimodal Unsupervised Image-To-Image Translation , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[40]  Kwang Hee Lee,et al.  Arbitrary Style Transfer With Style-Attentional Networks , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  Björn Ommer,et al.  A Style-Aware Content Loss for Real-time HD Style Transfer , 2018, ECCV.

[42]  Xiaogang Wang,et al.  Avatar-Net: Multi-scale Zero-Shot Style Transfer by Feature Decoration , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[43]  Jan Kautz,et al.  Multimodal Unsupervised Image-to-Image Translation , 2018, ECCV.

[44]  Zunlei Feng,et al.  Stroke Controllable Fast Style Transfer with Adaptive Receptive Fields , 2018, ECCV.

[45]  Alexei A. Efros,et al.  The Unreasonable Effectiveness of Deep Features as a Perceptual Metric , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[46]  Hang Zhang,et al.  Multi-style Generative Network for Real-time Transfer , 2017, ECCV Workshops.

[47]  Alexei A. Efros,et al.  Toward Multimodal Image-to-Image Translation , 2017, NIPS.

[48]  Ming-Hsuan Yang,et al.  Universal Style Transfer via Feature Transforms , 2017, NIPS.

[49]  Nenghai Yu,et al.  StyleBank: An Explicit Representation for Neural Image Style Transfer , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[50]  Serge J. Belongie,et al.  Arbitrary Style Transfer in Real-Time with Adaptive Instance Normalization , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[51]  Ming-Hsuan Yang,et al.  Diversified Texture Synthesis with Feed-Forward Networks , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[52]  Andrea Vedaldi,et al.  Improved Texture Networks: Maximizing Quality and Diversity in Feed-Forward Stylization and Texture Synthesis , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[53]  Xin Wang,et al.  Multimodal Transfer: A Hierarchical Deep Convolutional Neural Network for Fast Artistic Style Transfer , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[54]  Jonathon Shlens,et al.  A Learned Representation For Artistic Style , 2016, ICLR.

[55]  Mark W. Schmidt,et al.  Fast Patch-based Style Transfer of Arbitrary Style , 2016, ArXiv.

[56]  Leon A. Gatys,et al.  Image Style Transfer Using Convolutional Neural Networks , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[57]  Chuan Li,et al.  Precomputed Real-Time Texture Synthesis with Markovian Generative Adversarial Networks , 2016, ECCV.

[58]  Li Fei-Fei,et al.  Perceptual Losses for Real-Time Style Transfer and Super-Resolution , 2016, ECCV.

[59]  Alex J. Champandard,et al.  Semantic Style Transfer and Turning Two-Bit Doodles into Fine Artworks , 2016, ArXiv.

[60]  Chuan Li,et al.  Combining Markov Random Fields and Convolutional Neural Networks for Image Synthesis , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[61]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[62]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[63]  Trevor Darrell,et al.  Recognizing Image Style , 2013, BMVC.