论文信息 - Deep Facial Synthesis: A New Challenge

Deep Facial Synthesis: A New Challenge

The goal of this paper is to conduct a comprehensive study on the facial sketch synthesis (FSS) problem. However, due to the high costs in obtaining hand-drawn sketch datasets, there lacks a complete benchmark for assessing the development of FSS algorithms over the last decade. As such, we first introduce a high-quality dataset for FSS, named FS2K, which consists of 2,104 image-sketch pairs spanning three types of sketch styles, image backgrounds, lighting conditions, skin colors, and facial attributes. FS2K differs from previous FSS datasets in difficulty, diversity, and scalability, and should thus facilitate the progress of FSS research. Second, we present the largest-scale FSS study by investigating 139 classical methods, including 24 handcrafted feature based facial sketch synthesis approaches, 37 general neural-style transfer methods, 43 deep image-to-image translation methods, and 35 image-tosketch approaches. Besides, we elaborate comprehensive experiments for existing 19 cutting-edge models. Third, we present a simple baseline for FSS, named FSGAN. With only two straightforward components, i.e., facial-aware masking and style-vector expansion, FSGAN surpasses the performance of all previous state-of-the-art models on the proposed FS2K dataset, by a large margin. Finally, we conclude with lessons learned over the past years, and point out several unsolved challenges. Our open-source code is available at https://github.com/DengPingFan/FSGAN.

[1] Fisher Yu,et al. TextureGAN: Controlling Deep Image Synthesis with Texture Patches , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[2] Trevor Darrell,et al. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[3] Stefano Ermon,et al. SDEdit: Image Synthesis and Editing with Stochastic Differential Equations , 2021, ArXiv.

[4] Amit R.Sharma,et al. Face Photo-Sketch Synthesis and Recognition , 2012 .

[5] Andrea Vedaldi,et al. Improved Texture Networks: Maximizing Quality and Diversity in Feed-Forward Stylization and Texture Synthesis , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[6] Yunsong Li,et al. Markov Random Neural Fields for Face Sketch Synthesis , 2018, IJCAI.

[7] Jinwen Ma,et al. ELEGANT: Exchanging Latent Encodings with GAN for Transferring Multiple Face Attributes , 2018, ECCV.

[8] Alexei A. Efros,et al. Interactive Sketch & Fill: Multiclass Sketch-to-Image Translation , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[9] Quan Pan,et al. Semi-coupled dictionary learning with applications to image super-resolution and photo-sketch synthesis , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[10] Linda Doyle,et al. Painting style transfer for head portraits using convolutional neural networks , 2016, ACM Trans. Graph..

[11] Georgios Tzimiropoulos,et al. How Far are We from Solving the 2D & 3D Face Alignment Problem? (and a Dataset of 230,000 3D Facial Landmarks) , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[12] Xinbo Gao,et al. Cascaded Face Sketch Synthesis Under Various Illuminations , 2020, IEEE Transactions on Image Processing.

[13] Yang Song,et al. Age Progression/Regression by Conditional Adversarial Autoencoder , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[14] Liang Lin,et al. BeautyGAN: Instance-level Facial Makeup Transfer with Deep Generative Adversarial Network , 2018, ACM Multimedia.

[15] Pietro Perona,et al. Microsoft COCO: Common Objects in Context , 2014, ECCV.

[16] Shan Li,et al. Deep Facial Expression Recognition: A Survey , 2018, IEEE Transactions on Affective Computing.

[17] Bo Zhao,et al. Modular Generative Adversarial Networks , 2018, ECCV.

[18] Jan Kautz,et al. Unsupervised Image-to-Image Translation Networks , 2017, NIPS.

[19] Leon A. Gatys,et al. Controlling Perceptual Factors in Neural Style Transfer , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[20] Rama Chellappa,et al. Attributes for Improved Attributes: A Multi-Task Network Utilizing Implicit and Explicit Relationships for Facial Attribute Classification , 2017, AAAI.

[21] Yaxing Wang,et al. DeepI2I: Enabling Deep Hierarchical Image-to-Image Translation by Transferring from GANs , 2020, NeurIPS.

[22] Siwei Ma,et al. Mode Seeking Generative Adversarial Networks for Diverse Image Synthesis , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[23] Jonathan Krause,et al. 3D Object Representations for Fine-Grained Categorization , 2013, 2013 IEEE International Conference on Computer Vision Workshops.

[24] T. Onisawa,et al. Expressive facial caricature drawing , 1999, FUZZ-IEEE'99. 1999 IEEE International Fuzzy Systems. Conference Proceedings (Cat. No.99CH36315).

[25] Yunsong Li,et al. Deep Latent Low-Rank Representation for Face Sketch Synthesis , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[26] Zunlei Feng,et al. Stroke Controllable Fast Style Transfer with Adaptive Receptive Fields , 2018, ECCV.

[27] David Bau,et al. Sketch Your Own GAN , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[28] Yike Guo,et al. Semantic Image Synthesis via Adversarial Learning , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[29] Yingtao Tian,et al. Towards the Automatic Anime Characters Creation with Generative Adversarial Networks , 2017, ArXiv.

[30] Fei-Fei Li,et al. Novel Dataset for Fine-Grained Image Categorization : Stanford Dogs , 2012 .

[31] Arthur Heimbrecht,et al. Rethinking Style Transfer: From Pixels to Parameterized Brushstrokes , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[32] Bolei Zhou,et al. Learning Deep Features for Scene Recognition using Places Database , 2014, NIPS.

[33] 拓海杉山,et al. “Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks”の学習報告 , 2017 .

[34] Chuan Li,et al. Precomputed Real-Time Texture Synthesis with Markovian Generative Adversarial Networks , 2016, ECCV.

[35] Pietro Perona,et al. Building a bird recognition app and large scale dataset with citizen scientists: The fine print in fine-grained dataset collection , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[36] Abhishek Kumar,et al. Score-Based Generative Modeling through Stochastic Differential Equations , 2020, ICLR.

[37] Alexei A. Efros,et al. Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[38] Frédo Durand,et al. Style transfer for headshot portraits , 2014, ACM Trans. Graph..

[39] Francesc Moreno-Noguer,et al. GANimation: Anatomically-aware Facial Animation from a Single Image , 2018, ECCV.

[40] Chunna Tian,et al. Face Sketch Synthesis Algorithm Based on E-HMM and Selective Ensemble , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[41] Xuelong Li,et al. Face sketch-photo synthesis based on support vector regression , 2011, 2011 18th IEEE International Conference on Image Processing.

[42] Timo Aila,et al. A Style-Based Generator Architecture for Generative Adversarial Networks , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[43] Eric R. Ziegel,et al. The Elements of Statistical Learning , 2003, Technometrics.

[44] Yang Gao,et al. End-to-End Learning of Driving Models from Large-Scale Video Datasets , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[45] Roland Memisevic,et al. Incorporating long-range consistency in CNN-based texture generation , 2016, ICLR.

[46] Qi Liu,et al. SketchyCOCO: Image Generation From Freehand Scene Sketches , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[47] Nasser M. Nasrabadi,et al. Facial Attributes Guided Deep Sketch-to-Photo Synthesis , 2018, 2018 IEEE Winter Applications of Computer Vision Workshops (WACVW).

[48] Lei Cai,et al. Multi-Scale Gradients Self-Attention Residual Learning for Face Photo-Sketch Transformation , 2021, IEEE Transactions on Information Forensics and Security.

[49] Shree K. Nayar,et al. FaceTracer: A Search Engine for Large Collections of Images with Faces , 2008, ECCV.

[50] Thomas Brox,et al. A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[51] Hanqing Lu,et al. A nonlinear approach for face sketch synthesis and recognition , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[52] Jan Kautz,et al. High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[53] Yu-Ding Lu,et al. DRIT++: Diverse Image-to-Image Translation via Disentangled Representations , 2020, International Journal of Computer Vision.

[54] Kwang Hee Lee,et al. Arbitrary Style Transfer With Style-Attentional Networks , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[55] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[56] Jonathon Shlens,et al. A Learned Representation For Artistic Style , 2016, ICLR.

[57] Bogdan Raducanu,et al. Transferring GANs: generating images from limited data , 2018, ECCV.

[58] Xinbo Gao,et al. Face Recognition from Multiple Stylistic Sketches: Scenarios, Datasets, and Evaluation , 2016, ECCV Workshops.

[59] Xiaoming Liu,et al. Disentangled Representation Learning GAN for Pose-Invariant Face Recognition , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[60] Peter Wonka,et al. Image2StyleGAN: How to Embed Images Into the StyleGAN Latent Space? , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[61] Xuelong Li,et al. A Comprehensive Survey to Face Hallucination , 2013, International Journal of Computer Vision.

[62] Keren Fu,et al. An Identity-Preserved Model for Face Sketch-Photo Synthesis , 2020, IEEE Signal Processing Letters.

[63] Rongrong Ji,et al. Scoot: A Perceptual Metric for Facial Sketches , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[64] Xiaoou Tang,et al. Facial Landmark Detection by Deep Multi-task Learning , 2014, ECCV.

[65] Hugo Larochelle,et al. Optimization as a Model for Few-Shot Learning , 2016, ICLR.

[66] Jaakko Lehtinen,et al. Progressive Growing of GANs for Improved Quality, Stability, and Variation , 2017, ICLR.

[67] Derek Hoiem,et al. Indoor Segmentation and Support Inference from RGBD Images , 2012, ECCV.

[68] Aleix M. Martinez,et al. The AR face database , 1998 .

[69] Luc Van Gool,et al. Exemplar Guided Unsupervised Image-to-Image Translation with Semantic Consistency , 2018, ICLR.

[70] S. Prime,et al. “Do I Know You?” Altering hairstyle affects facial recognition , 2018 .

[71] Victor Lempitsky,et al. High-Resolution Daytime Translation Without Domain Labels , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[72] David Zhang,et al. FSIM: A Feature Similarity Index for Image Quality Assessment , 2011, IEEE Transactions on Image Processing.

[73] Jan Kautz,et al. Multimodal Unsupervised Image-to-Image Translation , 2018, ECCV.

[74] Andrew Y. Ng,et al. Reading Digits in Natural Images with Unsupervised Feature Learning , 2011 .

[75] Fang Wen,et al. CoCosNet v2: Full-Resolution Correspondence Learning for Image Translation , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[76] Xinbo Gao,et al. A Deep Collaborative Framework for Face Photo–Sketch Synthesis , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[77] Wei Liu,et al. Unsupervised Image-to-Image Translation with Stacked Cycle-Consistent Adversarial Networks , 2018, ECCV.

[78] Li Fei-Fei,et al. Characterizing and Improving Stability in Neural Style Transfer , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[79] Radim Sára,et al. Spatial Pattern Templates for Recognition of Objects with Regular Structure , 2013, GCPR.

[80] Yu Qiao,et al. Joint Face Detection and Alignment Using Multitask Cascaded Convolutional Networks , 2016, IEEE Signal Processing Letters.

[81] Iasonas Kokkinos,et al. Describing Textures in the Wild , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[82] Andrew Hunter,et al. Learnable Stroke Models for Example-based Portrait Painting , 2013, BMVC.

[83] Ariel Shamir,et al. Style and abstraction in portrait sketching , 2013, ACM Trans. Graph..

[84] Paul L. Rosin,et al. Unpaired Portrait Drawing Generation via Asymmetric Cycle Mapping , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[85] Monson H. Hayes,et al. Face Recognition Using An Embedded HMM , 1999 .

[86] Antonio M. López,et al. The SYNTHIA Dataset: A Large Collection of Synthetic Images for Semantic Segmentation of Urban Scenes , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[87] Xuelong Li,et al. Face Sketch Synthesis by Multidomain Adversarial Learning , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[88] Hao Dong,et al. Unpaired Image-to-Image Translation using Adversarial Consistency Loss , 2020, ECCV.

[89] James Hays,et al. Localizing and Orienting Street Views Using Overhead Imagery , 2016, ECCV.

[90] Yann LeCun,et al. Generalization and network design strategies , 1989 .

[91] Erik Learned-Miller,et al. FDDB: A benchmark for face detection in unconstrained settings , 2010 .

[92] Xiaogang Wang,et al. Deep Learning Face Attributes in the Wild , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[93] Abhinav Gupta,et al. Transitive Invariance for Self-Supervised Visual Representation Learning , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[94] Qingming Huang,et al. Toward Realistic Face Photo–Sketch Synthesis via Composition-Aided GANs , 2017, IEEE Transactions on Cybernetics.

[95] Benjamin Z. Yao,et al. Introduction to a Large-Scale General Purpose Ground Truth Database: Methodology, Annotation Tool and Benchmarks , 2007, EMMCVPR.

[96] Xinbo Gao,et al. Deep Graphical Feature Learning for Face Sketch Synthesis , 2017, IJCAI.

[97] Shiguang Shan,et al. Local Regression Model for Automatic Face Sketch Generation , 2011, 2011 Sixth International Conference on Image and Graphics.

[98] Andrea Vedaldi,et al. Texture Networks: Feed-forward Synthesis of Textures and Stylized Images , 2016, ICML.

[99] Vishal M. Patel,et al. High-Quality Facial Photo-Sketch Synthesis Using Multi-Adversarial Networks , 2017, 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018).

[100] J. Nishino,et al. Linguistic knowledge acquisition system on facial caricature drawing system , 1999, FUZZ-IEEE'99. 1999 IEEE International Fuzzy Systems. Conference Proceedings (Cat. No.99CH36315).

[101] Michael Felsberg,et al. DoodleFormer: Creative Sketch Drawing with Transformers , 2021, ArXiv.

[102] Lu Yuan,et al. Cross-Domain Correspondence Learning for Exemplar-Based Image Translation , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[103] Bin Fang,et al. Reusing Discriminators for Encoding: Towards Unsupervised Image-to-Image Translation , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[104] Taghi M. Khoshgoftaar,et al. A survey on Image Data Augmentation for Deep Learning , 2019, Journal of Big Data.

[105] Michael J. Black,et al. A Naturalistic Open Source Movie for Optical Flow Evaluation , 2012, ECCV.

[106] Lin Gao,et al. DeepFaceDrawing: deep generation of face images from sketches , 2020, ACM Trans. Graph..

[107] Fisher Yu,et al. Scribbler: Controlling Deep Image Synthesis with Sketch and Color , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[108] Luc Van Gool,et al. A Benchmark Dataset and Evaluation Methodology for Video Object Segmentation , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[109] Marwan Mattar,et al. Labeled Faces in the Wild: A Database forStudying Face Recognition in Unconstrained Environments , 2008 .

[110] Xin Zheng,et al. A Survey of Deep Facial Attribute Analysis , 2018, International Journal of Computer Vision.

[111] Chunxiao Liu,et al. TSIT: A Simple and Versatile Framework for Image-to-Image Translation , 2020, ECCV.

[112] Yunsong Li,et al. Face Sketch Synthesis From Coarse to Fine , 2018, AAAI.

[113] S. Avidan,et al. Seam carving for content-aware image resizing , 2007, SIGGRAPH 2007.

[114] Dani Lischinski,et al. CrossNet: Latent Cross-Consistency for Unpaired Image Translation , 2020, 2020 IEEE Winter Conference on Applications of Computer Vision (WACV).

[115] Douglas Eck,et al. A Neural Representation of Sketch Drawings , 2017, ICLR.

[116] Shuicheng Yan,et al. Neural Style Transfer via Meta Networks , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[117] Yue Gao,et al. Robust Face Sketch Synthesis via Generative Adversarial Fusion of Priors and Parametric Sigmoid , 2018, IJCAI.

[118] Mohammad Nayeem Teli,et al. Comparison and Analysis of Image-to-Image Generative Adversarial Networks: A Survey , 2021, ArXiv.

[119] O. Chapelle,et al. Semi-Supervised Learning (Chapelle, O. et al., Eds.; 2006) [Book reviews] , 2009, IEEE Transactions on Neural Networks.

[120] Xiaogang Wang,et al. DeepFashion: Powering Robust Clothes Recognition and Retrieval with Rich Annotations , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[121] Minjae Kim,et al. U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation , 2019, ICLR.

[122] Eero P. Simoncelli,et al. Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[123] Denise C. Park,et al. A lifespan database of adult facial stimuli , 2004, Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc.

[124] Aleix M. Martínez,et al. EmotioNet: An Accurate, Real-Time Algorithm for the Automatic Annotation of a Million Facial Expressions in the Wild , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[125] Zhe L. Lin,et al. Exemplar-Based Face Parsing , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[126] Taesung Park,et al. Semantic Image Synthesis With Spatially-Adaptive Normalization , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[127] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[128] Vladlen Koltun,et al. Playing for Data: Ground Truth from Computer Games , 2016, ECCV.

[129] Xiaogang Wang,et al. Face photo recognition using sketch , 2002, Proceedings. International Conference on Image Processing.

[130] Yung-Yu Chuang,et al. Domain-Specific Mappings for Generative Adversarial Style Transfer , 2020, ECCV.

[131] Andrew Zisserman,et al. Automated Flower Classification over a Large Number of Classes , 2008, 2008 Sixth Indian Conference on Computer Vision, Graphics & Image Processing.

[132] Hong Chen,et al. A Hierarchical Compositional Model for Face Representation and Sketching , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[133] Daniel Cohen-Or,et al. Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[134] Bolei Zhou,et al. Generative Hierarchical Features from Synthesizing Images , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[135] Nicu Sebe,et al. Multi-Channel Attention Selection GAN With Cascaded Semantic Guidance for Cross-View Image Translation , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[136] Liang Chang,et al. Face Sketch Synthesis via Multivariate Output Regression , 2011, HCI.

[137] Chuan Li,et al. Combining Markov Random Fields and Convolutional Neural Networks for Image Synthesis , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[138] Alexander Kolesnikov,et al. MLP-Mixer: An all-MLP Architecture for Vision , 2021, NeurIPS.

[139] Thomas S. Huang,et al. Interactive Facial Feature Localization , 2012, ECCV.

[140] Hailin Jin,et al. BAM! The Behance Artistic Media Dataset for Recognition Beyond Photography , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[141] Xiaogang Wang,et al. Lighting and Pose Robust Face Sketch Synthesis , 2010, ECCV.

[142] Feng Liu,et al. Sketch Me That Shoe , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[143] C. Lawrence Zitnick,et al. Creative Sketch Generation , 2020, ICLR.

[144] Ali Borji,et al. Ego2Top: Matching Viewers in Egocentric and Top-View Videos , 2016, ECCV.

[145] Nenghai Yu,et al. StyleBank: An Explicit Representation for Neural Image Style Transfer , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[146] Eirikur Agustsson,et al. NTIRE 2017 Challenge on Single Image Super-Resolution: Dataset and Study , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[147] Xiaogang Wang,et al. Face sketch recognition , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[148] Ming Lu,et al. Decoder Network over Lightweight Reconstructed Feature for Fast Semantic Style Transfer , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[149] Nick Barnes,et al. Conditional Generative Modeling via Learning the Latent Space , 2020, ICLR.

[150] Sven J. Dickinson,et al. 3D Object Detection and Viewpoint Estimation with a Deformable 3D Cuboid Model , 2012, NIPS.

[151] Bohyung Han,et al. Visual Reference Resolution using Attention Memory for Visual Dialog , 2017, NIPS.

[152] Xinbo Gao,et al. Drafting and Revision: Laplacian Pyramid Network for Fast High-Quality Artistic Style Transfer , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[153] Hao Zhou,et al. Markov Weight Fields for face sketch synthesis , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[154] Jaegul Choo,et al. Image-To-Image Translation via Group-Wise Deep Whitening-And-Coloring Transformation , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[155] Lei Zhang,et al. End-to-End Photo-Sketch Generation via Fully Convolutional Representation Learning , 2015, ICMR.

[156] Leon A. Gatys,et al. Image Style Transfer Using Convolutional Neural Networks , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[157] Andrew Zisserman,et al. Deep Face Recognition , 2015, BMVC.

[158] Roberto Pinto,et al. Managing supplier delivery reliability risk under limited information: Foundations for a human-in-the-loop DSS , 2013, Decis. Support Syst..

[159] Hyunsoo Kim,et al. Learning to Discover Cross-Domain Relations with Generative Adversarial Networks , 2017, ICML.

[160] Changsheng Xu,et al. StyTr^2: Unbiased Image Style Transfer with Transformers , 2021, ArXiv.

[161] Brian B. Avants,et al. The Multimodal Brain Tumor Image Segmentation Benchmark (BRATS) , 2015, IEEE Transactions on Medical Imaging.

[162] Hyeonjoon Moon,et al. The FERET evaluation methodology for face-recognition algorithms , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[163] Wenbin Cai,et al. Separating Style and Content for Generalized Style Transfer , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[164] Jan Kautz,et al. Learning Linear Transformations for Fast Image and Video Style Transfer , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[165] Scott Workman,et al. Wide-Area Image Geolocalization with Aerial Reference Imagery , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[166] LinLin Shen,et al. Influence of Wavelet Frequency and Orientation in an SVM-Based Parallel Gabor PCA Face Verification System , 2007, IDEAL.

[167] Nenghai Yu,et al. Stereoscopic Neural Style Transfer , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[168] Shiguang Shan,et al. Stacked Progressive Auto-Encoders (SPAE) for Face Recognition Across Poses , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[169] Simon Osindero,et al. Conditional Generative Adversarial Nets , 2014, ArXiv.

[170] Qiang Ji,et al. Facial Feature Tracking Under Varying Facial Expressions and Face Poses Based on Restricted Boltzmann Machines , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[171] Xinbo Gao,et al. Dual-Transfer Face Sketch–Photo Synthesis , 2019, IEEE Transactions on Image Processing.

[172] Kristen Grauman,et al. Fine-Grained Visual Comparisons with Local Learning , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[173] Xinbo Gao,et al. Robust Face Sketch Style Synthesis , 2016, IEEE Transactions on Image Processing.

[174] Mark W. Schmidt,et al. Fast Patch-based Style Transfer of Arbitrary Style , 2016, ArXiv.

[175] Sheng You,et al. PI-REC: Progressive Image Reconstruction Network With Edge and Color Domain , 2019, ArXiv.

[176] Xiaogang Wang,et al. Coupled information-theoretic encoding for face photo-sketch recognition , 2011, CVPR 2011.

[177] Pietro Perona,et al. The Caltech-UCSD Birds-200-2011 Dataset , 2011 .

[178] Xiaogang Wang,et al. Avatar-Net: Multi-scale Zero-Shot Style Transfer by Feature Decoration , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[179] C. V. Jawahar,et al. Cats and dogs , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[180] Jianguo Xiao,et al. A Common Framework for Interactive Texture Transfer , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[181] Yinda Zhang,et al. LSUN: Construction of a Large-scale Image Dataset using Deep Learning with Humans in the Loop , 2015, ArXiv.

[182] H. Koshimizu,et al. Facial caricaturing with motion caricaturing in PICASSO system , 1997, Proceedings of IEEE/ASME International Conference on Advanced Intelligent Mechatronics.

[183] Ming-Hsuan Yang,et al. Universal Style Transfer via Feature Transforms , 2017, NIPS.

[184] Xiaogang Wang,et al. Face sketch synthesis and recognition , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[185] Jung-Woo Ha,et al. StarGAN v2: Diverse Image Synthesis for Multiple Domains , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[186] Shree K. Nayar,et al. Attribute and simile classifiers for face verification , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[187] Nasser M. Nasrabadi,et al. Unsupervised Facial Geometry Learning for Sketch to Photo Synthesis , 2018, 2018 International Conference of the Biometrics Special Interest Group (BIOSIG).

[188] Andreas Dengel,et al. Real-time Analysis and Visualization of the YFCC100m Dataset , 2015, MMCommons '15.

[189] Bolei Zhou,et al. Scene Parsing through ADE20K Dataset , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[190] Zhuowen Tu,et al. ViTGAN: Training GANs with Vision Transformers , 2021, ICLR.

[191] Ming-Hsuan Yang,et al. Real-Time Exemplar-Based Face Sketch Synthesis , 2014, ECCV.

[192] Jie Li,et al. Knowledge Distillation for Face Photo–Sketch Synthesis , 2020, IEEE Transactions on Neural Networks and Learning Systems.

[193] Lior Wolf,et al. One-Sided Unsupervised Domain Mapping , 2017, NIPS.

[194] Yu-Chiang Frank Wang,et al. Coupled Dictionary and Feature Space Learning with Applications to Cross-Domain Image Synthesis and Recognition , 2013, 2013 IEEE International Conference on Computer Vision.

[195] Shilei Wen,et al. Dynamic Instance Normalization for Arbitrary Style Transfer , 2019, AAAI.

[196] Xiaoou Tang,et al. Image Super-Resolution Using Deep Convolutional Networks , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[197] Tao Xiang,et al. Learning to Sketch with Shortcut Cycle Consistency , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[198] Luca Antiga,et al. Automatic differentiation in PyTorch , 2017 .

[199] Jaakko Lehtinen,et al. Few-Shot Unsupervised Image-to-Image Translation , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[200] Zhangyang Wang,et al. Deep Plastic Surgery: Robust and Controllable Image Editing with Human-Drawn Sketches , 2020, ECCV.

[201] Jung-Woo Ha,et al. StarGAN: Unified Generative Adversarial Networks for Multi-domain Image-to-Image Translation , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[202] Xinbo Gao,et al. Universal Face Photo-Sketch Style Transfer via Multiview Domain Translation , 2020, IEEE Transactions on Image Processing.

[203] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[204] Xin Wang,et al. Multimodal Transfer: A Hierarchical Deep Convolutional Neural Network for Fast Artistic Style Transfer , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[205] Jiri Matas,et al. XM2VTSDB: The Extended M2VTS Database , 1999 .

[206] Alexei A. Efros,et al. Toward Multimodal Image-to-Image Translation , 2017, NIPS.

[207] Hongsheng Li,et al. DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[208] Silvio Savarese,et al. Beyond PASCAL: A benchmark for 3D object detection in the wild , 2014, IEEE Winter Conference on Applications of Computer Vision.

[209] Sebastian Ramos,et al. The Cityscapes Dataset for Semantic Urban Scene Understanding , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[210] S T Roweis,et al. Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[211] Himanshu S. Bhatt,et al. Memetically Optimized MCWLD for Matching Sketches With Digital Face Images , 2012, IEEE Transactions on Information Forensics and Security.

[212] Thomas Vetter,et al. Skin Detail Analysis for Face Recognition , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[213] Xiaofeng Tao,et al. Transient attributes for high-level understanding and editing of outdoor scenes , 2014, ACM Trans. Graph..

[214] Leon A. Gatys,et al. A Neural Algorithm of Artistic Style , 2015, ArXiv.

[215] Peter Wonka,et al. SEAN: Image Synthesis With Semantic Region-Adaptive Normalization , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[216] D. Keeble,et al. The Significance of Hair for Face Recognition , 2012, PloS one.

[217] Alex Krizhevsky,et al. Learning Multiple Layers of Features from Tiny Images , 2009 .

[218] Gang Hua,et al. Visual attribute transfer through deep image analogy , 2017, ACM Trans. Graph..

[219] Shaogang Gong,et al. Free-Hand Sketch Synthesis with Deformable Stroke Models , 2016, International Journal of Computer Vision.

[220] Skyler T. Hawk,et al. Presentation and validation of the Radboud Faces Database , 2010 .

[221] Lihi Zelnik-Manor,et al. The Contextual Loss for Image Transformation with Non-Aligned Data , 2018, ECCV.

[222] Xuelong Li,et al. Face Sketch-Photo Synthesis under Multi-dictionary Sparse Representation Framework , 2011, 2011 Sixth International Conference on Image and Graphics.

[223] Jing Liao,et al. Arbitrary Style Transfer with Deep Feature Reshuffle , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[224] Weihong Deng,et al. Identity-aware CycleGAN for face photo-sketch synthesis and recognition , 2020, Pattern Recognit..

[225] Zunlei Feng,et al. Neural Style Transfer: A Review , 2017, IEEE Transactions on Visualization and Computer Graphics.

[226] Lior Wolf,et al. Unsupervised Cross-Domain Image Generation , 2016, ICLR.

[227] Yong-Jin Liu,et al. CartoonGAN: Generative Adversarial Networks for Photo Cartoonization , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[228] Zhenan Sun,et al. Multi-caption Text-to-Face Synthesis: Dataset and Algorithm , 2021, ACM Multimedia.

[229] Ran Yi,et al. APDrawingGAN: Generating Artistic Portrait Drawings From Face Photos With Hierarchical GANs , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[230] Stefan Winkler,et al. A data-driven approach to cleaning large face datasets , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[231] Harry Wechsler,et al. The FERET database and evaluation procedure for face-recognition algorithms , 1998, Image Vis. Comput..

[232] Eli Shechtman,et al. Im2Pencil: Controllable Pencil Illustration From Photographs , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[233] Rui Zhang,et al. Harmonic Unpaired Image-to-image Translation , 2019, ICLR.

[234] Jie Li,et al. Face Photo-Sketch Synthesis via Knowledge Transfer , 2019, IJCAI.

[235] Nanning Zheng,et al. Example-based facial sketch generation with non-parametric sampling , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[236] Paul L. Rosin,et al. Line Drawings for Face Portraits From Photos Using Global and Local Structure Based GANs , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[237] Thomas Brox,et al. U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[238] Hidefumi Kobatake,et al. Extraction of facial sketch image based on morphological processing , 1997, Proceedings of International Conference on Image Processing.

[239] Xuelong Li,et al. Multiple Representations-Based Face Sketch–Photo Synthesis , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[240] Trevor Darrell,et al. PANDA: Pose Aligned Networks for Deep Attribute Modeling , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[241] Marc Alexa,et al. How do humans sketch objects? , 2012, ACM Trans. Graph..

[242] Li Fei-Fei,et al. Perceptual Losses for Real-Time Style Transfer and Super-Resolution , 2016, ECCV.

[243] Ping Tan,et al. DualGAN: Unsupervised Dual Learning for Image-to-Image Translation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[244] Shiguang Shan,et al. Heterogeneous Face Attribute Estimation: A Deep Multi-Task Learning Approach , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[245] Michael J. Black,et al. Lessons and Insights from Creating a Synthetic Optical Flow Benchmark , 2012, ECCV Workshops.

[246] Ming-Hsuan Yang,et al. Diversified Texture Synthesis with Feed-Forward Networks , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[247] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.

[248] Jie Li,et al. Superpixel-Based Face Sketch–Photo Synthesis , 2017, IEEE Transactions on Circuits and Systems for Video Technology.

[249] Jinze Yu,et al. Learning to Cartoonize Using White-Box Cartoon Representations , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[250] Keiji Yanai,et al. Automatic Expansion of a Food Image Dataset Leveraging Existing Categories with Domain Adaptation , 2014, ECCV Workshops.

[251] Xinbo Gao,et al. Random sampling for fast face sketch synthesis , 2017, Pattern Recognit..

[252] Qin Huang,et al. SPG-Net: Segmentation Prediction and Guidance Network for Image Inpainting , 2018, BMVC.

[253] William T. Freeman,et al. Semantic Pyramid for Image Generation , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[254] Bao-Gang Hu,et al. Facial Image Attributes Transformation via Conditional Recycle Generative Adversarial Networks , 2018, Journal of Computer Science and Technology.

[255] Nicu Sebe,et al. AttentionGAN: Unpaired Image-to-Image Translation Using Attention-Guided Generative Adversarial Networks , 2019, IEEE Transactions on Neural Networks and Learning Systems.

[256] Zhe Gan,et al. Triangle Generative Adversarial Networks , 2017, NIPS.

[257] Jie Li,et al. Adaptive representation-based face sketch-photo synthesis , 2017, Neurocomputing.

[258] Alexei A. Efros,et al. Generative Visual Manipulation on the Natural Image Manifold , 2016, ECCV.

[259] Takayuki Fujiwara,et al. On KANSEI facial image processing for computerized facial caricaturing system PICASSO , 1999, IEEE SMC'99 Conference Proceedings. 1999 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.99CH37028).

[260] Chi-Keung Tang,et al. Image Generation from Sketch Constraint Using Contextual GAN , 2017, ECCV.

[261] Shengcai Liao,et al. Learning Face Representation from Scratch , 2014, ArXiv.

[262] Rama Chellappa,et al. HyperFace: A Deep Multi-Task Learning Framework for Face Detection, Landmark Localization, Pose Estimation, and Gender Recognition , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[263] Himanshu S. Bhatt,et al. On matching sketches with digital face images , 2010, 2010 Fourth IEEE International Conference on Biometrics: Theory, Applications and Systems (BTAS).

[264] Hatice Gunes,et al. SmileNet: Registration-Free Smiling Face Detection In The Wild , 2017, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).

[265] Ivan Laptev,et al. Is object localization for free? - Weakly-supervised learning with convolutional neural networks , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[266] Serge J. Belongie,et al. Arbitrary Style Transfer in Real-Time with Adaptive Instance Normalization , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[267] Tao Mei,et al. DA-GAN: Instance-Level Image Translation by Deep Attention Generative Adversarial Networks , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[268] Michael S. Bernstein,et al. ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[269] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[270] Björn Ommer,et al. A Style-Aware Content Loss for Real-time HD Style Transfer , 2018, ECCV.

[271] Xuelong Li,et al. Face Sketch–Photo Synthesis and Retrieval Using Sparse Representation , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[272] Carlos D. Castillo,et al. An All-In-One Convolutional Neural Network for Face Analysis , 2016, 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017).

[273] Sylvain Paris,et al. Deep Photo Style Transfer , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).