Paper information - Semantic Face Hallucination: Super-Resolving Very Low-Resolution Face Images with Supplementary Attributes

Given a tiny face image, existing face hallucination methods aim at super-resolving its high-resolution (HR) counterpart by learning a mapping from an exemplary dataset. Since a low-resolution (LR) input patch may correspond to many HR candidate patches, this ambiguity may lead to distorted HR facial details and wrong attributes such as gender reversal and rejuvenation. An LR input contains low-frequency facial components of its HR version while its residual face image, defined as the difference between the HR ground-truth and interpolated LR images, contains the missing high-frequency facial details. We demonstrate that supplementing residual images or feature maps with additional facial attribute information can significantly reduce the ambiguity in face super-resolution. To explore this idea, we develop an attribute-embedded upsampling network, which consists of an upsampling network and a discriminative network. The upsampling network is composed of an autoencoder with skip-connections, which incorporates facial attribute vectors into the residual features of LR inputs at the bottleneck of the autoencoder, and deconvolutional layers used for upsampling. The discriminative network is designed to examine whether super-resolved faces contain the desired attributes or not and then its loss is used for updating the upsampling network. In this manner, we can super-resolve tiny (16×16 pixels) unaligned face images with a large upscaling factor of 8× while reducing the uncertainty of one-to-many mappings remarkably. By conducting extensive evaluations on a large-scale dataset, we demonstrate that our method achieves superior face hallucination results and outperforms the state-of-the-art.
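The two quantities the abstract leans on, the residual face image and the attribute-augmented bottleneck features, can be made concrete with a minimal sketch. It rests on stated assumptions and is not the authors' implementation: nearest-neighbour upsampling stands in for the unspecified interpolation, channel-wise concatenation stands in for the unspecified way attribute vectors are incorporated, and the names `upsample_nearest`, `residual_image`, and `embed_attributes` are hypothetical.

```python
from typing import List

Image = List[List[float]]  # one grayscale channel, stored row by row


def upsample_nearest(lr: Image, factor: int) -> Image:
    """Enlarge an image by repeating every pixel `factor` times per axis.

    Assumption: nearest-neighbour is only a stand-in; the abstract does not
    say which interpolation kernel produces the interpolated LR image.
    """
    height, width = len(lr), len(lr[0])
    return [
        [lr[r // factor][c // factor] for c in range(width * factor)]
        for r in range(height * factor)
    ]


def residual_image(hr: Image, lr: Image, factor: int) -> Image:
    """Residual = HR ground truth minus the interpolated LR image."""
    up = upsample_nearest(lr, factor)
    if len(up) != len(hr) or len(up[0]) != len(hr[0]):
        raise ValueError("HR size must equal LR size times the factor")
    return [
        [h - u for h, u in zip(hr_row, up_row)]
        for hr_row, up_row in zip(hr, up)
    ]


def embed_attributes(features: List[Image], attributes: List[float]) -> List[Image]:
    """Append one constant feature map per attribute value.

    Assumption: channel-wise concatenation is one common conditioning
    scheme; the abstract only says attributes are incorporated at the
    bottleneck of the autoencoder.
    """
    height, width = len(features[0]), len(features[0][0])
    attribute_maps = [[[a] * width for _ in range(height)] for a in attributes]
    return features + attribute_maps


if __name__ == "__main__":
    factor = 8  # the upscaling factor named in the abstract
    lr = [[float(r + c) for c in range(16)] for r in range(16)]  # toy 16x16 input
    # Toy 128x128 "ground truth": the upsampled input plus a constant offset.
    hr = [[v + 0.5 for v in row] for row in upsample_nearest(lr, factor)]
    res = residual_image(hr, lr, factor)
    print(len(res), len(res[0]))  # 128 128

    bottleneck = [[[0.0] * 4 for _ in range(4)] for _ in range(2)]  # 2 toy channels
    conditioned = embed_attributes(bottleneck, [1.0, 0.0, 1.0])  # 3 toy attributes
    print(len(conditioned))  # 5
```

In this toy setup the residual is the constant offset added to the target, which illustrates why the residual carries exactly the detail that interpolation cannot recover. The attribute maps simply travel alongside the learned features, so later layers can condition on them at every spatial position.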
