论文信息 - Face Hallucination Using Cascaded Super-Resolution and Identity Priors

Face Hallucination Using Cascaded Super-Resolution and Identity Priors

In this paper we address the problem of hallucinating high-resolution facial images from low-resolution inputs at high magnification factors. We approach this task with convolutional neural networks (CNNs) and propose a novel (deep) face hallucination model that incorporates identity priors into the learning procedure. The model consists of two main parts: i) a cascaded super-resolution network that upscales the low-resolution facial images, and ii) an ensemble of face recognition models that act as identity priors for the super-resolution network during training. Different from most competing super-resolution techniques that rely on a single model for upscaling (even with large magnification factors), our network uses a cascade of multiple SR models that progressively upscale the low-resolution images using steps of $2\times $ . This characteristic allows us to apply supervision signals (target appearances) at different resolutions and incorporate identity constraints at multiple-scales. The proposed C-SRIP model (Cascaded Super Resolution with Identity Priors) is able to upscale (tiny) low-resolution images captured in unconstrained conditions and produce visually convincing results for diverse low-resolution inputs. We rigorously evaluate the proposed model on the Labeled Faces in the Wild (LFW), Helen and CelebA datasets and report superior performance compared to the existing state-of-the-art.

[1] Jan Kautz,et al. Loss Functions for Image Restoration With Neural Networks , 2017, IEEE Transactions on Computational Imaging.

[2] Ling Shao,et al. Simultaneous Super-Resolution and Cross-Modality Synthesis of 3D Medical Images Using Weakly-Supervised Joint Convolutional Sparse Coding , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[3] Takeo Kanade,et al. Limits on super-resolution and how to break them , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[4] Zhou Wang,et al. Multiscale structural similarity for image quality assessment , 2003, The Thrity-Seventh Asilomar Conference on Signals, Systems & Computers, 2003.

[5] Kyoung Mu Lee,et al. Enhanced Deep Residual Networks for Single Image Super-Resolution , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[6] Yin Zhang,et al. A Fast Algorithm for Image Deblurring with Total Variation Regularization , 2007 .

[7] Thomas B. Moeslund,et al. Super-resolution: a comprehensive survey , 2014, Machine Vision and Applications.

[8] Xiaoming Liu,et al. Pose-Invariant Face Alignment with a Single CNN , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[9] Yücel Altunbasak,et al. Eigenface-domain super-resolution for face recognition , 2003, IEEE Trans. Image Process..

[10] Jordi Salvador,et al. Naive Bayes Super-Resolution Forest , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[11] A. Bovik. A VISUAL INFORMATION FIDELITY APPROACH TO VIDEO QUALITY ASSESSMENT , 2005 .

[12] Stefanos Zafeiriou,et al. 300 Faces in-the-Wild Challenge: The First Facial Landmark Localization Challenge , 2013, 2013 IEEE International Conference on Computer Vision Workshops.

[13] L. Rudin,et al. Nonlinear total variation based noise removal algorithms , 1992 .

[14] Xin Yu,et al. Ultra-Resolving Face Images by Discriminative Generative Networks , 2016, ECCV.

[15] Klemen Grm,et al. Strengths and weaknesses of deep learning models for face recognition against image degradations , 2017, IET Biom..

[16] Takeo Kanade,et al. Hallucinating faces , 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580).

[17] Josephine Sullivan,et al. One millisecond face alignment with an ensemble of regression trees , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[18] Shengcai Liao,et al. Learning Face Representation from Scratch , 2014, ArXiv.

[19] Jian Yang,et al. FSRNet: End-to-End Learning Face Super-Resolution with Facial Priors , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[20] Xin Yu,et al. Hallucinating Very Low-Resolution Unaligned and Noisy Face Images by Transformative Discriminative Autoencoders , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[21] Christos-Savvas Bouganis,et al. Robust multi-image based blind face hallucination , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[22] Bernhard Schölkopf,et al. EnhanceNet: Single Image Super-Resolution Through Automated Texture Synthesis , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[23] Dahua Lin,et al. Neighbor combination and transformation for hallucinating faces , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[24] Kai-Kuang Ma,et al. A survey on super-resolution imaging , 2011, Signal Image Video Process..

[25] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[26] Kyoung Mu Lee,et al. Accurate Image Super-Resolution Using Very Deep Convolutional Networks , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27] Andrew Zisserman,et al. Deep Face Recognition , 2015, BMVC.

[28] Xin Yu,et al. Imagining the Unimaginable Faces by Deconvolutional Networks , 2018, IEEE Transactions on Image Processing.

[29] Christian Ledig,et al. Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30] Yiying Tong,et al. Adaptive 3D Face Reconstruction from Unconstrained Photo Collections , 2016, CVPR.

[31] Ming-Hsuan Yang,et al. Joint Face Hallucination and Deblurring via Structure Generation and Detail Enhancement , 2018, International Journal of Computer Vision.

[32] Yochai Blau,et al. The Perception-Distortion Tradeoff , 2017, CVPR.

[33] Jing Yang,et al. To learn image super-resolution, use a GAN to learn how to do image degradation first , 2018, ECCV.

[34] Harry Shum,et al. Face Hallucination: Theory and Practice , 2007, International Journal of Computer Vision.

[35] Xin Yu,et al. Face Hallucination with Tiny Unaligned Images by Transformative Discriminative Neural Networks , 2017, AAAI.

[36] Daniel Rueckert,et al. Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[37] Shiguang Shan,et al. Aligning Coupled Manifolds for Face Hallucination , 2009, IEEE Signal Processing Letters.

[38] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[39] M. Turk,et al. Eigenfaces for Recognition , 1991, Journal of Cognitive Neuroscience.

[40] Xiaoou Tang,et al. Learning a Deep Convolutional Network for Image Super-Resolution , 2014, ECCV.

[41] Georgios Tzimiropoulos,et al. Super-FAN: Integrated Facial Landmark Localization and Super-Resolution of Real-World Low Resolution Faces in Arbitrary Poses with GANs , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[42] Ming-Hsuan Yang,et al. Generative Face Completion , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[43] Jan Kautz,et al. Deep Semantic Face Deblurring , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[44] Xin Yu,et al. Face Super-Resolution Guided by Facial Component Heatmaps , 2018, ECCV.

[45] Wei Liu,et al. Super-Identity Convolutional Neural Network for Face Hallucination , 2018, ECCV.

[46] Pablo H. Hennings-Yeomans,et al. Simultaneous super-resolution and feature extraction for recognition of low-resolution faces , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[47] Xuelong Li,et al. A Comprehensive Survey to Face Hallucination , 2013, International Journal of Computer Vision.

[48] Deqing Sun,et al. Learning to Super-Resolve Blurry Face and Text Images , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[49] Reuben A. Farrugia,et al. Face Hallucination Using Linear Models of Coupled Sparse Support , 2015, IEEE Transactions on Image Processing.

[50] Jiaya Jia,et al. High-quality motion deblurring from a single image , 2008, ACM Trans. Graph..

[51] Chih-Yuan Yang,et al. Structured Face Hallucination , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[52] Simon Dobrisek,et al. Face Hallucination Revisited: An Exploratory Study on Dataset Bias , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[53] Sridha Sridharan,et al. Super-resolution for biometrics: A comprehensive survey , 2018, Pattern Recognit..

[54] Xin Yu,et al. Super-Resolving Very Low-Resolution Face Images with Supplementary Attributes , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[55] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[56] Tong Tong,et al. Image Super-Resolution Using Dense Skip Connections , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[57] Thomas S. Huang,et al. Image Super-Resolution Via Sparse Representation , 2010, IEEE Transactions on Image Processing.

[58] Thomas S. Huang,et al. Interactive Facial Feature Localization , 2012, ECCV.

[59] Richard Szeliski,et al. Image Restoration by Matching Gradient Distributions , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[60] Xiaogang Wang,et al. Deep Learning Face Attributes in the Wild , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[61] Xiaoou Tang,et al. Deep Cascaded Bi-Network for Face Hallucination , 2016, ECCV.

[62] Liang Lin,et al. Attention-Aware Face Hallucination via Deep Reinforcement Learning , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[63] Yuning Jiang,et al. Learning Face Hallucination in the Wild , 2015, AAAI.

[64] Eero P. Simoncelli,et al. Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[65] Yiguang Chen,et al. Single-Image Super-Resolution Reconstruction via Learned Geometric Dictionaries and Clustered Sparse Coding , 2012, IEEE Transactions on Image Processing.

[66] Vassilis Anastassopoulos,et al. Super-resolution image reconstruction techniques: Trade-offs between the data-fidelity and regularization terms , 2012, Inf. Fusion.

[67] Shaogang Gong,et al. Generalized Face Super-Resolution , 2008, IEEE Transactions on Image Processing.

[68] Jian Yang,et al. Image Super-Resolution via Deep Recursive Residual Network , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[69] Mei Han,et al. Soft Edge Smoothness Prior for Alpha Channel Super Resolution , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[70] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.

[71] Marwan Mattar,et al. Labeled Faces in the Wild: A Database forStudying Face Recognition in Unconstrained Environments , 2008 .

[72] Luc Van Gool,et al. Anchored Neighborhood Regression for Fast Example-Based Super-Resolution , 2013, 2013 IEEE International Conference on Computer Vision.

[73] Takeo Kanade,et al. Geometric reasoning for single image structure recovery , 2009, CVPR.

[74] Forrest N. Iandola,et al. SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <1MB model size , 2016, ArXiv.

[75] Kyung-Ah Sohn,et al. Fast, Accurate, and, Lightweight Super-Resolution with Cascading Residual Network , 2018, ECCV.

[76] Narendra Ahuja,et al. Deep Laplacian Pyramid Networks for Fast and Accurate Super-Resolution , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[77] Tae Hyun Kim,et al. Deep Multi-scale Convolutional Neural Network for Dynamic Scene Deblurring , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[78] Li Fei-Fei,et al. Perceptual Losses for Real-Time Style Transfer and Super-Resolution , 2016, ECCV.

[79] Xuelong Li,et al. Single image super resolution with high resolution dictionary , 2011, 2011 18th IEEE International Conference on Image Processing.