Paper information - Can We See More? Joint Frontalization and Hallucination of Unaligned Tiny Faces

In popular TV programs (such as CSI), a very low-resolution face image of a person, who in many cases is not even looking at the camera, is digitally super-resolved to such a degree that the person's identity suddenly becomes visible and recognizable. Of course, we suspect that this is merely a cinematographic special effect and that such a magical transformation of a single image is not technically possible. Or is it? In this paper, we push the boundaries of super-resolving (or, more accurately, hallucinating) a tiny, non-frontal face image to understand how much of this is possible by leveraging large datasets and deep networks. To this end, we introduce a novel Transformative Adversarial Neural Network (TANN) that jointly frontalizes very low-resolution (i.e., 16 × 16 pixels) out-of-plane rotated face images, including profile views, and aggressively super-resolves them (8×), regardless of their original poses and without using any 3D information. TANN is composed of two components: a transformative upsampling network, which comprises encoding, spatial transformation, and deconvolutional layers, and a discriminative network, which forces the generated high-resolution frontal faces to lie on the same manifold as real frontal face images. We evaluate our method on a large set of synthesized non-frontal face images to assess its reconstruction performance. Extensive experiments demonstrate that TANN generates both qualitatively and quantitatively superior results, achieving an improvement of over 4 dB over the state of the art.
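The numbers in the abstract can be made concrete with a little arithmetic. The sketch below is illustrative only and is not the authors' implementation: it assumes that the 8× upsampling is realized by three stride-2 deconvolutional (transposed convolution) layers with a hypothetical kernel size of 4 and padding of 1, and it assumes that the reported 4 dB gain is measured in PSNR, which the abstract does not state explicitly. The function names are likewise made up for this example.

```python
def deconv_output_size(size, kernel=4, stride=2, padding=1, output_padding=0):
    """Spatial output size of a transposed convolution (standard formula).

    The kernel/stride/padding defaults are assumptions for illustration,
    not values taken from the paper.
    """
    return (size - 1) * stride - 2 * padding + kernel + output_padding


def upsampling_schedule(input_size=16, factor=8):
    """List the resolutions reached by stacking stride-2 deconvolutions."""
    sizes = [input_size]
    while sizes[-1] < input_size * factor:
        sizes.append(deconv_output_size(sizes[-1]))
    return sizes


def mse_ratio_for_db_gain(gain_db):
    """Factor by which the mean squared error shrinks for a given PSNR gain.

    PSNR = 10 * log10(MAX^2 / MSE), so a gain of g dB divides MSE by 10^(g/10).
    Treating the abstract's "4 dB" as a PSNR gain is an assumption.
    """
    return 10 ** (gain_db / 10)


print(upsampling_schedule())                 # [16, 32, 64, 128]
print(round(mse_ratio_for_db_gain(4.0), 2))  # 2.51
```

Under these assumptions, a 16 × 16 input reaches 128 × 128 after three doubling stages, and a 4 dB PSNR improvement corresponds to a mean squared error roughly 2.5 times smaller than that of the compared method.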
