Progressive face super-resolution with cascaded recurrent convolutional network

Abstract Progressive upsampling is beneficial for deep learning based large factor (e.g., 8 × ) super-resolution (SR) to improve network performance and reduce the difficulties of network training. The feedback mechanism is helpful in strengthening the representation power of deep networks since it can efficiently enlarge the receptive field. In this paper, we propose a progressive cascaded recurrent convolutional network, dubbed PCRCN, for large factor face SR (FSR). Specifically, a novel multi-stage cascaded convolutional neural network is developed to progressively obtain high magnification face images, where the first stage of network achieves an initial 2 × magnification image, and the following other stages, adopting the recurrent structure, sequentially generate the corresponding 4 × , 8 × and possibly larger factor SR images through multiple independent iterative modules. The deep features and parsing priors of face are extracted in parallel in each stage of network, and integrated to improve the deep representation ability of network. The training of the whole network is supervised in an end-to-end way by the weighted sum of multiple losses. Compared with other state-of-the-art methods, the experimental results show that the proposed method can achieve superior results in terms of both subjective and objective evaluations.

[1]  Narendra Ahuja,et al.  Deep Laplacian Pyramid Networks for Fast and Accurate Super-Resolution , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Narendra Ahuja,et al.  Fast and Accurate Image Super-Resolution with Deep Laplacian Pyramid Networks , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Jie Zhou,et al.  Deep Face Super-Resolution With Iterative Collaboration Between Attentive Recovery and Landmark Estimation , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Heng Tao Shen,et al.  Large Factor Image Super-Resolution With Cascaded Convolutional Neural Networks , 2021, IEEE Transactions on Multimedia.

[5]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[6]  Jie Li,et al.  Progressive Perception-Oriented Network for Single Image Super-Resolution , 2019, Inf. Sci..

[7]  Lei Zhang,et al.  An edge-guided image interpolation algorithm via directional filtering and data fusion , 2006, IEEE Transactions on Image Processing.

[8]  Klemen Grm,et al.  Face Hallucination Using Cascaded Super-Resolution and Identity Priors , 2018, IEEE Transactions on Image Processing.

[9]  Wei Wu,et al.  Feedback Network for Image Super-Resolution , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Kyoung Mu Lee,et al.  Deeply-Recursive Convolutional Network for Image Super-Resolution , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Huan Liu,et al.  Single image super-resolution via multi-scale residual channel attention network , 2019, Neurocomputing.

[12]  Liang Chen,et al.  MSFSR: A Multi-Stage Face Super-Resolution with Accurate Facial Representation via Enhanced Facial Boundaries , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[13]  Jia Deng,et al.  Stacked Hourglass Networks for Human Pose Estimation , 2016, ECCV.

[14]  Jian Yang,et al.  Image Super-Resolution via Deep Recursive Residual Network , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Hongjun Zhou,et al.  Facial Image Synthesis and Super-Resolution With Stacked Generative Adversarial Network , 2020, Neurocomputing.

[16]  Yun Fu,et al.  Image Super-Resolution Using Very Deep Residual Channel Attention Networks , 2018, ECCV.

[17]  Yu Qiao,et al.  Joint Face Detection and Alignment Using Multitask Cascaded Convolutional Networks , 2016, IEEE Signal Processing Letters.

[18]  Yifan Wang,et al.  A Fully Progressive Approach to Single-Image Super-Resolution , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[19]  Xiu-Shen Wei,et al.  Adversarial PoseNet: A Structure-Aware Convolutional Network for Human Pose Estimation , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[20]  Junjun Jiang,et al.  Global-local fusion network for face super-resolution , 2020, Neurocomputing.

[21]  Wangmeng Zuo,et al.  Learning a Single Convolutional Super-Resolution Network for Multiple Degradations , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[22]  Haibo Li,et al.  On potentials of regularized Wasserstein generative adversarial networks for realistic hallucination of tiny faces , 2019, Neurocomputing.

[23]  Junjun Jiang,et al.  Parsing Map Guided Multi-Scale Attention Network For Face Hallucination , 2020, ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[24]  Bernhard Schölkopf,et al.  EnhanceNet: Single Image Super-Resolution Through Automated Texture Synthesis , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[25]  Yun Fu,et al.  Residual Dense Network for Image Super-Resolution , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[26]  Lingyun Wu,et al.  MaskGAN: Towards Diverse and Interactive Facial Image Manipulation , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Daniel Rueckert,et al.  Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Chengyi Xiong,et al.  Attention augmented multi-scale network for single image super-resolution , 2021, Appl. Intell..

[29]  Xuelong Li,et al.  Single Image Super-Resolution With Non-Local Means and Steering Kernel Regression , 2012, IEEE Transactions on Image Processing.

[30]  Xiaoou Tang,et al.  Learning a Deep Convolutional Network for Image Super-Resolution , 2014, ECCV.

[31]  Jian Yang,et al.  FSRNet: End-to-End Learning Face Super-Resolution with Facial Priors , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[32]  Jiayi Ma,et al.  Multi-Temporal Ultra Dense Memory Network for Video Super-Resolution , 2020, IEEE Transactions on Circuits and Systems for Video Technology.

[33]  Steven C. H. Hoi,et al.  Deep Learning for Image Super-Resolution: A Survey , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Xin Yu,et al.  Face Super-Resolution Guided by Facial Component Heatmaps , 2018, ECCV.

[35]  Tieniu Tan,et al.  Wavelet-SRNet: A Wavelet-Based CNN for Multi-scale Face Super Resolution , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[36]  Thomas S. Huang,et al.  Image Super-Resolution via Dual-State Recurrent Networks , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[37]  Jian Yang,et al.  MemNet: A Persistent Memory Network for Image Restoration , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[38]  Thomas S. Huang,et al.  Deep Networks for Image Super-Resolution with Sparse Prior , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[39]  Xiaoou Tang,et al.  Deep Cascaded Bi-Network for Face Hallucination , 2016, ECCV.

[40]  Kyoung Mu Lee,et al.  Accurate Image Super-Resolution Using Very Deep Convolutional Networks , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  Xin Jin,et al.  Single image super-resolution with multi-level feature fusion recursive network , 2019, Neurocomputing.

[42]  Thomas S. Huang,et al.  Interactive Facial Feature Localization , 2012, ECCV.