论文信息 - A Practical Contrastive Learning Framework for Single Image Super-Resolution

A Practical Contrastive Learning Framework for Single Image Super-Resolution

Contrastive learning has achieved remarkable success on various high-level tasks, but there are fewer methods proposed for low-level tasks. It is challenging to adopt vanilla contrastive learning technologies proposed for high-level visual tasks straight to low-level visual tasks since the acquired global visual representations are insufficient for low-level tasks requiring rich texture and context information. In this paper, we propose a novel contrastive learning framework for single image superresolution (SISR). We investigate the contrastive learningbased SISR from two perspectives: sample construction and feature embedding. The existing methods propose some naive sample construction approaches (e.g., considering the low-quality input as a negative sample and the ground truth as a positive sample) and they adopt a prior model (e.g., pre-trained VGG [48] model) to obtain the feature embedding instead of exploring a task-friendly one. To this end, we propose a practical contrastive learning framework for SISR that involves the generation of many informative positive and hard negative samples in frequency space. Instead of utilizing an additional pre-trained network, we design a simple but effective embedding network inherited from the discriminator network and can be iteratively optimized with the primary SR network making it task-generalizable. Finally, we conduct an extensive experimental evaluation of our method compared with benchmark methods and show remarkable gains of up to 0.21 dB over the current state-of-the-art approaches for SISR.

Jiayi Ma | Gang Wu | Xianming Liu | Junjun Jiang

[1] Peter Vajda,et al. Supplementary: Tackling the Ill-Posedness of Super-Resolution through Adaptive Target Generation , 2021 .

[2] Yu Wang,et al. Joint Contrastive Learning with Infinite Possibilities , 2020, NeurIPS.

[3] Cordelia Schmid,et al. What makes for good views for contrastive learning , 2020, NeurIPS.

[4] Radu Timofte,et al. Frequency Separation for Real-World Super-Resolution , 2019, 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW).

[5] Jian Yang,et al. MemNet: A Persistent Memory Network for Image Restoration , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[6] Lars Petersson,et al. Dual Contrastive Learning for Unsupervised Image-to-Image Translation , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[7] Ching-Yao Chuang,et al. Contrastive Learning with Hard Negative Samples , 2020, ArXiv.

[8] Wei Wu,et al. Feedback Network for Image Super-Resolution , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[9] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.

[10] Wei An,et al. Unsupervised Degradation Representation Learning for Blind Super-Resolution , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[11] Luc Van Gool,et al. SwinIR: Image Restoration Using Swin Transformer , 2021, 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW).

[12] Kihyuk Sohn,et al. Improved Deep Metric Learning with Multi-class N-pair Loss Objective , 2016, NIPS.

[13] Lars Petersson,et al. Single Underwater Image Restoration by Contrastive Learning , 2021, 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS.

[14] Ching-Yao Chuang,et al. Debiased Contrastive Learning , 2020, NeurIPS.

[15] Jie Tang,et al. Residual Feature Aggregation Network for Image Super-Resolution , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[16] Shu-Tao Xia,et al. Second-Order Attention Network for Single Image Super-Resolution , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[17] Yu Qiao,et al. ESRGAN: Enhanced Super-Resolution Generative Adversarial Networks , 2018, ECCV Workshops.

[18] Natalia Gimelshein,et al. PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[19] Xinlei Chen,et al. Exploring Simple Siamese Representation Learning , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[20] Yu Qiao,et al. RankSRGAN: Generative Adversarial Networks With Ranker for Image Super-Resolution , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[21] Yun Fu,et al. Residual Dense Network for Image Super-Resolution , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[22] Yann LeCun,et al. Dimensionality Reduction by Learning an Invariant Mapping , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[23] Kyoung Mu Lee,et al. Accurate Image Super-Resolution Using Very Deep Convolutional Networks , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24] Stella X. Yu,et al. Unsupervised Feature Learning via Non-parametric Instance Discrimination , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[25] Oriol Vinyals,et al. Representation Learning with Contrastive Predictive Coding , 2018, ArXiv.

[26] Jitendra Malik,et al. A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[27] Hongxia Jin,et al. Negative Data Augmentation , 2021, ICLR.

[28] Wangmeng Zuo,et al. Learning a Single Convolutional Super-Resolution Network for Multiple Degradations , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[29] Kui Jiang,et al. Unpaired Adversarial Learning for Single Image Deraining with Rain-Space Contrastive Constraints , 2021, ArXiv.

[30] Luc Van Gool,et al. NTIRE 2017 Challenge on Single Image Super-Resolution: Methods and Results , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[31] Xinbo Gao,et al. Lightweight Image Super-Resolution with Information Multi-distillation Network , 2019, ACM Multimedia.

[32] Lei Zhang,et al. SEED: Self-supervised Distillation For Visual Representation , 2021, ArXiv.

[33] Geoffrey E. Hinton,et al. A Simple Framework for Contrastive Learning of Visual Representations , 2020, ICML.

[34] James Philbin,et al. FaceNet: A unified embedding for face recognition and clustering , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[35] Li Fei-Fei,et al. Perceptual Losses for Real-Time Style Transfer and Super-Resolution , 2016, ECCV.

[36] Yuan Xie,et al. Towards Compact Single Image Super-Resolution via Contrastive Self-distillation , 2021, IJCAI.

[37] Kiyoharu Aizawa,et al. Sketch-based manga retrieval using manga109 dataset , 2015, Multimedia Tools and Applications.

[38] Xiaochun Cao,et al. Correction to: Single Image Super-Resolution via a Holistic Attention Network , 2020, ECCV.

[39] Ales Leonardis,et al. Residual Contrastive Learning for Joint Demosaicking and Denoising , 2021, ArXiv.

[40] Yonglong Tian,et al. Contrastive Representation Distillation , 2019, ICLR.

[41] Xiaoou Tang,et al. Accelerating the Super-Resolution Convolutional Neural Network , 2016, ECCV.

[42] Mario Fritz,et al. Dual Contrastive Loss and Attention for GANs , 2021, ArXiv.

[43] Kaiming He,et al. Improved Baselines with Momentum Contrastive Learning , 2020, ArXiv.

[44] Narendra Ahuja,et al. Single image super-resolution from transformed self-exemplars , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[45] Kyoung Mu Lee,et al. Enhanced Deep Residual Networks for Single Image Super-Resolution , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[46] Lizhuang Ma,et al. Contrastive Learning for Compact Single Image Dehazing , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[47] Jie Liu,et al. Residual Feature Distillation Network for Lightweight Image Super-Resolution , 2020, ECCV Workshops.

[48] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[49] Kate Saenko,et al. Fine-grained Angular Contrastive Learning with Coarse Labels , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[50] Eero P. Simoncelli,et al. Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[51] Xiaoou Tang,et al. Image Super-Resolution Using Deep Convolutional Networks , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[52] Radu Timofte,et al. 2018 PIRM Challenge on Perceptual Image Super-resolution , 2018, ArXiv.

[53] Gregory Shakhnarovich,et al. Deep Back-Projection Networks for Super-Resolution , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[54] Yun Fu,et al. Image Super-Resolution Using Very Deep Residual Channel Attention Networks , 2018, ECCV.

[55] Ce Liu,et al. Supervised Contrastive Learning , 2020, NeurIPS.

[56] Kaiming He,et al. Momentum Contrast for Unsupervised Visual Representation Learning , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[57] Narendra Ahuja,et al. Deep Laplacian Pyramid Networks for Fast and Accurate Super-Resolution , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[58] Wen Gao,et al. Pre-Trained Image Processing Transformer , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[59] Xiao Wang,et al. AdCo: Adversarial Contrast for Efficient Learning of Unsupervised Representations from Self-Trained Negative Adversaries , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[60] Alexei A. Efros,et al. The Unreasonable Effectiveness of Deep Features as a Perceptual Metric , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[61] Harshad Rai,et al. Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks , 2018 .

[62] Alexei A. Efros,et al. Contrastive Learning for Unpaired Image-to-Image Translation , 2020, ECCV.

[63] Christian Ledig,et al. Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[64] Shijian Lu,et al. Blind Image Super-Resolution via Contrastive Representation Learning , 2021, ArXiv.

[65] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[66] Michael Elad,et al. On Single Image Scale-Up Using Sparse-Representations , 2010, Curves and Surfaces.

[67] Aline Roumy,et al. Low-Complexity Single-Image Super-Resolution based on Nonnegative Neighbor Embedding , 2012, BMVC.

[68] Yoshua Bengio,et al. Learning deep representations by mutual information estimation and maximization , 2018, ICLR.

[69] Dezhong Peng,et al. Contrastive Clustering , 2021, AAAI.