论文信息 - EnlightenGAN: Deep Light Enhancement Without Paired Supervision

EnlightenGAN: Deep Light Enhancement Without Paired Supervision

Deep learning-based methods have achieved remarkable success in image restoration and enhancement, but are they still competitive when there is a lack of paired training data? As one such example, this paper explores the low-light image enhancement problem, where in practice it is extremely challenging to simultaneously take a low-light and a normal-light photo of the same visual scene. We propose a highly effective unsupervised generative adversarial network, dubbed EnlightenGAN, that can be trained without low/normal-light image pairs, yet proves to generalize very well on various real-world test images. Instead of supervising the learning using ground truth data, we propose to regularize the unpaired training using the information extracted from the input itself, and benchmark a series of innovations for the low-light image enhancement problem, including a global-local discriminator structure, a self-regularized perceptual loss fusion, and the attention mechanism. Through extensive experiments, our proposed approach outperforms recent methods under a variety of metrics in terms of visual quality and subjective user study. Thanks to the great flexibility brought by unpaired training, EnlightenGAN is demonstrated to be easily adaptable to enhancing real-world images from various domains. Our codes and pre-trained models are available at: https://github.com/VITA-Group/EnlightenGAN.

[1] Xianming Liu,et al. When Image Denoising Meets High-Level Vision Tasks: A Deep Learning Approach , 2017, IJCAI.

[2] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.

[3] Jiaying Liu,et al. UG2 Track 2: A Collective Benchmark Effort for Evaluating and Advancing Image Understanding in Poor Visibility Environments , 2019, CVPR 2019.

[4] Chul Lee,et al. Contrast enhancement based on layered difference representation , 2012, 2012 19th IEEE International Conference on Image Processing.

[5] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[6] Thomas Brox,et al. U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[7] Zia-ur Rahman,et al. A multiscale retinex for bridging the gap between color images and the human observation of scenes , 1997, IEEE Trans. Image Process..

[8] Lei Zhang,et al. Beyond a Gaussian Denoiser: Residual Learning of Deep CNN for Image Denoising , 2016, IEEE Transactions on Image Processing.

[9] Jiebo Luo,et al. Towards Perceptual Image Dehazing by Physics-Based Disentanglement and Adversarial Training , 2018, AAAI.

[10] Alan C. Bovik,et al. Making a “Completely Blind” Image Quality Analyzer , 2013, IEEE Signal Processing Letters.

[11] Xiao-Ping Zhang,et al. A Weighted Variational Model for Simultaneous Reflectance and Illumination Estimation , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12] Wenhan Yang,et al. Attentive Generative Adversarial Network for Raindrop Removal from A Single Image , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[13] Wen-Huang Cheng,et al. Joint Enhancement and Denoising Method via Sequential Decomposition , 2018, 2018 IEEE International Symposium on Circuits and Systems (ISCAS).

[14] Andrea Vedaldi,et al. Improved Texture Networks: Maximizing Quality and Diversity in Feed-Forward Stylization and Texture Synthesis , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15] R. A. Bradley,et al. RANK ANALYSIS OF INCOMPLETE BLOCK DESIGNS THE METHOD OF PAIRED COMPARISONS , 1952 .

[16] Chen Wei,et al. Deep Retinex Decomposition for Low-Light Enhancement , 2018, BMVC.

[17] Taesung Park,et al. CyCADA: Cycle-Consistent Adversarial Domain Adaptation , 2017, ICML.

[18] Hai-Miao Hu,et al. Naturalness Preserved Enhancement Algorithm for Non-Uniform Illumination Images , 2013, IEEE Transactions on Image Processing.

[19] Jizheng Xu,et al. AOD-Net: All-in-One Dehazing Network , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[20] Siyuan Liu,et al. Unsupervised Image Super-Resolution Using Cycle-in-Cycle Generative Adversarial Networks , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[21] Thomas S. Huang,et al. Generative Image Inpainting with Contextual Attention , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[22] Kai Zeng,et al. Perceptual Quality Assessment for Multi-Exposure Image Fusion , 2015, IEEE Transactions on Image Processing.

[23] John D. Austin,et al. Adaptive histogram equalization and its variations , 1987 .

[24] Raymond Y. K. Lau,et al. Least Squares Generative Adversarial Networks , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[25] Walter J. Scheirer,et al. PsyPhy: A Psychophysics Driven Evaluation Framework for Visual Recognition , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26] Chee Seng Chan,et al. Getting to Know Low-light Images with The Exclusively Dark Dataset , 2018, Comput. Vis. Image Underst..

[27] Sunil Kumar,et al. Unsupervised Class-Specific Deblurring , 2018, ECCV.

[28] Alexei A. Efros,et al. Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[29] Jiri Matas,et al. DeblurGAN: Blind Motion Deblurring Using Conditional Adversarial Networks , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[30] Zhangyang Wang,et al. Deep Plastic Surgery: Robust and Controllable Image Editing with Human-Drawn Sketches , 2020, ECCV.

[31] Jung-Woo Ha,et al. StarGAN: Unified Generative Adversarial Networks for Multi-domain Image-to-Image Translation , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[32] Li Fei-Fei,et al. Perceptual Losses for Real-Time Style Transfer and Super-Resolution , 2016, ECCV.

[33] Harshad Rai,et al. Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks , 2018 .

[34] Shiyu Chang,et al. AutoGAN: Neural Architecture Search for Generative Adversarial Networks , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[35] Lei Zhang,et al. Learning a Deep Single Image Contrast Enhancer from Multi-Exposure Images , 2018, IEEE Transactions on Image Processing.

[36] Christian Ledig,et al. Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[37] Xiaochun Cao,et al. Single Image Deraining: A Comprehensive Benchmark Analysis , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[38] Ning Xu,et al. Controllable Artistic Text Style Transfer via Shape-Matching GAN , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[39] Chi-Keung Tang,et al. Deep High Dynamic Range Imaging with Large Foreground Motions , 2017, ECCV.

[40] Jan Kautz,et al. Unsupervised Image-to-Image Translation Networks , 2017, NIPS.

[41] Alexia Jolicoeur-Martineau,et al. The relativistic discriminator: a key element missing from standard GAN , 2018, ICLR.

[42] Kyoung Mu Lee,et al. Accurate Image Super-Resolution Using Very Deep Convolutional Networks , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[43] Yu Li,et al. LIME: Low-Light Image Enhancement via Illumination Map Estimation , 2017, IEEE Transactions on Image Processing.

[44] E. Land. The retinex theory of color vision. , 1977, Scientific American.

[45] Dan Feng,et al. Benchmarking Single-Image Dehazing and Beyond , 2017, IEEE Transactions on Image Processing.

[46] Jia Xu,et al. Learning to See in the Dark , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[47] Yi Wang,et al. Scale-Recurrent Network for Deep Image Deblurring , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[48] Soumik Sarkar,et al. LLNet: A deep autoencoder approach to natural low-light image enhancement , 2015, Pattern Recognit..

[49] Giulia Boato,et al. RAISE: a raw images dataset for digital image forensics , 2015, MMSys.

[50] Wei Zhou,et al. Unsupervised Single Image Deraining with Self-Supervised Constraints , 2018, 2019 IEEE International Conference on Image Processing (ICIP).

[51] Jonathan T. Barron,et al. Deep bilateral learning for real-time image enhancement , 2017, ACM Trans. Graph..

[52] Yung-Yu Chuang,et al. Deep Photo Enhancer: Unpaired Learning for Image Enhancement from Photographs with GANs , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[53] Xiaoyan Sun,et al. Structure-Revealing Low-Light Image Enhancement Via Robust Retinex Model , 2018, IEEE Transactions on Image Processing.

[54] Zhangyang Wang,et al. DeblurGAN-v2: Deblurring (Orders-of-Magnitude) Faster and Better , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[55] Trevor Darrell,et al. BDD100K: A Diverse Driving Video Database with Scalable Annotation Tooling , 2018, ArXiv.

[56] Jinhui Tang,et al. Single Image Dehazing via Conditional Generative Adversarial Network , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[57] Yang Li,et al. Improved Techniques for Learning to Dehaze and Beyond: A Collective Study , 2018, ArXiv.

[58] Jan Kautz,et al. Multimodal Unsupervised Image-to-Image Translation , 2018, ECCV.

[59] Chen Hong,et al. Advancing Image Understanding in Poor Visibility Environments: A Collective Benchmark Study , 2020, IEEE Transactions on Image Processing.

[60] Ravi Ramamoorthi,et al. Deep high dynamic range imaging of dynamic scenes , 2017, ACM Trans. Graph..