Paper information: Conditional Sequential Modulation for Efficient Global Image Retouching

Photo retouching aims to enhance the aesthetic visual quality of images that suffer from photographic defects such as over- or under-exposure, poor contrast, and inharmonious saturation. In practice, photo retouching can be accomplished by a series of image processing operations. In this paper, we investigate some commonly used retouching operations and show mathematically that these pixel-independent operations can be approximated or formulated by multi-layer perceptrons (MLPs). Based on this analysis, we propose an extremely lightweight framework, the Conditional Sequential Retouching Network (CSRNet), for efficient global image retouching. CSRNet consists of a base network and a condition network. The base network acts like an MLP that processes each pixel independently, and the condition network extracts global features of the input image to generate a condition vector. To realize retouching operations, we modulate the intermediate features using Global Feature Modulation (GFM), whose parameters are derived from the condition vector. Because it relies on $1\times1$ convolutions, CSRNet contains fewer than 37k trainable parameters, which is orders of magnitude fewer than existing learning-based methods. Extensive experiments show that our method achieves state-of-the-art performance on the benchmark MIT-Adobe FiveK dataset, both quantitatively and qualitatively. Code is available at this https URL.
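The structure described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the layer widths, the condition dimension, the single-layer condition network, and the `1 + ...` form of the scale are all assumptions made for illustration, and the names `conv1x1`, `gfm`, `condition_vector`, `retouch`, and `init_params` are hypothetical. The sketch only shows the two ideas the abstract states: a $1\times1$ convolution is the same linear map applied to every pixel, and a global condition vector modulates the intermediate features with a per-channel scale and shift.

```python
import numpy as np

def conv1x1(x, w, b):
    """1x1 convolution: one linear map applied to every pixel independently.
    x: (H, W, C_in), w: (C_in, C_out), b: (C_out,)."""
    return x @ w + b

def gfm(x, scale, shift):
    """Global feature modulation, assumed here to be a per-channel
    scale and shift shared by all pixels."""
    return x * scale + shift

def condition_vector(img, w_cond, b_cond):
    """Hypothetical stand-in for the condition network:
    global average pooling followed by a single linear layer."""
    pooled = img.mean(axis=(0, 1))       # (3,) global image statistics
    return pooled @ w_cond + b_cond      # (cond_dim,)

def retouch(img, params):
    """Base network: a stack of 1x1 convolutions, each followed by GFM."""
    cond = condition_vector(img, params["w_cond"], params["b_cond"])
    x = img
    n_layers = len(params["layers"])
    for i, layer in enumerate(params["layers"]):
        x = conv1x1(x, layer["w"], layer["b"])
        # Modulation parameters are computed from the condition vector.
        # The "1 +" offset (identity scale by default) is an assumption.
        scale = 1.0 + cond @ layer["w_scale"]
        shift = cond @ layer["w_shift"]
        x = gfm(x, scale, shift)
        if i < n_layers - 1:
            x = np.maximum(x, 0.0)       # ReLU between layers
    return x

def init_params(rng, channels=(3, 8, 8, 3), cond_dim=4):
    """Random parameters with illustrative (assumed) sizes."""
    layers = []
    for c_in, c_out in zip(channels[:-1], channels[1:]):
        layers.append({
            "w": rng.normal(0.0, 0.1, (c_in, c_out)),
            "b": np.zeros(c_out),
            "w_scale": rng.normal(0.0, 0.1, (cond_dim, c_out)),
            "w_shift": rng.normal(0.0, 0.1, (cond_dim, c_out)),
        })
    return {
        "w_cond": rng.normal(0.0, 0.1, (channels[0], cond_dim)),
        "b_cond": np.zeros(cond_dim),
        "layers": layers,
    }

rng = np.random.default_rng(0)
params = init_params(rng)
img = rng.random((4, 5, 3))              # toy RGB image with values in [0, 1)
out = retouch(img, params)               # same spatial size, 3 output channels
```

In this sketch, shuffling the pixels of the input shuffles the output in the same way, because the only image-wide information enters through the pooled condition vector. That is the sense in which the operation is global: every pixel sees the same modulation, and no pixel depends on its neighbours.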
