Exposure-Structure Blending Network for High Dynamic Range Imaging of Dynamic Scenes

This paper presents a deep end-to-end network for high dynamic range (HDR) imaging of dynamic scenes with background and foreground motions. Generating an HDR image from a sequence of multi-exposure images is a challenging process when the images have misalignments by being taken in a dynamic situation. Hence, recent methods first align the multi-exposure images to the reference by using patch matching, optical flow, homography transformation, or attention module before the merging. In this paper, we propose a deep network that synthesizes the aligned images as a result of blending the information from multi-exposure images, because explicitly aligning photos with different exposures is inherently a difficult problem. Specifically, the proposed network generates under/over-exposure images that are structurally aligned to the reference, by blending all the information from the dynamic multi-exposure images. Our primary idea is that blending two images in the deep-feature-domain is effective for synthesizing multi-exposure images that are structurally aligned to the reference, resulting in better-aligned images than the pixel-domain blending or geometric transformation methods. Specifically, our alignment network consists of a two-way encoder for extracting features from two images separately, several convolution layers for blending deep features, and a decoder for constructing the aligned images. The proposed network is shown to generate the aligned images with a wide range of exposure differences very well and thus can be effectively used for the HDR imaging of dynamic scenes. Moreover, by adding a simple merging network after the alignment network and training the overall system end-to-end, we obtain a performance gain compared to the recent state-of-the-art methods.

[1]  Michael Elad,et al.  A Variational Framework for Retinex , 2002, IS&T/SPIE Electronic Imaging.

[2]  Diego Gutierrez,et al.  Dynamic range expansion based on image statistics , 2015, Multimedia Tools and Applications.

[3]  Fan Yang,et al.  Physiological inverse tone mapping based on retina response , 2013, The Visual Computer.

[4]  Gabriel Eilertsen,et al.  HDR image reconstruction from a single exposure using deep CNNs , 2017, ACM Trans. Graph..

[5]  Jun Hu,et al.  Exposure Stacks of Live Scenes with Hand-Held Cameras , 2012, ECCV.

[6]  Eli Shechtman,et al.  Robust patch-based hdr reconstruction of dynamic scenes , 2012, ACM Trans. Graph..

[7]  Alexei A. Efros,et al.  Fast bilateral filtering for the display of high-dynamic-range images , 2002 .

[8]  Nam Ik Cho,et al.  Generation of high dynamic range illumination from a single image for the enhancement of undesirably illuminated images , 2017, Multimedia Tools and Applications.

[9]  Li Fei-Fei,et al.  Perceptual Losses for Real-Time Style Transfer and Super-Resolution , 2016, ECCV.

[10]  Greg Ward,et al.  Fast, Robust Image Registration for Compositing High Dynamic Range Photographs from Hand-Held Exposures , 2003, J. Graphics, GPU, & Game Tools.

[11]  Zia-ur Rahman,et al.  Properties and performance of a center/surround retinex , 1997, IEEE Trans. Image Process..

[12]  Yanning Zhang,et al.  High dynamic range imaging by sparse representation , 2017, Neurocomputing.

[13]  Manuel Menezes de Oliveira Neto,et al.  High-Quality Reverse Tone Mapping for a Wide Range of Exposures , 2014, 2014 27th SIBGRAPI Conference on Graphics, Patterns and Images.

[14]  Yu Li,et al.  LIME: Low-Light Image Enhancement via Illumination Map Estimation , 2017, IEEE Transactions on Image Processing.

[15]  Christian Ledig,et al.  Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[16]  Chi-Keung Tang,et al.  Deep High Dynamic Range Imaging with Large Foreground Motions , 2017, ECCV.

[17]  Lilong Shi,et al.  The Rehabilitation of MaxRGB , 2010, CIC.

[18]  Luca Bogoni,et al.  Extending dynamic range of monochrome and color images through fusion , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[19]  Wai-kuen Cham,et al.  Gradient-Directed Multiexposure Composition , 2012, IEEE Transactions on Image Processing.

[20]  Ravi Ramamoorthi,et al.  Deep high dynamic range imaging of dynamic scenes , 2017, ACM Trans. Graph..

[21]  Erik Reinhard,et al.  Photographic tone reproduction for digital images , 2002, ACM Trans. Graph..

[22]  D. Ruppert The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004 .

[23]  Jan Kautz,et al.  Exposure Fusion , 2009, 15th Pacific Conference on Computer Graphics and Applications (PG'07).

[24]  Richard Szeliski,et al.  High dynamic range video , 2003, ACM Trans. Graph..

[25]  Yanning Zhang,et al.  Attention-Guided Network for Ghost-Free High Dynamic Range Imaging , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Nam Ik Cho,et al.  Probabilistic motion pixel detection for the reduction of ghost artifacts in high dynamic range images from multiple exposures , 2014, EURASIP J. Image Video Process..

[27]  Sang Uk Lee,et al.  Ghost-Free High Dynamic Range Imaging , 2010, ACCV.

[28]  Nam Ik Cho,et al.  A Multi-Exposure Image Fusion Based on the Adaptive Weights Reflecting the Relative Pixel Intensity and Global Gradient , 2018, 2018 25th IEEE International Conference on Image Processing (ICIP).

[29]  Chul Lee,et al.  Ghost-Free High Dynamic Range Imaging via Rank Minimization , 2014, IEEE Signal Processing Letters.

[30]  Rafal Mantiuk,et al.  Display adaptive tone mapping , 2008, ACM Trans. Graph..

[31]  Joachim Weickert,et al.  Freehand HDR Imaging of Moving Scenes with Simultaneous Resolution Enhancement , 2011, Comput. Graph. Forum.

[32]  Nam Ik Cho,et al.  A multi-exposure image fusion algorithm without ghost effect , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[33]  Nam Ik Cho,et al.  Reduction of ghost effect in exposure fusion by detecting the ghost pixels in saturated and non-saturated regions , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[34]  Greg Ward,et al.  Automatic High-Dynamic Range Image Generation for Dynamic Scenes , 2008, IEEE Computer Graphics and Applications.

[35]  Suk-Ju Kang,et al.  Deep Recursive HDRI: Inverse Tone Mapping Using Generative Adversarial Networks , 2018, ECCV.

[36]  Nam Ik Cho,et al.  High Dynamic Range and Super-Resolution Imaging From a Single Image , 2018, IEEE Access.

[37]  Lei Zhang,et al.  Learning a Deep Single Image Contrast Enhancer from Multi-Exposure Images , 2018, IEEE Transactions on Image Processing.

[38]  Yoshihiro Kanamori,et al.  Deep reverse tone mapping , 2017, ACM Trans. Graph..

[39]  Zhou Wang,et al.  Multi-exposure image fusion: A patch-wise approach , 2015, 2015 IEEE International Conference on Image Processing (ICIP).

[40]  Ce Liu,et al.  Exploring new representations and applications for motion analysis , 2009 .

[41]  Tae-Hyun Oh,et al.  Robust High Dynamic Range Imaging by Rank Minimization , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[42]  Karol Myszkowski,et al.  High Dynamic Range Imaging and Low Dynamic Range Expansion for Generating HDR Content , 2009, Eurographics.

[43]  S. Acton,et al.  Image enhancement using a contrast measure in the compressed domain , 2003, IEEE Signal Processing Letters.

[44]  Jitendra Malik,et al.  Recovering high dynamic range radiance maps from photographs , 1997, SIGGRAPH '08.

[45]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[46]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[47]  Nam Ik Cho,et al.  Joint High Dynamic Range Imaging and Super-Resolution From a Single Image , 2019, IEEE Access.

[48]  Delu Zeng,et al.  A fusion-based enhancing method for weakly illuminated images , 2016, Signal Process..

[49]  Ching-Te Chiu,et al.  Pseudo-Multiple-Exposure-Based Tone Fusion With Local Region Adjustment , 2015, IEEE Transactions on Multimedia.

[50]  Jun Hu,et al.  HDR Deghosting: How to Deal with Saturation? , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[51]  Guilherme J. M. Rosa The Elements of Statistical Learning: Data Mining, Inference, and Prediction by HASTIE, T., TIBSHIRANI, R., and FRIEDMAN, J , 2010 .

[52]  Yun Fu,et al.  Residual Dense Network for Image Super-Resolution , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[53]  Rafal Mantiuk,et al.  Multi-exposure image stacks for testing HDR deghosting methods , 2017 .

[54]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[55]  Wolfgang Heidrich,et al.  HDR-VDP-2: a calibrated visual metric for visibility and quality predictions in all luminance conditions , 2011, ACM Trans. Graph..

[56]  Anna Tomaszewska,et al.  Image Registration for Multi-exposure High Dynamic Range Image Acquisition , 2007 .

[57]  Hans-Peter Seidel,et al.  Optimal HDR reconstruction with linear digital cameras , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[58]  Diego Gutierrez,et al.  Evaluation of reverse tone mapping through varying exposure conditions , 2009, ACM Trans. Graph..