HDR image reconstruction from a single exposure using deep CNNs

Camera sensors can only capture a limited range of luminance simultaneously, and in order to create high dynamic range (HDR) images a set of different exposures is typically combined. In this paper we address the problem of predicting information that has been lost in saturated image areas, in order to enable HDR reconstruction from a single exposure. We show that this problem is well suited to deep learning algorithms, and propose a deep convolutional neural network (CNN) that is specifically designed to take into account the challenges in predicting HDR values. To train the CNN we gather a large dataset of HDR images, which we augment by simulating sensor saturation for a range of cameras. To further boost robustness, we pre-train the CNN on a simulated HDR dataset created from a subset of the MIT Places database. We demonstrate that our approach can reconstruct high-resolution, visually convincing HDR results in a wide range of situations, and that it generalizes well to reconstruction of images captured with arbitrary and low-end cameras that use unknown camera response functions and post-processing. Furthermore, we compare to existing methods for HDR expansion, and also show high-quality results for image-based lighting. Finally, we evaluate the results in a subjective experiment performed on an HDR display. This shows that the reconstructed HDR images are visually convincing, with large improvements compared to existing methods.
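The idea of simulating sensor saturation on an HDR image can be illustrated with a minimal sketch. This is not the paper's procedure: the function name `simulate_ldr_pixel`, the `exposure` and `gamma` parameters, and the use of a plain gamma curve in place of a real camera response function are all assumptions made for illustration. The sketch scales a linear HDR pixel by a virtual exposure, clips it at the sensor maximum, and quantizes it to 8 bits, which is roughly how a training input with lost highlight information could be produced from a known HDR target.

```python
def simulate_ldr_pixel(rgb, exposure=1.0, gamma=2.2):
    """Turn one linear HDR pixel into a simulated 8-bit LDR pixel.

    rgb is a tuple of linear radiance values, where 1.0 is taken to be
    the sensor saturation point after exposure scaling. The exposure and
    gamma defaults are illustrative assumptions, not the paper's values.
    """
    # Apply a virtual exposure to the linear radiance values.
    scaled = [c * exposure for c in rgb]
    # A pixel counts as saturated if any channel reaches the sensor limit.
    saturated = any(c >= 1.0 for c in scaled)
    # Clipping is where the highlight information is lost.
    clipped = [min(max(c, 0.0), 1.0) for c in scaled]
    # A simple gamma curve stands in for an unknown camera response,
    # followed by quantization to 8 bits.
    ldr = tuple(round(255.0 * c ** (1.0 / gamma)) for c in clipped)
    return ldr, saturated


# Usage: a bright pixel loses its red channel to clipping.
bright_ldr, bright_saturated = simulate_ldr_pixel((4.0, 0.5, 0.0))
dim_ldr, dim_saturated = simulate_ldr_pixel((0.25, 0.25, 0.25))
```

Under these assumptions, the red value 4.0 and any other value at or above 1.0 map to the same LDR code, which is the ambiguity a single-exposure reconstruction method has to resolve.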
