Deep reverse tone mapping

Inferring a high dynamic range (HDR) image from a single low dynamic range (LDR) input is an ill-posed problem in which we must compensate for data lost to under-/over-exposure and color quantization. To tackle this, we propose the first deep-learning-based approach for fully automatic inference using convolutional neural networks. Because directly inferring a 32-bit HDR image from an 8-bit LDR image is intractable due to the difficulty of training, we take an indirect approach: the key idea of our method is to synthesize LDR images taken with different exposures (i.e., bracketed images) based on supervised learning, and then reconstruct an HDR image by merging them. By learning the relative changes of pixel values due to increased/decreased exposures using 3D deconvolutional networks, our method can reproduce not only natural tones without introducing visible noise but also the colors of saturated pixels. We demonstrate the effectiveness of our method by comparing our results both with those of conventional methods and with ground-truth HDR images.
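The final merging step mentioned above can be illustrated with a minimal sketch. It is not the implementation used in this work: it assumes a simple gamma curve in place of a calibrated camera response, a triangle ("hat") weighting of pixel values, and single-channel images stored as flat lists. The names `simulate_ldr`, `hat_weight`, and `merge_exposures` are hypothetical, and the bracketed stack is simulated from known radiances here rather than synthesized by a network.

```python
GAMMA = 2.2  # assumed response curve; a real pipeline would calibrate this


def simulate_ldr(radiance, exposure_time, gamma=GAMMA):
    """Hypothetical stand-in for one bracketed LDR image: expose, clip, encode, quantize to 8 bits."""
    pixels = []
    for value in radiance:
        exposed = min(1.0, value * exposure_time)   # sensor saturates at 1.0
        encoded = exposed ** (1.0 / gamma)          # gamma encoding
        pixels.append(int(round(255.0 * encoded)))  # 8-bit quantization
    return pixels


def hat_weight(z):
    """Triangle weight on a normalized pixel value z in [0, 1]; it is zero at both clipped ends."""
    return 1.0 - abs(2.0 * z - 1.0)


def merge_exposures(stack, exposure_times, gamma=GAMMA, eps=1e-6):
    """Weighted average of per-exposure radiance estimates for every pixel."""
    hdr = []
    for p in range(len(stack[0])):
        numerator = 0.0
        denominator = 0.0
        for image, t in zip(stack, exposure_times):
            z = image[p] / 255.0
            w = hat_weight(z) + eps      # eps avoids division by zero if every exposure is clipped
            linear = z ** gamma          # undo the assumed gamma encoding
            numerator += w * linear / t  # radiance estimate from this exposure
            denominator += w
        hdr.append(numerator / denominator)
    return hdr


# Usage: three exposures of a three-pixel scene with a wide radiance range.
scene = [0.02, 0.5, 3.0]
times = [0.25, 1.0, 4.0]
stack = [simulate_ldr(scene, t) for t in times]
recovered = merge_exposures(stack, times)
```

In this toy setup the brightest pixel saturates in the two longer exposures, so its estimate comes almost entirely from the shortest one. This is the reason a bracketed stack can cover a wider range than any single 8-bit image.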
