Brightness-Based Convolutional Neural Network for Thermal Image Enhancement

In this paper, we propose a convolutional neural network for thermal image enhancement by incorporating the brightness domain with a residual-learning technique, which improves the performance of enhancement and speed of convergence. Typically, the training domain uses the same domain as that of the target image; however, we evaluated several domains to determine the most suitable one for the network. In the analyses, we first compared the performance of networks that were trained by the corresponding regions of color-based and aligned infrared-based images, respectively, including thermal, far, and near spectra. Then, four RGB-based domains, namely, gray, lightness, intensity, and brightness were evaluated. Finally, the proposed network architecture was determined by considering the residual and brightness domains. The results of the analyses indicated that the brightness domain was the best training domain for enhancing the thermal images. The experimental results confirm that the proposed network, which can be trained in approximately one hour, outperforms the conventional learning-based approaches for thermal image enhancement, in terms of several image quality metrics and a qualitative evaluation. Furthermore, the results demonstrate that the brightness domain is effective as the training domain and can be used to increase the performance of existing networks.

[1]  Roland Siegwart,et al.  People detection and tracking from aerial thermal views , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[2]  Charless C. Fowlkes,et al.  Contour Detection and Hierarchical Image Segmentation , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Namil Kim,et al.  Thermal Image Enhancement using Convolutional Neural Network , 2016, 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[4]  Wenda Zhao,et al.  Variational infrared image enhancement based on adaptive dual-threshold gradient field equalization , 2014 .

[5]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[6]  Horst Bischof,et al.  Fast and accurate image upscaling with super-resolution forests , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Qiang Chen,et al.  Network In Network , 2013, ICLR.

[8]  Chu Kiong Loo,et al.  Home Alone Faint Detection Surveillance System Using Thermal Camera , 2010, 2010 Second International Conference on Computer Research and Development.

[9]  Jiaolong Xu,et al.  Pedestrian Detection at Day/Night Time with Visible and FIR Cameras: A Comparison , 2016, Sensors.

[10]  Ali Farhadi,et al.  You Only Look Once: Unified, Real-Time Object Detection , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Rob Fergus,et al.  Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[12]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Thomas B. Moeslund,et al.  Pedestrian Counting with Occlusion Handling Using Stereo Thermal Cameras , 2016, Sensors.

[14]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[15]  Fengshou Gu,et al.  Thermal image enhancement using bi-dimensional empirical mode decomposition in combination with relevance vector machine for rotating machinery fault diagnosis , 2013 .

[16]  Dong Liang,et al.  Robust pedestrian detection in thermal infrared imagery using a shape distribution histogram feature and modified sparse representation classification , 2015, Pattern Recognit..

[17]  Wojciech Matusik,et al.  Statistics of Infrared Images , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[18]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Euntai Kim,et al.  Efficient Pedestrian Detection at Nighttime Using a Thermal Camera , 2017, Sensors.

[20]  Qi Li,et al.  A novel method of infrared image denoising and edge enhancement , 2008, Signal Process..

[21]  Thomas S. Huang,et al.  Image Super-Resolution Via Sparse Representation , 2010, IEEE Transactions on Image Processing.

[22]  Xiaoou Tang,et al.  Accelerating the Super-Resolution Convolutional Neural Network , 2016, ECCV.

[23]  Aníbal Ollero,et al.  An Intelligent System for False Alarm Reduction in Infrared Forest-Fire Detection , 2000, IEEE Intell. Syst..

[24]  Hongxun Yao,et al.  Image Evaluation Factors , 2005, ICIAR.

[25]  Kok-Swee Sim,et al.  Infrared image enhancement using adaptive trilateral contrast enhancement , 2015, Pattern Recognit. Lett..

[26]  Xiaoou Tang,et al.  Image Super-Resolution Using Deep Convolutional Networks , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  Xiangzhi Bai,et al.  Infrared image enhancement through contrast enhancement by using multiscale new top-hat transform , 2011 .

[28]  Sabine Süsstrunk,et al.  Color image dehazing using the near-infrared , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[29]  Patricia Ladret,et al.  The blur effect: perception and estimation with a new no-reference perceptual blur metric , 2007, Electronic Imaging.

[30]  A. Prata,et al.  Retrieval of volcanic ash particle size, mass and optical depth from a ground-based thermal infrared camera , 2009 .

[31]  Sabine Süsstrunk,et al.  Correlation-based joint acquisition and demosaicing of visible and near-infrared images , 2011, 2011 18th IEEE International Conference on Image Processing.

[32]  Namil Kim,et al.  Multispectral pedestrian detection: Benchmark dataset and baseline , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[33]  Alan C. Bovik,et al.  Making a “Completely Blind” Image Quality Analyzer , 2013, IEEE Signal Processing Letters.

[34]  Angel Domingo Sappa,et al.  Multispectral Image Feature Points , 2012, Sensors.

[35]  Michael Felsberg,et al.  A thermal Object Tracking benchmark , 2015, 2015 12th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[36]  Guixi Liu,et al.  A new approach for infrared image contrast enhancement , 2006, International Symposium on Advanced Optical Manufacturing and Testing Technologies (AOMATT).

[37]  James W. Davis,et al.  Background-subtraction using contour-based fusion of thermal and visible imagery , 2007, Comput. Vis. Image Underst..

[38]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[39]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[40]  Kyoung Mu Lee,et al.  Accurate Image Super-Resolution Using Very Deep Convolutional Networks , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[41]  Shuang Wang,et al.  Joint Prior Learning for Visual Sensor Network Noisy Image Super-Resolution , 2016, Sensors.

[42]  Bai Lian HISTOGRAM DOUBLE EQUALIZATION IN INFRARED IMAGE , 2003 .

[43]  Sergey Ioffe,et al.  Rethinking the Inception Architecture for Computer Vision , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[44]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[45]  Stefan Harmeling,et al.  Image denoising: Can plain neural networks compete with BM3D? , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[46]  Haidi Ibrahim,et al.  Brightness Preserving Dynamic Histogram Equalization for Image Contrast Enhancement , 2007, IEEE Transactions on Consumer Electronics.

[47]  Arnold C. Goldberg,et al.  Application of dual-band infrared focal plane arrays to tactical and strategic military problems , 2003, SPIE Optics + Photonics.

[48]  Yoshua Bengio,et al.  Learning long-term dependencies with gradient descent is difficult , 1994, IEEE Trans. Neural Networks.

[49]  Angel Domingo Sappa,et al.  Multispectral piecewise planar stereo using Manhattan-world assumption , 2013, Pattern Recognit. Lett..

[50]  Paulo Vinicius Koerich Borges,et al.  Practical Infrared Visual Odometry , 2016, IEEE Transactions on Intelligent Transportation Systems.

[51]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[52]  Sergio Escalera,et al.  Multi-modal RGB–Depth–Thermal Human Body Segmentation , 2016, International Journal of Computer Vision.

[53]  Sabine Süsstrunk,et al.  Colouring the Near-Infrared , 2008, CIC.

[54]  Luc Van Gool,et al.  Anchored Neighborhood Regression for Fast Example-Based Super-Resolution , 2013, 2013 IEEE International Conference on Computer Vision.

[55]  Gustavo de Veciana,et al.  An information fidelity criterion for image quality assessment using natural scene statistics , 2005, IEEE Transactions on Image Processing.

[56]  Guillaume-Alexandre Bilodeau,et al.  An iterative integrated framework for thermal-visible image registration, sensor fusion, and people tracking for video surveillance applications , 2012, Comput. Vis. Image Underst..