A Novel Deep Learning Method for Thermal to Annotated Thermal-Optical Fused Images

Abstract Thermal Images profile the passive radiation of objects and capture them in grayscale images. Such images have a very different distribution of data compared to optical colored images. We present here a work that produces a grayscale thermo-optical fused mask given a thermal input. This is a deep learning based pioneering work since to the best of our knowledge, there exists no other work on thermal-optical grayscale fusion. Our method is also unique in the sense that the deep learning method we are proposing here works on the Discrete Wavelet Transform (DWT) domain instead of the gray level domain. As a part of this work, we also present a new and unique database for obtaining the region of interest in thermal images based on an existing thermal visual paired database, containing the Region of Interest on 5 different classes of data. Finally, we are proposing a simple low cost overhead statistical measure for identifying the region of interest in the fused images, which we call as the Region of Fusion (RoF). Experiments on the database show encouraging results in identifying the region of interest in the fused images. We also show that they can be processed better in the mixed form rather than with only thermal images.

[1]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[2]  刘波,et al.  An image fusion algorithm of infrared thermal and optical images for pig contour , 2013 .

[3]  Vijay John,et al.  Deep Learning Thermal Image Translation for Night Vision Perception , 2020, ACM Trans. Intell. Syst. Technol..

[4]  George Tzanetakis,et al.  Audio Analysis using the Discrete Wavelet Transform , 2001 .

[5]  Edgar Simo-Serra,et al.  Let there be Color!: Joint End-to-end Learning of Global and Local Image Priors for Automatic Image Colorization with Simultaneous Classification , 2016 .

[6]  Satya Prakash Yadav,et al.  Image fusion using hybrid methods in multimodality medical images , 2020, Medical & Biological Engineering & Computing.

[7]  Myeongsu Kang,et al.  Multiple Wavelet Coefficients Fusion in Deep Residual Networks for Fault Diagnosis , 2019, IEEE Transactions on Industrial Electronics.

[8]  Marina Ivasic-Kos,et al.  Thermal Imaging Dataset for Person Detection , 2019, 2019 42nd International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO).

[9]  Michael Felsberg,et al.  Generating Visible Spectrum Images from Thermal Infrared , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[10]  Sebastian Bock,et al.  A Proof of Local Convergence for the Adam Optimizer , 2019, 2019 International Joint Conference on Neural Networks (IJCNN).

[11]  J. E. Fowler,et al.  The redundant discrete wavelet transform and additive noise , 2005, IEEE Signal Processing Letters.

[12]  Jiaolong Xu,et al.  Pedestrian Detection at Day/Night Time with Visible and FIR Cameras: A Comparison , 2016, Sensors.

[13]  Kang Ryoung Park,et al.  Thermal Image Reconstruction Using Deep Learning , 2020, IEEE Access.

[14]  Bohyung Han,et al.  Learning Multi-domain Convolutional Neural Networks for Visual Tracking , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Zhenzhong Chen,et al.  Thermal to Visible Facial Image Translation Using Generative Adversarial Networks , 2018, IEEE Signal Processing Letters.

[16]  Feng Shu,et al.  Short-term traffic flow prediction based on spatio-temporal analysis and CNN deep learning , 2019, Transportmetrica A: Transport Science.

[17]  Himanshu Jindal,et al.  Novelty in Image Reconstruction using DWT and CLAHE , 2017 .

[18]  Ahmed Taha,et al.  A Passive Approach for Detecting Image Splicing Based on Deep Learning and Wavelet Transform , 2020 .

[19]  L. Yaroslavsky Processing and Fusion of Thermal and Video Sequences for Terrestrial Long Range Observation Systems , 2004 .

[20]  W. Zhou,et al.  An Ensembled Deep Learning Model Outperforms Human Experts in Diagnosing Biliary Atresia from Sonographic Gallbladder Images , 2020, medRxiv.

[21]  Mohamed Elhoseny,et al.  Deep learning model for real-time image compression in Internet of Underwater Things (IoUT) , 2020, Journal of Real-Time Image Processing.

[22]  James W. Davis,et al.  A Two-Stage Template Approach to Person Detection in Thermal Imagery , 2005, 2005 Seventh IEEE Workshops on Applications of Computer Vision (WACV/MOTION'05) - Volume 1.

[23]  Hui Fan,et al.  Multifocus Image Fusion Using Wavelet-Domain-Based Deep CNN , 2019, Comput. Intell. Neurosci..

[24]  Abhishek Dutta,et al.  The VGG Image Annotator (VIA) , 2019, ArXiv.

[25]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[26]  Guoyu Lu,et al.  An Alternative of LiDAR in Nighttime: Unsupervised Depth Estimation Based on Single Thermal Image , 2021, 2021 IEEE Winter Conference on Applications of Computer Vision (WACV).

[27]  Priti P. Rege,et al.  Pixel level fusion techniques for SAR and optical images: A review , 2020, Inf. Fusion.

[28]  Junsheng Shi,et al.  Intelligent Colorization for Thermal Infrared Image Based on CNN , 2020, 2020 IEEE International Conference on Information Technology,Big Data and Artificial Intelligence (ICIBA).

[29]  Satish Kumar Singh,et al.  A Novel Registration & Colorization Technique for Thermal to Cross Domain Colorized Images , 2021, ArXiv.

[30]  Paul Wilmes,et al.  Deep neural networks outperform human expert's capacity in characterizing bioleaching bacterial biofilm composition , 2019, Biotechnology reports.

[31]  Nicolas Usunier,et al.  End-to-End Object Detection with Transformers , 2020, ECCV.

[32]  Olivier Debeir,et al.  Robust Perceptual Night Vision in Thermal Colorization , 2020, VISIGRAPP.

[33]  Gautam Sanyal,et al.  A Robust Image Steganography using DWT Difference Modulation (DWTDM) , 2012 .

[34]  Lai-Man Po,et al.  SCGAN: Saliency Map-Guided Colorization With Generative Adversarial Network , 2020, IEEE Transactions on Circuits and Systems for Video Technology.

[35]  Wei Liu,et al.  Diverse Image Annotation , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[36]  Namil Kim,et al.  Multispectral pedestrian detection: Benchmark dataset and baseline , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[37]  Yunhan Li,et al.  Thermal Infrared Image Colorization for Nighttime Driving Scenes With Top-Down Guided Attention , 2021, IEEE Transactions on Intelligent Transportation Systems.

[38]  Md. Imdadul Islam,et al.  Human face recognition with combination of DWT and machine learning , 2020, J. King Saud Univ. Comput. Inf. Sci..

[39]  Bobby Lukose,et al.  Image Denoising Using Discrete Wavelet Transform , 2014 .

[40]  Kishor P. Upla,et al.  ThermISRnet: an efficient thermal image super-resolution network , 2021, Optical Engineering.

[41]  I. Flores-Parra,et al.  Segmentation of thermal infrared images of cucumber leaves using K-means clustering for estimating leaf wetness duration , 2020, International Journal of Agricultural and Biological Engineering.

[42]  Kilian Q. Weinberger,et al.  Fast Image Tagging , 2013, ICML.

[43]  Nurhan Gürsel Özmen,et al.  Worm gear condition monitoring and fault detection from thermal images via deep learning method , 2020, Eksploatacja i Niezawodnosc - Maintenance and Reliability.

[44]  Qian Chen,et al.  Thermal Infrared Colorization via Conditional Generative Adversarial Network , 2018, Infrared Physics & Technology.

[45]  Mang Ye,et al.  Grayscale Enhancement Colorization Network for Visible-Infrared Person Re-Identification , 2022, IEEE Transactions on Circuits and Systems for Video Technology.

[46]  Laurent Younes,et al.  Geodesic Image Matching: A Wavelet Based Energy Minimization Scheme , 2005, EMMCVPR.