A New Deep Learning Based Multi-Spectral Image Fusion Method

In this paper, we present a new effective infrared (IR) and visible (VIS) image fusion method by using a deep neural network. In our method, a Siamese convolutional neural network (CNN) is applied to automatically generate a weight map which represents the saliency of each pixel for a pair of source images. A CNN plays a role in automatic encoding an image into a feature domain for classification. By applying the proposed method, the key problems in image fusion, which are the activity level measurement and fusion rule design, can be figured out in one shot. The fusion is carried out through the multi-scale image decomposition based on wavelet transform, and the reconstruction result is more perceptual to a human visual system. In addition, the visual qualitative effectiveness of the proposed fusion method is evaluated by comparing pedestrian detection results with other methods, by using the YOLOv3 object detector using a public benchmark dataset. The experimental results show that our proposed method showed competitive results in terms of both quantitative assessment and visual quality.

[1]  Xinnan Fan,et al.  A Thermal Infrared and Visible Images Fusion Based Approach for Multitarget Detection under Complex Environment , 2015 .

[2]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[3]  Anup Basu,et al.  Cross-Scale Coefficient Selection for Volumetric Medical Image Fusion , 2013, IEEE Transactions on Biomedical Engineering.

[4]  Haiyan Jin,et al.  A fusion method for visible and infrared images based on contrast pyramid with teaching learning based optimization , 2014 .

[5]  Bin Li,et al.  Multimodal Medical Volumetric Data Fusion Using 3-D Discrete Shearlet Transform and Global-to-Local Rule , 2014, IEEE Transactions on Biomedical Engineering.

[6]  Rick S. Blum,et al.  A new automated quality assessment algorithm for image fusion , 2009, Image Vis. Comput..

[7]  Shutao Li,et al.  Pixel-level image fusion with simultaneous orthogonal matching pursuit , 2012, Inf. Fusion.

[8]  L. Yang,et al.  Multimodality medical image fusion based on multiscale geometric analysis of contourlet transform , 2008, Neurocomputing.

[9]  Junjun Jiang,et al.  FusionGAN: A generative adversarial network for infrared and visible image fusion , 2019, Inf. Fusion.

[10]  Bin Xiao,et al.  Union Laplacian pyramid with multiple features for medical image fusion , 2016, Neurocomputing.

[11]  A. Bovik,et al.  A universal image quality index , 2002, IEEE Signal Processing Letters.

[12]  B. S. Manjunath,et al.  Multi-sensor image fusion using the wavelet transform , 1994, Proceedings of 1st International Conference on Image Processing.

[13]  Shaowen Yao,et al.  A survey of infrared and visual image fusion methods , 2017 .

[14]  Qi Li,et al.  Detail preserved fusion of visible and infrared images using regional saliency extraction and multi-scale image decomposition , 2015 .

[15]  Qian Yin,et al.  A New Strategy to Improve Image Fusion Effect , 2006, 2006 International Conference on Machine Learning and Cybernetics.

[16]  Baohua Zhang,et al.  A fusion algorithm for infrared and visible images based on saliency analysis and non-subsampled Shearlet transform , 2015 .

[17]  Ashish Khare,et al.  Fusion of multimodal medical images using Daubechies complex wavelet transform - A multiresolution approach , 2014, Inf. Fusion.

[18]  Shutao Li,et al.  Group-Sparse Representation With Dictionary Learning for Medical Image Denoising and Fusion , 2012, IEEE Transactions on Biomedical Engineering.

[19]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[20]  Jiayi Ma,et al.  Infrared and visible image fusion methods and applications: A survey , 2018, Inf. Fusion.

[21]  Yu Han,et al.  A new image fusion performance metric based on visual information fidelity , 2013, Inf. Fusion.

[22]  Sujata Chaudhari,et al.  Yolo Real Time Object Detection , 2020 .

[23]  Yu Liu,et al.  Simultaneous image fusion and denoising with adaptive sparse representation , 2015, IET Image Process..

[24]  Alexander Toet,et al.  A morphological pyramidal image decomposition , 1989, Pattern Recognit. Lett..

[25]  B. S. Manjunath,et al.  Multisensor Image Fusion Using the Wavelet Transform , 1995, CVGIP Graph. Model. Image Process..

[26]  G. Qu,et al.  Information measure for performance of image fusion , 2002 .

[27]  Zheng Liu,et al.  Directive Contrast Based Multimodal Medical Image Fusion in NSCT Domain , 2013, IEEE Transactions on Multimedia.

[28]  Yu Liu,et al.  A general framework for image fusion based on multi-scale transform and sparse representation , 2015, Inf. Fusion.

[29]  Cedric Nishan Canagarajah,et al.  Pixel- and region-based image fusion with complex wavelets , 2007, Inf. Fusion.

[30]  J. Wesley Roberts,et al.  Assessment of image fusion procedures using entropy, image quality, and multispectral classification , 2008 .