IR-MSDNet: Infrared and Visible Image Fusion Based On Infrared Features and Multiscale Dense Network

Infrared (IR) and visible images are heterogeneous data, and their fusion is one of the important research contents in the remote sensing field. In the last decade, deep networks have been widely used in image fusion due to their ability to preserve high-level semantic information. However, due to the lower resolution of IR images, deep learning-based methods may not be able to retain the salient features of IR images. In this article, a novel IR and visible image fusion based on IR Features & Multiscale Dense Network (IR-MSDNet) is proposed to preserve the content and key target features from both visible and IR images in the fused image. It comprises an encoder, a multiscale decoder, a traditional processing unit, and a fused unit, and can capture incredibly rich background details in visible images and prominent target details in IR features. When the dense and multiscale features are fused, the background details are obtained by utilizing attention strategy, and then combined with complimentary edge features. While IR features are extracted by traditional quadtree decomposition and Bezier interpolation, and further intensified by refinement. Finally, both the decoded multiscale features and IR features are used to reconstruct the final fused image. Experimental evaluation with other state-of-the-art fusion methods validates the superiority of our proposed IR-MSDNet in both subjective and objective evaluation metrics. Additional objective evaluation conducted on the object detection (OD) task further verifies that the proposed IR-MSDNet has greatly enhanced the details in the fused images, which bring the best OD results.

[1]  Wei Cai,et al.  A region-based multi-sensor image fusion scheme using pulse-coupled neural network , 2006, Pattern Recognit. Lett..

[2]  Michael Teutsch,et al.  Generalization ability of region proposal networks for multispectral person detection , 2019, Automatic Target Recognition XXIX.

[3]  R. Venkatesh Babu,et al.  DeepFuse: A Deep Unsupervised Approach for Exposure Fusion with Extreme Exposure Image Pairs , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[4]  Hong Huo,et al.  PFAF-Net: Pyramid Feature Network for Multimodal Fusion , 2020, IEEE Sensors Letters.

[5]  Jiayi Ma,et al.  Infrared and visible image fusion via gradient transfer and total variation minimization , 2016, Inf. Fusion.

[6]  Yu Zhang,et al.  Infrared and visual image fusion through infrared feature extraction and visual information preservation , 2017 .

[7]  LiShutao,et al.  Image matting for fusion of multi-focus images in dynamic scenes , 2013 .

[8]  Namil Kim,et al.  Multispectral pedestrian detection: Benchmark dataset and baseline , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Yu Liu,et al.  Multi-focus image fusion with a deep convolutional neural network , 2017, Inf. Fusion.

[10]  Josef Kittler,et al.  Infrared and Visible Image Fusion using a Deep Learning Framework , 2018, 2018 24th International Conference on Pattern Recognition (ICPR).

[11]  Rabab Kreidieh Ward,et al.  Image Fusion With Convolutional Sparse Representation , 2016, IEEE Signal Processing Letters.

[12]  J. Wesley Roberts,et al.  Assessment of image fusion procedures using entropy, image quality, and multispectral classification , 2008 .

[13]  Junjun Jiang,et al.  FusionGAN: A generative adversarial network for infrared and visible image fusion , 2019, Inf. Fusion.

[14]  Yu Liu,et al.  Multi-focus image fusion with dense SIFT , 2015, Inf. Fusion.

[15]  Edward H. Adelson,et al.  The Laplacian Pyramid as a Compact Image Code , 1983, IEEE Trans. Commun..

[16]  Shutao Li,et al.  Image Fusion With Guided Filtering , 2013, IEEE Transactions on Image Processing.

[17]  Simon J. Godsill,et al.  A Nonreference Image Fusion Metric Based on the Regional Importance Measure , 2009, IEEE Journal of Selected Topics in Signal Processing.

[18]  Manas Kamal Bhuyan,et al.  Image Fusion Using Adjustable Non-subsampled Shearlet Transform , 2019, IEEE Transactions on Instrumentation and Measurement.

[19]  Kai Zeng,et al.  Perceptual Quality Assessment for Multi-Exposure Image Fusion , 2015, IEEE Transactions on Image Processing.

[20]  Yu Liu,et al.  IFCNN: A general image fusion framework based on convolutional neural network , 2020, Inf. Fusion.

[21]  Zhi Gao,et al.  Improved Faster R-CNN With Multiscale Feature Fusion and Homography Augmentation for Vehicle Detection in Remote Sensing Images , 2019, IEEE Geoscience and Remote Sensing Letters.

[22]  Jiayi Ma,et al.  Infrared and visible image fusion methods and applications: A survey , 2018, Inf. Fusion.

[23]  Shutao Li,et al.  Multifocus image fusion using region segmentation and spatial frequency , 2008, Image Vis. Comput..

[24]  Veysel Aslantas,et al.  Fusion of multi-focus images using differential evolution algorithm , 2010, Expert Syst. Appl..

[25]  Haifeng Li,et al.  Dictionary learning method for joint sparse representation-based image fusion , 2013 .

[26]  Vladimir Petrovic,et al.  Objective image fusion performance measure , 2000 .

[27]  Hong Huo,et al.  Diverse Capsules Network Combining Multiconvolutional Layers for Remote Sensing Image Scene Classification , 2020, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[28]  Zhongliang Jing,et al.  Visible and infrared image fusion using ℓ0-generalized total variation model , 2017, Science China Information Sciences.

[29]  Cedric Nishan Canagarajah,et al.  Pixel- and region-based image fusion with complex wavelets , 2007, Inf. Fusion.

[30]  Toet Alexander,et al.  TNO Image Fusion Dataset , 2014 .

[31]  Yue Qi,et al.  Infrared and visible image fusion method based on saliency detection in sparse domain , 2017 .

[32]  ChandaBhabatosh,et al.  Multi-focus image fusion using a morphology-based focus measure in a quad-tree structure , 2013 .

[33]  Li Zhang In Situ Image Segmentation Using the Convexity of Illumination Distribution of the Light Sources , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[35]  Yu Zhang,et al.  Quadtree-based multi-focus image fusion using a weighted focus-measure , 2015, Inf. Fusion.

[36]  Hui Li,et al.  DenseFuse: A Fusion Approach to Infrared and Visible Images , 2018, IEEE Transactions on Image Processing.

[37]  B. S. Manjunath,et al.  Multisensor Image Fusion Using the Wavelet Transform , 1995, CVGIP Graph. Model. Image Process..

[38]  Zhe Chen,et al.  Noise-robust dictionary learning with slack block-Diagonal structure for face recognition , 2020, Pattern Recognit..

[39]  Luciano Alparone,et al.  Remote sensing image fusion using the curvelet transform , 2007, Inf. Fusion.

[40]  Mohammad Haghighat,et al.  Fast-FMI: Non-reference image fusion metric , 2014, 2014 IEEE 8th International Conference on Application of Information and Communication Technologies (AICT).