论文信息 - Infrared and visible image fusion via detail preserving adversarial learning

Infrared and visible image fusion via detail preserving adversarial learning

Abstract TargefTablets can be detected easily from the background of infrared images due to their significantly discriminative thermal radiations, while visible images contain textural details with high spatial resolution which are beneficial to the enhancement of target recognition. Therefore, fused images with abundant detail information and effective target areas are desirable. In this paper, we propose an end-to-end model for infrared and visible image fusion based on detail preserving adversarial learning. It is able to overcome the limitations of the manual and complicated design of activity-level measurement and fusion rules in traditional fusion methods. Considering the specific information of infrared and visible images, we design two loss functions including the detail loss and target edge-enhancement loss to improve the quality of detail information and sharpen the edge of infrared targets under the framework of generative adversarial network. Our approach enables the fused image to simultaneously retain the thermal radiation with sharpening infrared target boundaries in the infrared image and the abundant textural details in the visible image. Experiments conducted on publicly available datasets demonstrate the superiority of our strategy over the state-of-the-art methods in both objective metrics and visual impressions. In particular, our results look like enhanced infrared images with clearly highlighted and edge-sharpened targets as well as abundant detail information.

[1] Alexei A. Efros,et al. Context Encoders: Feature Learning by Inpainting , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[3] Hui Li,et al. DenseFuse: A Fusion Approach to Infrared and Visible Images , 2018, IEEE Transactions on Image Processing.

[4] Shutao Li,et al. Image Fusion With Guided Filtering , 2013, IEEE Transactions on Image Processing.

[5] Junjun Jiang,et al. Locality Preserving Matching , 2018, International Journal of Computer Vision.

[6] Gang Liu,et al. Multi-sensor image fusion based on fourth order partial differential equations , 2017, 2017 20th International Conference on Information Fusion (Fusion).

[7] Rick S. Blum,et al. A categorization of multiscale-decomposition-based image fusion schemes with a performance study for a digital camera application , 1999, Proc. IEEE.

[8] Rabab Kreidieh Ward,et al. Deep learning for pixel-level image fusion: Recent advances and future prospects , 2018, Inf. Fusion.

[9] Xinming Tang,et al. IMAGE FUSION AND IMAGE QUALITY ASSESSMENT OF FUSED IMAGES , 2013 .

[10] Jiayi Ma,et al. Infrared and visible image fusion via gradient transfer and total variation minimization , 2016, Inf. Fusion.

[11] Léon Bottou,et al. Wasserstein GAN , 2017, ArXiv.

[12] Jiayi Ma,et al. Infrared and visible image fusion methods and applications: A survey , 2018, Inf. Fusion.

[13] Yang Lei,et al. Novel fusion method for visible light and infrared images based on NSST–SF–PCNN , 2014 .

[14] Yue Qi,et al. Infrared and visible image fusion method based on saliency detection in sparse domain , 2017 .

[15] J. Wesley Roberts,et al. Assessment of image fusion procedures using entropy, image quality, and multispectral classification , 2008 .

[16] Yoshua Bengio,et al. Generative Adversarial Nets , 2014, NIPS.

[17] Shutao Li,et al. Visual attention guided image fusion with sparse representation , 2014 .

[18] Vps Naidu,et al. Image Fusion Technique using Multi-resolution Singular Value Decomposition , 2011 .

[19] Joan Bruna,et al. Super-Resolution with Deep Convolutional Sufficient Statistics , 2015, ICLR.

[20] Junjun Jiang,et al. LMR: Learning a Two-Class Classifier for Mismatch Removal , 2019, IEEE Transactions on Image Processing.

[21] Shutao Li,et al. Performance comparison of different multi-resolution transforms for image fusion , 2011, Inf. Fusion.

[22] Hao Chen,et al. CNNs-Based RGB-D Saliency Detection via Cross-View Transfer and Multiview Fusion , 2017 .

[23] Qi Li,et al. Infrared image enhancement through saliency feature analysis based on multi-scale decomposition , 2014 .

[24] Josef Kittler,et al. Infrared and Visible Image Fusion using a Deep Learning Framework , 2018, 2018 24th International Conference on Pattern Recognition (ICPR).

[25] A. Bovik,et al. A universal image quality index , 2002, IEEE Signal Processing Letters.

[26] Hua Zong,et al. Infrared and visible image fusion based on visual saliency map and weighted least square optimization , 2017 .

[27] Alexei A. Efros,et al. Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[28] Christian Ledig,et al. Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[29] Rabab Kreidieh Ward,et al. Image Fusion With Convolutional Sparse Representation , 2016, IEEE Signal Processing Letters.

[30] Shutao Li,et al. Pixel-level image fusion: A survey of the state of the art , 2017, Inf. Fusion.

[31] Junwei Han,et al. CNNs-Based RGB-D Saliency Detection via Cross-View Transfer and Multiview Fusion. , 2018, IEEE transactions on cybernetics.

[32] Alex Graves,et al. DRAW: A Recurrent Neural Network For Image Generation , 2015, ICML.

[33] Durga Prasad Bavirisetti,et al. Fusion of Infrared and Visible Sensor Images Based on Anisotropic Diffusion and Karhunen-Loeve Transform , 2016, IEEE Sensors Journal.

[34] Yu Han,et al. A new image fusion performance metric based on visual information fidelity , 2013, Inf. Fusion.

[35] Raymond Y. K. Lau,et al. Least Squares Generative Adversarial Networks , 2016, 2017 IEEE International Conference on Computer Vision (ICCV).

[36] Alexander Toet,et al. Image fusion by a ration of low-pass pyramid , 1989, Pattern Recognit. Lett..

[37] W. Kong,et al. Adaptive fusion method of visible light and infrared images based on non-subsampled shearlet transform and fast non-negative matrix factorization , 2014 .

[38] Jun Huang,et al. Infrared and visible image fusion via saliency analysis and local edge-preserving multi-scale decomposition. , 2017, Journal of the Optical Society of America. A, Optics, image science, and vision.

[39] Qi Li,et al. Detail preserved fusion of visible and infrared images using regional saliency extraction and multi-scale image decomposition , 2015 .

[40] Jian Sun,et al. Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[41] B. S. Manjunath,et al. Multisensor Image Fusion Using the Wavelet Transform , 1995, CVGIP Graph. Model. Image Process..

[42] Ravindra Dhuli,et al. Two-scale image fusion of visible and infrared images using saliency detection , 2016 .

[43] Yi Shen,et al. Region level based multi-focus image fusion using quaternion wavelet and normalized cut , 2014, Signal Process..

[44] Yu Liu,et al. Multi-focus image fusion with a deep convolutional neural network , 2017, Inf. Fusion.

[45] Yong Pei,et al. Multilevel Depth and Image Fusion for Human Activity Detection , 2013, IEEE Transactions on Cybernetics.

[46] Li Yan,et al. A fusion algorithm for infrared and visible images based on adaptive dual-channel unit-linking PCNN in NSCT domain , 2015 .

[47] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[48] Rob Fergus,et al. Deep Generative Image Models using a Laplacian Pyramid of Adversarial Networks , 2015, NIPS.

[49] Li Fei-Fei,et al. Perceptual Losses for Real-Time Style Transfer and Super-Resolution , 2016, ECCV.

[50] Gonzalo Pajares Martinsanz,et al. A wavelet-based image fusion tutorial , 2004 .

[51] N. Djilali,et al. In-fibre Bragg grating sensors for distributed temperature measurement in a polymer electrolyte membrane fuel cell , 2009 .

[52] Pieter Abbeel,et al. InfoGAN: Interpretable Representation Learning by Information Maximizing Generative Adversarial Nets , 2016, NIPS.

[53] Simon Osindero,et al. Conditional Generative Adversarial Nets , 2014, ArXiv.

[54] Jianping Fan,et al. Fusion method for infrared and visible images by using non-negative sparse representation , 2014 .

[55] Yuqing Gao,et al. Robust Fusion of Color and Depth Data for RGB-D Target Tracking Using Adaptive Range-Invariant Depth Models and Spatio-Temporal Consistency Constraints , 2018, IEEE Transactions on Cybernetics.

[56] Chuan Li,et al. Precomputed Real-Time Texture Synthesis with Markovian Generative Adversarial Networks , 2016, ECCV.

[57] Shutao Li,et al. Group-Sparse Representation With Dictionary Learning for Medical Image Denoising and Fusion , 2012, IEEE Transactions on Biomedical Engineering.

[58] Namil Kim,et al. Pixel-Level Domain Transfer , 2016, ECCV.

[59] Paul S. Fisher,et al. Image quality measures and their performance , 1995, IEEE Trans. Commun..

[60] Myeong-Ryong Nam,et al. Fusion of multispectral and panchromatic Satellite images using the curvelet transform , 2005, IEEE Geoscience and Remote Sensing Letters.

[61] Soumith Chintala,et al. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[62] Junjun Jiang,et al. FusionGAN: A generative adversarial network for infrared and visible image fusion , 2019, Inf. Fusion.

[63] Thomas Brox,et al. Generating Images with Perceptual Similarity Metrics based on Deep Networks , 2016, NIPS.

[64] Yu Liu,et al. A general framework for image fusion based on multi-scale transform and sparse representation , 2015, Inf. Fusion.

[65] Cedric Nishan Canagarajah,et al. Pixel- and region-based image fusion with complex wavelets , 2007, Inf. Fusion.

[66] R. Venkatesh Babu,et al. DeepFuse: A Deep Unsupervised Approach for Exposure Fusion with Extreme Exposure Image Pairs , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).