论文信息 - Boosting Structure Consistency for Multispectral and Multimodal Image Registration

Boosting Structure Consistency for Multispectral and Multimodal Image Registration

Multispectral imaging plays a vital role in the area of computer vision and computational photography. As spectral band images can be misaligned due to imaging device movement or alternation, image registration is necessary to avoid spectral information distortion. The current registration measures specialized for multispectral data are typically robust yet complex, requiring excessive computation. The common measures such as sum of squared differences (SSD) and sum of absolute differences (SAD) are computationally efficient whereas they perform poorly on multispectral data. To cope with this challenge, we propose a structure consistency boosting (SCB) transform that aims at boosting the structural similarity of multispectral images. With SCB, the common measures can be employed for multispectral image registration. The SCB transform exploits the fact that inherent edge structures maintain relative saliency locally despite the nonlinear variation between band images. A statistical prior of the natural image, which is based on the gradient-intensity correlation, is explored to build a parametric form of SCB. Experimental results validate that the SCB transform outperforms current similarity enhancement algorithms, and performs better than the state-of-the-art multispectral registration measures. Thanks to the generality of the statistical prior, the SCB transform is also applicable to various multimodal data such as flash/no-flash images and medical images.

[1] David J. C. MacKay,et al. Information Theory, Inference, and Learning Algorithms , 2004, IEEE Transactions on Information Theory.

[2] Rynson W. H. Lau,et al. Saliency Detection with Flash and No-flash Image Pairs , 2014, ECCV.

[3] Michael F. Cohen,et al. Digital photography with flash and no-flash image pairs , 2004, ACM Trans. Graph..

[4] Shree K. Nayar,et al. Generalized Assorted Pixel Camera: Postcapture Control of Resolution, Dynamic Range, and Spectrum , 2010, IEEE Transactions on Image Processing.

[5] Daniel Rueckert,et al. Nonrigid registration using free-form deformations: application to breast MR images , 1999, IEEE Transactions on Medical Imaging.

[6] Assaf Zomet,et al. Learning how to inpaint from global image statistics , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[7] Yacov Hel-Or,et al. Matching by Tone Mapping: Photometric Invariant Template Matching , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8] Jitendra Malik,et al. A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[9] Kihong Park,et al. Learning to Find Unpaired Cross-Spectral Correspondences , 2019, IEEE Transactions on Image Processing.

[10] Ayan Chakrabarti,et al. Statistics of real-world hyperspectral images , 2011, CVPR 2011.

[11] Antonio Torralba,et al. SIFT Flow: Dense Correspondence across Different Scenes , 2008, ECCV.

[12] Ramin Zabih,et al. Non-parametric Local Transforms for Computing Visual Correspondence , 1994, ECCV.

[13] Mingyue Ding,et al. Two Phase Non-Rigid Multi-Modal Image Registration Using Weber Local Descriptor-Based Similarity Metrics and Normalized Mutual Information , 2013, Sensors.

[14] Erik Reinhard,et al. High Dynamic Range Imaging: Acquisition, Display, and Image-Based Lighting , 2010 .

[15] Joachim Weickert,et al. Lucas/Kanade Meets Horn/Schunck: Combining Local and Global Optic Flow Methods , 2005, International Journal of Computer Vision.

[16] Sabine Süsstrunk,et al. Multi-spectral SIFT for scene category recognition , 2011, CVPR 2011.

[17] Guy Marchal,et al. Multimodality image registration by maximization of mutual information , 1997, IEEE Transactions on Medical Imaging.

[18] Nicholas Ayache,et al. The Correlation Ratio as a New Similarity Measure for Multimodal Image Registration , 1998, MICCAI.

[19] Hui-Liang Shen,et al. Normalized Total Gradient: A New Measure for Multispectral Image Registration , 2017, IEEE Transactions on Image Processing.

[20] R. Storn,et al. Differential Evolution: A Practical Approach to Global Optimization (Natural Computing Series) , 2005 .

[21] Seungryong Kim,et al. LAT: Local area transform for cross modal correspondence matching , 2017, Pattern Recognit..

[22] Rob Fergus,et al. Fast Image Deconvolution using Hyper-Laplacian Priors , 2009, NIPS.

[23] Jan Flusser,et al. Image registration methods: a survey , 2003, Image Vis. Comput..

[24] Martin J. Wainwright,et al. Image denoising using scale mixtures of Gaussians in the wavelet domain , 2003, IEEE Trans. Image Process..

[25] Qi Zhang,et al. Multi-modal and Multi-spectral Registration for Natural Images , 2014, ECCV.

[26] S. Nadarajah. A generalized normal distribution , 2005 .

[27] Jon Atli Benediktsson,et al. Exploiting spectral and spatial information in hyperspectral urban data with high resolution , 2004, IEEE Geoscience and Remote Sensing Letters.

[28] Til Aach,et al. Multispectral filter wheel cameras: modeling aberrations for filters in front of lens , 2011, Electronic Imaging.

[29] Gaofeng Meng,et al. Spectral Unmixing via Data-Guided Sparsity , 2014, IEEE Transactions on Image Processing.

[30] Takayuki Hamamoto,et al. RGB-NIR imaging with exposure bracketing for joint denoising and deblurring of low-light color images , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[31] Nassir Navab,et al. Structural image representation for image registration , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Workshops.

[32] Robert C. Bolles,et al. Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[33] Takayuki Hamamoto,et al. Enhancing Color Images of Extremely Low Light Scenes Based on RGB/NIR Images Acquisition With Different Exposure Times , 2015, IEEE Transactions on Image Processing.

[34] Matti Pietikäinen,et al. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, TPAMI-2008-09-0620 1 WLD: A Robust Local Image Descriptor , 2022 .

[35] Sadegh Abbasi,et al. Shape similarity retrieval under affine transform: application to multi-view object representation and recognition , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[36] Minh N. Do,et al. DASC: Robust Dense Descriptor for Multi-Modal and Multi-Spectral Correspondence Estimation , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[37] Jürgen Weese,et al. A comparison of similarity measures for use in 2-D-3-D medical image registration , 1998, IEEE Transactions on Medical Imaging.

[38] Sundaresh Ram,et al. Removing Camera Shake from a Single Photograph , 2009 .

[39] Minh N. Do,et al. DASC: Dense adaptive self-correlation descriptor for multi-modal and multi-spectral correspondence , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[40] Jon Atli Benediktsson,et al. Segmentation and classification of hyperspectral images using watershed transformation , 2010, Pattern Recognit..

[41] Alan C. Evans,et al. BrainWeb: Online Interface to a 3D MRI Simulated Brain Database , 1997 .