Registration of Multimodal Remote Sensing Image Based on Deep Fully Convolutional Neural Network

Multimodal image registration is the fundamental technique for scene analysis with series remote sensing images of different spectrum region. Due to the highly nonlinear radiometric relationship, it is quite challenging to find common features between images of different modal types. This paper resorts to the deep neural network, and tries to learn descriptors for multimodal image patch matching, which is the key issue of image registration. A Siamese fully convolutional network is set up and trained with a novel loss function, which adopts the strategy of maximizing the feature distance between positive and hard negative samples. The two branches of the Siamese network are connected by the convolutional operation, resulting in the similarity score between the two input image patches. The similarity score value is used, not only for correspondence point location, but also for outlier identification. A generalized workflow for deep feature based multimodal RS image registration is constructed, including the training data curation, candidate feature point generation, and outlier removal. The proposed network is tested on a variety of optical, near infrared, thermal infrared, SAR, and map images. Experiment results verify the superiority over other state-of-the-art approaches.

[1]  Bin Li,et al.  RSCJ: Robust Sample Consensus Judging Algorithm for Remote Sensing Image Registration , 2012, IEEE Geoscience and Remote Sensing Letters.

[2]  Vincent Lepetit,et al.  LIFT: Learned Invariant Feature Transform , 2016, ECCV.

[3]  Andrea Vedaldi,et al.  Learning Covariant Feature Detectors , 2016, ECCV Workshops.

[4]  Peter Reinartz,et al.  Exploring the Potential of Conditional Adversarial Networks for Optical and SAR Image Matching , 2018, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[5]  Marc Niethammer,et al.  Quicksilver: Fast predictive image registration – A deep learning approach , 2017, NeuroImage.

[6]  Luc Van Gool,et al.  SURF: Speeded Up Robust Features , 2006, ECCV.

[7]  Weidong Min,et al.  Remote Sensing Image Registration Using Convolutional Neural Network Features , 2018, IEEE Geoscience and Remote Sensing Letters.

[8]  Shuang Wang,et al.  A deep learning framework for remote sensing image registration , 2018, ISPRS Journal of Photogrammetry and Remote Sensing.

[9]  Min Chen,et al.  Matching of Remote Sensing Images with Complex Background Variations via Siamese Convolutional Neural Network , 2018, Remote. Sens..

[10]  Adrien Bartoli,et al.  Fast Explicit Diffusion for Accelerated Features in Nonlinear Scale Spaces , 2013, BMVC.

[11]  Dengrong Zhang,et al.  A fast and fully automatic registration approach based on point features for multi-source remote-sensing images , 2008, Comput. Geosci..

[12]  Torsten Sattler,et al.  Comparative Evaluation of Hand-Crafted and Learned Local Features , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[13]  Ian Dowman,et al.  An improved model for automatic feature-based registration of SAR and SPOT images , 2001 .

[14]  Yongil Kim,et al.  Parameter Optimization for the Extraction of Matching Points Between High-Resolution Multisensor Images in Urban Areas , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[15]  Xiao Xiang Zhu,et al.  Identifying Corresponding Patches in SAR and Optical Images With a Pseudo-Siamese CNN , 2018, IEEE Geoscience and Remote Sensing Letters.

[16]  Iasonas Kokkinos,et al.  Discriminative Learning of Deep Convolutional Feature Point Descriptors , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[17]  Yosi Keller,et al.  Multimodal matching using a Hybrid Convolutional Neural Network , 2018, ArXiv.

[18]  Weiping Ni,et al.  Visual tracking using Siamese convolutional neural network with region proposal and domain specific updating , 2018, Neurocomputing.

[19]  Cordelia Schmid,et al.  A Performance Evaluation of Local Descriptors , 2005, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  Victor J. D. Tsai,et al.  Delaunay Triangulations in TIN Creation: An Overview and a Linear-Time Algorithm , 1993, Int. J. Geogr. Inf. Sci..

[21]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[22]  Jiri Matas,et al.  Working hard to know your neighbor's margins: Local descriptor learning loss , 2017, NIPS.

[23]  Gary R. Bradski,et al.  ORB: An efficient alternative to SIFT or SURF , 2011, 2011 International Conference on Computer Vision.

[24]  Xiangyu Zhang,et al.  Large Kernel Matters — Improve Semantic Segmentation by Global Convolutional Network , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  Rahul Sukthankar,et al.  MatchNet: Unifying feature and metric learning for patch-based matching , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[27]  Stefano Soatto,et al.  Domain-size pooling in local descriptors: DSP-SIFT , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Thomas Brox,et al.  Descriptor Matching with Convolutional Neural Networks: a Comparison to SIFT , 2014, ArXiv.

[29]  B. S. Manjunath,et al.  A contour-based approach to multisensor image registration , 1995, IEEE Trans. Image Process..

[30]  Cordelia Schmid,et al.  Evaluation of Interest Point Detectors , 2000, International Journal of Computer Vision.

[31]  Michael Ying Yang,et al.  Change Detection between Multimodal Remote Sensing Data Using Siamese CNN , 2018, ArXiv.

[32]  Shenghuo Zhu,et al.  Deep Learning of Invariant Features via Simulated Fixations in Video , 2012, NIPS.

[33]  Thomas Brox,et al.  Discriminative Unsupervised Feature Learning with Exemplar Convolutional Neural Networks , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[35]  Guillaume Charpiat,et al.  Aligning and Updating Cadaster Maps with Aerial Images by Multi-task, Multi-resolution Deep Learning , 2018, ACCV.

[36]  R. Schowengerdt,et al.  A robust technique for precise registration of radar and optical satellite images , 2005 .

[37]  Julie Delon,et al.  SAR-SIFT: A SIFT-Like Algorithm for SAR Images , 2015, IEEE Trans. Geosci. Remote. Sens..

[38]  Bin Fan,et al.  L2-Net: Deep Learning of Discriminative Patch Descriptor in Euclidean Space , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[39]  Songde Ma,et al.  Multisource data registration based on NURBS description of contours , 2008 .

[40]  Jan Flusser,et al.  Image registration methods: a survey , 2003, Image Vis. Comput..

[41]  Lorenzo Bruzzone,et al.  Robust Registration of Multimodal Remote Sensing Images Based on Structural Similarity , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[42]  Gregory Shakhnarovich,et al.  Deep Back-Projection Networks for Super-Resolution , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[43]  Gustavo Carneiro,et al.  Learning Local Image Descriptors with Deep Siamese and Triplet Convolutional Networks by Minimizing Global Loss Functions , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[44]  Luca Bertinetto,et al.  Fully-Convolutional Siamese Networks for Object Tracking , 2016, ECCV Workshops.

[45]  Xiao Xiang Zhu,et al.  Deep Learning in Remote Sensing: A Comprehensive Review and List of Resources , 2017, IEEE Geoscience and Remote Sensing Magazine.

[46]  Yan Ke,et al.  PCA-SIFT: a more distinctive representation for local image descriptors , 2004, CVPR 2004.

[47]  Matthew A. Brown,et al.  Automatic Panoramic Image Stitching using Invariant Features , 2007, International Journal of Computer Vision.

[48]  Yan Li,et al.  Estimation of Mutual Information: A Survey , 2009, RSKT.

[49]  Yoshua Bengio,et al.  Extracting and composing robust features with denoising autoencoders , 2008, ICML '08.

[50]  Viorica Patraucean,et al.  gvnn: Neural Network Library for Geometric Computer Vision , 2016, ECCV Workshops.

[51]  Raquel Urtasun,et al.  Exploiting Deep Matching and SAR Data for the Geo-Localization Accuracy Improvement of Optical Satellite Images , 2017, Remote. Sens..

[52]  Thomas Brox,et al.  FlowNet: Learning Optical Flow with Convolutional Networks , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[53]  Christopher G. Harris,et al.  A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.