A deep learning framework for matching of SAR and optical imagery

Abstract SAR and optical imagery provide highly complementary information about observed scenes. A combined use of these two modalities is thus desirable in many data fusion scenarios. However, any data fusion task requires measurements to be accurately aligned. While for both data sources images are usually provided in a georeferenced manner, the geo-localization of optical images is often inaccurate due to propagation of angular measurement errors. Many methods for the matching of homologous image regions exist for both SAR and optical imagery, however, these methods are unsuitable for SAR-optical image matching due to significant geometric and radiometric differences between the two modalities. In this paper, we present a three-step framework for sparse image matching of SAR and optical imagery, whereby each step is encoded by a deep neural network. We first predict regions in each image which are deemed most suitable for matching. A correspondence heatmap is then generated through a multi-scale, feature-space cross-correlation operator. Finally, outliers are removed by classifying the correspondence surface as a positive or negative match. Our experiments show that the proposed approach provides a substantial improvement over previous methods for SAR-optical image matching and can be used to register even large-scale scenes. This opens up the possibility of using both types of data jointly, for example for the improvement of the geo-localization of optical satellite imagery or multi-sensor stereogrammetry.

[1]  Nikos Komodakis,et al.  Learning to compare image patches via convolutional neural networks , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Yue Wu,et al.  A Novel Two-Step Registration Method for Remote Sensing Images Based on Deep and Local Features , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[3]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[4]  Peter Reinartz,et al.  Mutual-Information-Based Registration of TerraSAR-X and Ikonos Imagery in Urban Areas , 2010, IEEE Transactions on Geoscience and Remote Sensing.

[5]  Natalia Gimelshein,et al.  PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[6]  Maoguo Gong,et al.  A Novel Coarse-to-Fine Scheme for Automatic Image Registration Based on SIFT and Mutual Information , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[7]  Michael Schmitt,et al.  Matching of TerraSAR-X derived ground control points to optical image patches using deep learning , 2019 .

[8]  Jaakko Lehtinen,et al.  Progressive Growing of GANs for Improved Quality, Stability, and Variation , 2017, ICLR.

[9]  Yuming Xiang,et al.  OS-SIFT: A Robust SIFT-Like Algorithm for High-Resolution Optical-to-SAR Image Registration in Suburban Areas , 2018, IEEE Transactions on Geoscience and Remote Sensing.

[10]  Xiao Xiang Zhu,et al.  A framework for SAR-optical stereogrammetry over urban areas , 2018, ISPRS journal of photogrammetry and remote sensing : official publication of the International Society for Photogrammetry and Remote Sensing.

[11]  Jiri Matas,et al.  Working hard to know your neighbor's margins: Local descriptor learning loss , 2017, NIPS.

[12]  Peter Reinartz,et al.  Urban Atlas – DLR Processing Chain for Orthorectification of Prism and AVNIR-2 Images and TerraSAR-X as possible GCP Source , 2010 .

[13]  Vincent Lepetit,et al.  LIFT: Learned Invariant Feature Transform , 2016, ECCV.

[14]  Sandhya Banda,et al.  An overview of deep learning methods for image registration with focus on feature-based approaches , 2020, International Journal of Image and Data Fusion.

[15]  Xiao Xiang Zhu,et al.  A CNN for the identification of corresponding patches in SAR and optical imagery of urban scenes , 2017, 2017 Joint Urban Remote Sensing Event (JURSE).

[16]  Peter Reinartz,et al.  Modifications in the SIFT operator for effective SAR image matching , 2010 .

[17]  Qingwu Hu,et al.  RIFT: Multi-Modal Image Matching Based on Radiation-Variation Insensitive Feature Transform , 2019, IEEE Transactions on Image Processing.

[18]  In-So Kweon,et al.  CBAM: Convolutional Block Attention Module , 2018, ECCV.

[19]  Michael Schmitt,et al.  Deep Learning for SAR-Optical Image Matching , 2019, IGARSS 2019 - 2019 IEEE International Geoscience and Remote Sensing Symposium.

[20]  Torsten Sattler,et al.  D2-Net: A Trainable CNN for Joint Description and Detection of Local Features , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Jitendra Malik,et al.  Hypercolumns for object segmentation and fine-grained localization , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[22]  Leslie N. Smith,et al.  Cyclical Learning Rates for Training Neural Networks , 2015, 2017 IEEE Winter Conference on Applications of Computer Vision (WACV).

[23]  Thomas Brox,et al.  Descriptor Matching with Convolutional Neural Networks: a Comparison to SIFT , 2014, ArXiv.

[24]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[25]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[26]  Rahul Sukthankar,et al.  MatchNet: Unifying feature and metric learning for patch-based matching , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Alexey Shvets,et al.  TernausNet: U-Net with VGG11 Encoder Pre-Trained on ImageNet for Image Segmentation , 2018, Computer-Aided Analysis of Gastrointestinal Videos.

[28]  Xiao Xiang Zhu,et al.  Identifying Corresponding Patches in SAR and Optical Images With a Pseudo-Siamese CNN , 2018, IEEE Geoscience and Remote Sensing Letters.

[29]  Lloyd H. Hughes,et al.  A SEMI-SUPERVISED APPROACH TO SAR-OPTICAL IMAGE MATCHING , 2019, ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences.

[30]  Krystian Mikolajczyk,et al.  PN-Net: Conjoined Triple Deep Network for Learning Local Image Descriptors , 2016, ArXiv.

[31]  Y. Ye,et al.  HOPC: A NOVEL SIMILARITY METRIC BASED ON GEOMETRIC STRUCTURAL PROPERTIES FOR MULTI-MODAL REMOTE SENSING IMAGE MATCHING , 2016 .

[32]  Alexandre X. Falcão,et al.  Correcting rural building annotations in OpenStreetMap using convolutional neural networks , 2019, ISPRS Journal of Photogrammetry and Remote Sensing.

[33]  Joachim Denzler,et al.  Registration of High Resolution Sar and Optical Satellite Imagery Using Fully Convolutional Networks , 2019, IGARSS 2019 - 2019 IEEE International Geoscience and Remote Sensing Symposium.

[34]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[35]  Shuang Wang,et al.  A deep learning framework for remote sensing image registration , 2018, ISPRS Journal of Photogrammetry and Remote Sensing.

[36]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[37]  Xiao Xiang Zhu,et al.  Fusion of SAR and optical remote sensing data — Challenges and recent trends , 2017, 2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[38]  Raquel Urtasun,et al.  Exploiting Deep Matching and SAR Data for the Geo-Localization Accuracy Improvement of Optical Satellite Images , 2017, Remote. Sens..

[39]  Tali Dekel,et al.  SinGAN: Learning a Generative Model From a Single Natural Image , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[40]  Gokhan Bilgin,et al.  Visual Saliency Aided SAR and Optical Image Matching , 2019, 2019 Innovations in Intelligent Systems and Applications Conference (ASYU).

[41]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[42]  Krystian Mikolajczyk,et al.  Learning local feature descriptors with triplets and shallow convolutional neural networks , 2016, BMVC.

[43]  Xiao Xiang Zhu,et al.  Towards automatic SAR-optical stereogrammetry over urban areas using very high resolution imagery , 2018, ISPRS journal of photogrammetry and remote sensing : official publication of the International Society for Photogrammetry and Remote Sensing.

[44]  Iasonas Kokkinos,et al.  Discriminative Learning of Deep Convolutional Feature Point Descriptors , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[45]  P. Reinartz,et al.  Automated Georeferencing of Optical Satellite Data with Integrated Sensor Model Improvement , 2012 .

[46]  Julie Delon,et al.  SAR-SIFT: A SIFT-Like Algorithm for SAR Images , 2015, IEEE Trans. Geosci. Remote. Sens..

[47]  Maoguo Gong,et al.  Remote Sensing Image Registration With Modified SIFT and Enhanced Feature Matching , 2017, IEEE Geoscience and Remote Sensing Letters.

[48]  Gabriela Csurka,et al.  R2D2: Repeatable and Reliable Detector and Descriptor , 2019, ArXiv.