Few-Shot Learning for Remote Sensing Image Retrieval With MAML

Few-shot remote sensing image retrieval is devoted to add new retrieval categories with a small number of labeled samples, and simultaneously achieve favorable retrieval performance for new categories and keep the primary retrieval performance for the original categories as far as possible. Few-shot learning has received considerable attention, however its applications in image retrieval, especially for remote sensing image retrieval, are still very few. In this paper, we redefine the few-shot image retrieval problem formally and further propose a few-shot retrieval method under model-agnostic meta-learning (MAML) framework, combined with ResNet and GeM as feature extraction module. Moreover, the optimal mean average precision (mAP) is used as ranking loss for defining the loss function of learning model, and a histogram binning approximation of mAP, which is differential, is thus employed so that the whole few-shot retrieval model can be end-to-end trained. The retrieval effectiveness and efficiency of our method are verified on UC-Merced and AID data sets.

[1]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[2]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[3]  Mihai Datcu,et al.  Exploiting Deep Features for Remote Sensing Image Retrieval: A Systematic Investigation , 2017, IEEE Transactions on Big Data.

[4]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Hugo Larochelle,et al.  Optimization as a Model for Few-Shot Learning , 2016, ICLR.

[6]  Shawn D. Newsam,et al.  Bag-of-visual-words and spatial extensions for land-use classification , 2010, GIS '10.

[7]  Hervé Jégou,et al.  Negative Evidences and Co-occurences in Image Retrieval: The Benefit of PCA and Whitening , 2012, ECCV.

[8]  Sergey Levine,et al.  Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks , 2017, ICML.

[9]  Florent Perronnin,et al.  Large-scale image retrieval with compressed Fisher vectors , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[10]  Gui-Song Xia,et al.  AID: A Benchmark Data Set for Performance Evaluation of Aerial Scene Classification , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[11]  James T. Kwok,et al.  Generalizing from a Few Examples , 2019, ACM Comput. Surv..

[12]  David Stutz,et al.  Neural Codes for Image Retrieval , 2015 .

[13]  Tie-Yan Liu,et al.  Learning to rank for information retrieval , 2009, SIGIR.

[14]  Albert Gordo,et al.  Deep Image Retrieval: Learning Global Representations for Image Search , 2016, ECCV.

[15]  Raquel Urtasun,et al.  Few-Shot Learning Through an Information Retrieval Lens , 2017, NIPS.

[16]  Cordelia Schmid,et al.  Improving Bag-of-Features for Large Scale Image Search , 2010, International Journal of Computer Vision.

[17]  Jon Almazán,et al.  Learning With Average Precision: Training Image Retrieval With a Listwise Loss , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[18]  Koichi Shinoda,et al.  Few-Shot Adaptation for Multimedia Semantic Indexing , 2018, ACM Multimedia.