Domain-Adapted Convolutional Networks for Satellite Image Classification: A Large-Scale Interactive Learning Workflow

Satellite imagery often exhibits large spatial extent areas that encompass object classes with considerable variability. This often limits large-scale model generalization with machine learning algorithms. Notably, acquisition conditions, including dates, sensor position, lighting condition, and sensor types, often translate into class distribution shifts introducing complex nonlinear factors and hamper the potential impact of machine learning classifiers. This paper investigates the challenge of exploiting satellite images using convolutional neural networks (CNN) for settlement classification where the class distribution shifts are significant. We present a large-scale human settlement mapping workflow based-off multiple modules to adapt a pretrained CNN to address the negative impact of distribution shift on classification performance. To extend a locally trained classifier onto large spatial extents areas we introduce several submodules: First, a human-in-the-loop element for relabeling of misclassified target domain samples to generate representative examples for model adaptation; second, an efficient hashing module to minimize redundancy and noisy samples from the mass-selected examples; and third, a novel relevance ranking module to minimize the dominance of source example on the target domain. The workflow presents a novel and practical approach to achieve large-scale domain adaptation with binary classifiers that are based-off CNN features. Experimental evaluations are conducted on areas of interest that encompass various image characteristics, including multisensors, multitemporal, and multiangular conditions. Domain adaptation is assessed on source–target pairs through the transfer loss and transfer ratio metrics to illustrate the utility of the workflow.

[1]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[2]  David Bull,et al.  A supervised hierarchical segmentation of remote-sensing images using a committee of multi-scale convolutional neural networks , 2016 .

[3]  Shiming Xiang,et al.  Efficient Multiple Feature Fusion With Hashing for Hyperspectral Imagery Classification: A Comparative Study , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[4]  Matthew D. Zeiler ADADELTA: An Adaptive Learning Rate Method , 2012, ArXiv.

[5]  B. Mihai,et al.  Mapping Land Cover Using Remote Sensing Data and GIS Techniques: A Case Study of Prahova Subcarpathians , 2016 .

[6]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[7]  Mikhail F. Kanevski,et al.  SVM-Based Boosting of Active Learning Strategies for Efficient Domain Adaptation , 2012, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[8]  Yoshua Bengio,et al.  Domain Adaptation for Large-Scale Sentiment Classification: A Deep Learning Approach , 2011, ICML.

[9]  Pierre Alliez,et al.  Convolutional Neural Networks for Large-Scale Remote-Sensing Image Classification , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[10]  G. Camps-Valls,et al.  Spectral alignment of multi-temporal cross-sensor images with automated kernel canonical correlation analysis , 2015 .

[11]  Anil M. Cheriyadat,et al.  Image Based Characterization of Formal and Informal Neighborhoods in an Urban Landscape , 2012, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[12]  Daniel Marcu,et al.  Domain Adaptation for Statistical Classifiers , 2006, J. Artif. Intell. Res..

[13]  Geoffrey E. Hinton,et al.  Stochastic Neighbor Embedding , 2002, NIPS.

[14]  Huanxin Zou,et al.  Transfer Sparse Subspace Analysis for Unsupervised Cross-View Scene Model Adaptation , 2016, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[15]  Lorenzo Bruzzone,et al.  Hashing-Based Scalable Remote Sensing Image Search and Retrieval in Large Archives , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[16]  Shiyong Cui,et al.  Pattern Retrieval in Large Image Databases Using Multiscale Coarse-to-Fine Cascaded Active Learning , 2014, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[17]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[18]  Zoubin Ghahramani,et al.  Deep Bayesian Active Learning with Image Data , 2017, ICML.

[19]  Melba M. Crawford,et al.  Adaptive Classification for Hyperspectral Image Data Using Manifold Regularization Kernel Machines , 2010, IEEE Transactions on Geoscience and Remote Sensing.

[20]  Gustavo Camps-Valls,et al.  Semisupervised Manifold Alignment of Multimodal Remote Sensing Images , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[21]  Lorenzo Bruzzone,et al.  Unsupervised retraining of a maximum-likelihood classifier for the analysis of multitemporal remote sensing images , 1999, Remote Sensing.

[22]  Geoffrey E. Hinton,et al.  Restricted Boltzmann machines for collaborative filtering , 2007, ICML '07.

[23]  Anil M. Cheriyadat,et al.  Accelerating satellite image based large-scale settlement detection with GPU , 2012, BigSpatial '12.

[24]  Yoshua Bengio,et al.  Practical Recommendations for Gradient-Based Training of Deep Architectures , 2012, Neural Networks: Tricks of the Trade.

[25]  Melba M. Crawford,et al.  Domain Adaptation With Preservation of Manifold Geometry for Hyperspectral Image Classification , 2016, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[26]  Trevor Darrell,et al.  Long-term recurrent convolutional networks for visual recognition and description , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Fabio Pacifici,et al.  Understanding angular effects in VHR imagery and their significance for urban land-cover model portability: A study of two multi-angle in-track image sequences , 2015 .

[28]  Nello Cristianini,et al.  Kernel Methods for Pattern Analysis , 2004 .

[29]  R. R. Vatsavai,et al.  Complex settlement pattern extraction with multi-instance learning , 2013, Joint Urban Remote Sensing Event 2013.

[30]  Bernhard Schölkopf,et al.  Cost-Sensitive Active Learning With Lookahead: Optimizing Field Surveys for Remote Sensing Data Classification , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[31]  Ivan Laptev,et al.  Learning and Transferring Mid-level Image Representations Using Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[32]  Geoffrey E. Hinton A Practical Guide to Training Restricted Boltzmann Machines , 2012, Neural Networks: Tricks of the Trade.

[33]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[34]  Y. Nesterov A method for unconstrained convex minimization problem with the rate of convergence o(1/k^2) , 1983 .

[35]  Kristen Grauman,et al.  Kernelized locality-sensitive hashing for scalable image search , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[36]  Yoram Singer,et al.  Adaptive Subgradient Methods for Online Learning and Stochastic Optimization , 2011, J. Mach. Learn. Res..

[37]  Lorenzo Bruzzone,et al.  Definition of Effective Training Sets for Supervised Classification of Remote Sensing Images by a Novel Cost-Sensitive Active Learning Method , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[38]  Fan Zhang,et al.  Deep Convolutional Neural Networks for Hyperspectral Image Classification , 2015, J. Sensors.

[39]  Jamie Sherrah,et al.  Fully Convolutional Networks for Dense Semantic Labelling of High-Resolution Aerial Imagery , 2016, ArXiv.

[40]  Lorenzo Bruzzone,et al.  Kernel-Based Domain-Invariant Feature Selection in Hyperspectral Images for Transfer Learning , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[41]  Sungroh Yoon,et al.  Near-Data Processing for Machine Learning , 2016, ArXiv.

[42]  Melba M. Crawford,et al.  Manifold learning based feature extraction for classification of hyper-spectral data , 2013 .

[43]  Lorenzo Bruzzone,et al.  A Novel Approach to the Selection of Spatially Invariant Features for the Classification of Hyperspectral Images With Improved Generalization Capability , 2009, IEEE Transactions on Geoscience and Remote Sensing.

[44]  Jamie Sherrah,et al.  Semantic Labeling of Aerial and Satellite Imagery , 2016, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[45]  Francesca Bovolo,et al.  A Novel Domain Adaptation Bayesian Classifier for Updating Land-Cover Maps With Class Differences in Source and Target Domains , 2012, IEEE Transactions on Geoscience and Remote Sensing.

[46]  Gabriele Moser,et al.  Multimodal Classification of Remote Sensing Images: A Review and Future Directions , 2015, Proceedings of the IEEE.

[47]  Melba M. Crawford,et al.  Active Learning: Any Value for Classification of Remotely Sensed Data? , 2013, Proceedings of the IEEE.

[48]  Ishwar K. Sethi,et al.  Confidence-based active learning , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[49]  Sebastian Ruder,et al.  An overview of gradient descent optimization algorithms , 2016, Vestnik komp'iuternykh i informatsionnykh tekhnologii.

[50]  Melba M. Crawford,et al.  Manifold-Learning-Based Feature Extraction for Classification of Hyperspectral Data: A Review of Advances in Manifold Learning , 2014, IEEE Signal Processing Magazine.

[51]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[52]  Luis Gómez-Chova,et al.  Graph Matching for Adaptation in Remote Sensing , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[53]  Koen E. A. van de Sande,et al.  Empowering Visual Categorization With the GPU , 2011, IEEE Transactions on Multimedia.

[54]  Naif Alajlan,et al.  Large-Scale Image Classification Using Active Learning , 2014, IEEE Geoscience and Remote Sensing Letters.