Low-Shot Learning for the Semantic Segmentation of Remote Sensing Imagery

Recent advances in computer vision using deep learning with RGB imagery (e.g., object recognition and detection) have been made possible thanks to the development of large annotated RGB image data sets. In contrast, multispectral image (MSI) and hyperspectral image (HSI) data sets contain far fewer labeled images, in part due to the wide variety of sensors used. These annotations are especially limited for semantic segmentation, or pixelwise classification, of remote sensing imagery because it is labor intensive to generate image annotations. Low-shot learning algorithms can make effective inferences despite smaller amounts of annotated data. In this paper, we study low-shot learning using self-taught feature learning for semantic segmentation. We introduce: 1) an improved self-taught feature learning framework for HSI and MSI data and 2) a semisupervised classification algorithm. When these are combined, they achieve the state-of-the-art performance on remote sensing data sets that have little annotated training data available. These low-shot learning frameworks will reduce the manual image annotation burden and improve semantic segmentation performance for remote sensing imagery.

[1]  Gang Wang,et al.  Spectral-spatial classification of hyperspectral image using autoencoders , 2013, 2013 9th International Conference on Information, Communications & Signal Processing.

[2]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[3]  Pascal Vincent,et al.  Stacked Denoising Autoencoders: Learning Useful Representations in a Deep Network with a Local Denoising Criterion , 2010, J. Mach. Learn. Res..

[4]  Ian D. Reid,et al.  RefineNet: Multi-path Refinement Networks for High-Resolution Semantic Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Scott D. Brown,et al.  Advances in wide-area hyperspectral image simulation , 2003, SPIE Defense + Commercial Sensing.

[6]  Yoshua Bengio,et al.  Extracting and composing robust features with denoising autoencoders , 2008, ICML '08.

[7]  Chunhong Pan,et al.  Learning Deep Dictionary for Hyperspectral Image Denoising , 2015, IEICE Trans. Inf. Syst..

[8]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[9]  Zhengrong Zou,et al.  Hyperspectral Imagery Classification Based on Rotation-Invariant Spectral–Spatial Feature , 2014, IEEE Geoscience and Remote Sensing Letters.

[10]  Luc Van Gool,et al.  The Pascal Visual Object Classes (VOC) Challenge , 2010, International Journal of Computer Vision.

[11]  Johannes R. Sveinsson,et al.  Spectral and spatial classification of hyperspectral data using SVMs and morphological profiles , 2008, 2007 IEEE International Geoscience and Remote Sensing Symposium.

[12]  Geoffrey J. Hay,et al.  Geographic Object-Based Image Analysis (GEOBIA): A new name for a new discipline , 2008 .

[13]  Filiberto Pla,et al.  Spectral–Spatial Pixel Characterization Using Gabor Filters for Hyperspectral Image Classification , 2013, IEEE Geoscience and Remote Sensing Letters.

[14]  Camille Couprie,et al.  Learning Hierarchical Features for Scene Labeling , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[15]  Seunghoon Hong,et al.  Learning Deconvolution Network for Semantic Segmentation , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[16]  Jürgen Schmidhuber,et al.  Stacked Convolutional Auto-Encoders for Hierarchical Feature Extraction , 2011, ICANN.

[17]  M. Siegel,et al.  Hyperspectral classification via deep networks and superpixel segmentation , 2015 .

[18]  Yihong Gong,et al.  Linear spatial pyramid matching using sparse coding for image classification , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Honglak Lee,et al.  An Analysis of Single-Layer Networks in Unsupervised Feature Learning , 2011, AISTATS.

[20]  Iasonas Kokkinos,et al.  DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Shuyuan Yang,et al.  Sparse Spatio-Spectral LapSVM With Semisupervised Kernel Propagation for Hyperspectral Image Classification , 2017, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[22]  Garrison W. Cottrell,et al.  Robust classification of objects, faces, and flowers using natural image statistics , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[23]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[24]  Wei Xiong,et al.  Stacked Convolutional Denoising Auto-Encoders for Feature Representation , 2017, IEEE Transactions on Cybernetics.

[25]  Jie Geng,et al.  Hyperspectral image classification via contextual deep learning , 2015, EURASIP Journal on Image and Video Processing.

[26]  Sildomar T. Monteiro,et al.  Spectral angle based unary energy functions for spatial-spectral hyperspectral classification using Markov random fields , 2016, 2016 8th Workshop on Hyperspectral Image and Signal Processing: Evolution in Remote Sensing (WHISPERS).

[27]  John R. Miller,et al.  Imaging chlorophyll fluorescence with an airborne narrow-band multispectral camera for vegetation stress detection , 2009 .

[28]  Bing Zhang,et al.  Independent component analysis for hyperspectral imagery plant classification , 2005, IS&T/SPIE Electronic Imaging.

[29]  Harry N. Gross,et al.  An Advanced Synthetic Image Generation Model and its Application to Multi/Hyperspectral Algorithm Development , 1999 .

[30]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[31]  Antonio J. Plaza,et al.  A New Hybrid Strategy Combining Semisupervised Classification and Unmixing of Hyperspectral Data , 2014, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[32]  Jing Wang,et al.  Independent component analysis-based dimensionality reduction with applications in hyperspectral image analysis , 2006, IEEE Transactions on Geoscience and Remote Sensing.

[33]  Jitendra Malik,et al.  Simultaneous Detection and Segmentation , 2014, ECCV.

[34]  Brahim Chaib-draa,et al.  Parametric Exponential Linear Unit for Deep Convolutional Neural Networks , 2016, 2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA).

[35]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[36]  José M. Bioucas-Dias,et al.  Does independent component analysis play a role in unmixing hyperspectral data? , 2003, IEEE Transactions on Geoscience and Remote Sensing.

[37]  Geoffrey J. Hay,et al.  Geographic object-based image analysis (GEOBIA): emerging trends and future opportunities , 2018 .

[38]  Daniel L. K. Yamins,et al.  Deep Neural Networks Rival the Representation of Primate IT Cortex for Core Visual Object Recognition , 2014, PLoS Comput. Biol..

[39]  Antonio J. Plaza,et al.  Dimensionality reduction and classification of hyperspectral image data using sequences of extended morphological transformations , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[40]  Mi Zhang,et al.  Learning Dual Multi-Scale Manifold Ranking for Semantic Segmentation of High-Resolution Images , 2017, Remote. Sens..

[41]  Markus Gerke,et al.  The ISPRS benchmark on urban object classification and 3D building reconstruction , 2012 .

[42]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[43]  Thomas Blaschke,et al.  Object based image analysis for remote sensing , 2010 .

[44]  Giorgos Mountrakis,et al.  A meta-analysis of remote sensing research on supervised pixel-based land-cover image classification processes: General guidelines for practitioners and future research , 2016 .

[45]  M. F. Baumgardner,et al.  220 Band AVIRIS Hyperspectral Image Data Set: June 12, 1992 Indian Pine Test Site 3 , 2015 .

[46]  Andrew L. Maas Rectifier Nonlinearities Improve Neural Network Acoustic Models , 2013 .

[47]  Michele Volpi,et al.  Semantic segmentation of urban scenes by learning local class interactions , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[48]  Zhou Guo,et al.  On combining multiscale deep learning features for the classification of hyperspectral remote sensing imagery , 2015 .

[49]  David J. Field,et al.  Emergence of simple-cell receptive field properties by learning a sparse code for natural images , 1996, Nature.

[50]  Timothy Dozat,et al.  Incorporating Nesterov Momentum into Adam , 2016 .

[51]  Luis Gómez-Chova,et al.  Semisupervised Image Classification With Laplacian Support Vector Machines , 2008, IEEE Geoscience and Remote Sensing Letters.

[52]  Maryam Imani,et al.  BOUNDARY BASED SUPERVISED CLASSIFICATION OF HYPERSPECTRAL IMAGES WITH LIMITED TRAINING SAMPLES , 2013 .

[53]  Xiaoqiang Lu,et al.  Remote Sensing Image Scene Classification: Benchmark and State of the Art , 2017, Proceedings of the IEEE.

[54]  Xi Zhang,et al.  Learning from Synthetic Data Using a Stacked Multichannel Autoencoder , 2015, 2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA).

[55]  Jon Atli Benediktsson,et al.  Advances in Spectral-Spatial Classification of Hyperspectral Images , 2013, Proceedings of the IEEE.

[56]  Ronan Collobert,et al.  Learning to Refine Object Segments , 2016, ECCV.

[57]  Yubin Lan,et al.  Multispectral imaging systems for airborne remote sensing to support agricultural production management , 2010 .

[58]  Gang Wang,et al.  Deep Learning-Based Classification of Hyperspectral Data , 2014, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[59]  Kate Saenko,et al.  Learning Deep Object Detectors from 3D Models , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[60]  Alain Rakotomamonjy,et al.  Automatic Feature Learning for Spatio-Spectral Image Classification With Sparse SVM , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[61]  T. Sejnowski,et al.  Cone selectivity derived from the responses of the retinal cone mosaic to natural scenes. , 2007, Journal of vision.

[62]  Ronan Collobert,et al.  Learning to Segment Object Candidates , 2015, NIPS.

[63]  Sebastian Thrun,et al.  Text Classification from Labeled and Unlabeled Documents using EM , 2000, Machine Learning.

[64]  Marc'Aurelio Ranzato,et al.  Semi-supervised learning of compact document representations with deep networks , 2008, ICML '08.

[65]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[66]  Bo Du,et al.  Deep Learning for Remote Sensing Data: A Technical Tutorial on the State of the Art , 2016, IEEE Geoscience and Remote Sensing Magazine.

[67]  Andriana Olmos,et al.  A biologically inspired algorithm for the recovery of shading and reflectance images , 2004 .

[68]  GhassemianHassan,et al.  Improving hyperspectral image classification by combining spectral, texture, and shape features , 2015 .

[69]  Rajat Raina,et al.  Self-taught learning: transfer learning from unlabeled data , 2007, ICML '07.

[70]  A. Izenman Reduced-rank regression for the multivariate linear model , 1975 .

[71]  Joydeep Ghosh,et al.  An Active Learning Approach to Hyperspectral Data Classification , 2008, IEEE Transactions on Geoscience and Remote Sensing.

[72]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[73]  Gabriele Moser,et al.  2017 IEEE GRSS Data Fusion Contest: Open Data for Global Multimodal Land Use Classification [Technical Committees] , 2017 .

[74]  Amit Jain,et al.  A multiscale representation including opponent color features for texture recognition , 1998, IEEE Trans. Image Process..

[75]  Garrison W. Cottrell,et al.  Looking around the backyard helps to recognize faces and digits , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[76]  Wei Liu,et al.  Exploring Representativeness and Informativeness for Active Learning , 2019, IEEE Transactions on Cybernetics.

[77]  Bing Liu,et al.  A semi-supervised convolutional neural network for hyperspectral image classification , 2017 .

[78]  Harri Valpola,et al.  From neural PCA to deep unsupervised learning , 2014, ArXiv.

[79]  Koby Crammer,et al.  A theory of learning from different domains , 2010, Machine Learning.

[80]  Ying Wang,et al.  Gated Convolutional Neural Network for Semantic Segmentation in High-Resolution Images , 2017, Remote. Sens..

[81]  William J. Emery,et al.  Object-Based Convolutional Neural Network for High-Resolution Imagery Classification , 2017, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[82]  Antonio J. Plaza,et al.  A Quantitative and Comparative Assessment of Unmixing-Based Feature Extraction Techniques for Hyperspectral Image Classification , 2012, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[83]  Lorenzo Bruzzone,et al.  Extended profiles with morphological attribute filters for the analysis of hyperspectral data , 2010 .

[84]  Rajat Raina,et al.  Efficient sparse coding algorithms , 2006, NIPS.

[85]  Ronald Kemker,et al.  Self-Taught Feature Learning for Hyperspectral Image Classification , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[86]  Mikhail F. Kanevski,et al.  A Survey of Active Learning Algorithms for Supervised Remote Sensing Image Classification , 2011, IEEE Journal of Selected Topics in Signal Processing.

[87]  Christopher Kanan Active Object Recognition with a Space-Variant Retina , 2013 .

[88]  Xueming Qian,et al.  Semantic Annotation of High-Resolution Satellite Images via Weakly Supervised Learning , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[89]  Jon Atli Benediktsson,et al.  A Study on the Effectiveness of Different Independent Component Analysis Algorithms for Hyperspectral Image Classification , 2014, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[90]  Rich Caruana,et al.  Multitask Learning , 1998, Encyclopedia of Machine Learning and Data Mining.

[91]  Dharmendra Singh,et al.  An assessment of independent component analysis for detection of military targets from hyperspectral images , 2011, Int. J. Appl. Earth Obs. Geoinformation.

[92]  Juha Karhunen,et al.  Advances in Independent Component Analysis and Learning Machines , 2015 .

[93]  Albert Rango,et al.  Multispectral Remote Sensing from Unmanned Aircraft: Image Processing Workflows and Applications for Rangeland Environments , 2011, Remote. Sens..

[94]  Jiang Li,et al.  Dimensionality reduction of hyperspectral data using discrete wavelet transform feature extraction , 2002, IEEE Trans. Geosci. Remote. Sens..

[95]  Sepp Hochreiter,et al.  Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs) , 2015, ICLR.

[96]  Jason Weston,et al.  Semisupervised Neural Networks for Efficient Hyperspectral Image Classification , 2010, IEEE Transactions on Geoscience and Remote Sensing.

[97]  Rui Zhang,et al.  Semi-Supervised Hyperspectral Image Classification Using Spatio-Spectral Laplacian Support Vector Machine , 2014, IEEE Geoscience and Remote Sensing Letters.

[98]  Sean R Eddy,et al.  What is dynamic programming? , 2004, Nature Biotechnology.

[99]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[100]  Jon Atli Benediktsson,et al.  Automatic Framework for Spectral–Spatial Classification Based on Supervised Feature Extraction and Morphological Attribute Profiles , 2014, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[101]  J. D. Wegner,et al.  SEMANTIC SEGMENTATION OF AERIAL IMAGES WITH AN ENSEMBLE OF CNNS , 2016, ISPRS Annals of Photogrammetry, Remote Sensing and Spatial Information Sciences.

[102]  Hamid R. Rabiee,et al.  When Pixels Team up: Spatially Weighted Sparse Coding for Hyperspectral Image Classification , 2015, IEEE Geoscience and Remote Sensing Letters.

[103]  Sebastian Thrun,et al.  Is Learning The n-th Thing Any Easier Than Learning The First? , 1995, NIPS.

[104]  Pablo J. Zarco-Tejada,et al.  Thermal and Narrowband Multispectral Remote Sensing for Vegetation Monitoring From an Unmanned Aerial Vehicle , 2009, IEEE Transactions on Geoscience and Remote Sensing.

[105]  Garrison W. Cottrell,et al.  Modeling the Object Recognition Pathway: A Deep Hierarchical Model Using Gnostic Fields , 2015, CogSci.

[106]  Yang Lu,et al.  Hyperspectral Image Classification Based on Three-Dimensional Scattering Wavelet Transform , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[107]  Vladlen Koltun,et al.  Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials , 2011, NIPS.

[108]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[109]  Yansheng Li,et al.  Unsupervised Spectral–Spatial Feature Learning With Stacked Sparse Autoencoder for Hyperspectral Imagery Classification , 2015, IEEE Geoscience and Remote Sensing Letters.

[110]  H. Ghassemian,et al.  Improving hyperspectral image classification by combining spectral, texture, and shape features , 2015 .

[111]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[112]  Terrence J. Sejnowski,et al.  The “independent components” of natural scenes are edge filters , 1997, Vision Research.

[113]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[114]  Ping Zhong,et al.  A Multiple Conditional Random Fields Ensemble Model for Urban Area Detection in Remote Sensing Optical Images , 2007, IEEE Transactions on Geoscience and Remote Sensing.

[115]  Jiasong Zhu,et al.  Discriminative Gabor Feature Selection for Hyperspectral Image Classification , 2013, IEEE Geoscience and Remote Sensing Letters.

[116]  David J Tolhurst,et al.  Independent components of color natural scenes resemble V1 neurons in their spatial and color tuning. , 2004, Journal of neurophysiology.

[117]  Gustavo Camps-Valls,et al.  Semi-Supervised Graph-Based Hyperspectral Image Classification , 2007, IEEE Transactions on Geoscience and Remote Sensing.

[118]  Zhuowen Tu,et al.  Aggregated Residual Transformations for Deep Neural Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[119]  T. W. Anderson Estimating Linear Restrictions on Regression Coefficients for Multivariate Normal Distributions , 1951 .

[120]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[121]  Tapani Raiko,et al.  Semi-supervised Learning with Ladder Networks , 2015, NIPS.

[122]  Lorenzo Bruzzone,et al.  A Novel Transductive SVM for Semisupervised Classification of Remote-Sensing Images , 2006, IEEE Transactions on Geoscience and Remote Sensing.

[123]  Jian Sun,et al.  Identity Mappings in Deep Residual Networks , 2016, ECCV.

[124]  Marc'Aurelio Ranzato,et al.  Efficient Learning of Sparse Representations with an Energy-Based Model , 2006, NIPS.

[125]  Tong Zhang,et al.  A Framework for Learning Predictive Structures from Multiple Tasks and Unlabeled Data , 2005, J. Mach. Learn. Res..

[126]  David W. Aha,et al.  Unsupervised and transfer learning challenge , 2011, The 2011 International Joint Conference on Neural Networks.

[127]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[128]  Ross B. Girshick,et al.  Fast R-CNN , 2015, 1504.08083.

[129]  Junwei Han,et al.  Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images , 2016, IEEE Transactions on Geoscience and Remote Sensing.

[130]  Wojciech Zaremba,et al.  Improved Techniques for Training GANs , 2016, NIPS.

[131]  Nikolaus Kriegeskorte,et al.  Frontiers in Systems Neuroscience Systems Neuroscience , 2022 .

[132]  LinLin Shen,et al.  Three-Dimensional Gabor Wavelets for Pixel-Based Hyperspectral Imagery Classification , 2011, IEEE Transactions on Geoscience and Remote Sensing.

[133]  Lorenzo Bruzzone,et al.  Transductive SVMs for semisupervised classification of hyperspectral data , 2005, Proceedings. 2005 IEEE International Geoscience and Remote Sensing Symposium, 2005. IGARSS '05..

[134]  Zhenhua Wang,et al.  Synthesizing Training Images for Boosting Human 3D Pose Estimation , 2016, 2016 Fourth International Conference on 3D Vision (3DV).

[135]  Antonio M. López,et al.  The SYNTHIA Dataset: A Large Collection of Synthetic Images for Semantic Segmentation of Urban Scenes , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).