Satellite Image Time Series Classification With Pixel-Set Encoders and Temporal Self-Attention

Satellite image time series, bolstered by their growing availability, are at the forefront of an extensive effort towards automated Earth monitoring by international institutions. In particular, large-scale control of agricultural parcels is an issue of major political and economic importance. In this regard, hybrid convolutional-recurrent neural architectures have shown promising results for the automated classification of satellite image time series. We propose an alternative approach in which the convolutional layers are advantageously replaced with encoders operating on unordered sets of pixels to exploit the typically coarse resolution of publicly available satellite images. We also propose to extract temporal features using a bespoke neural architecture based on self-attention instead of recurrent networks. We demonstrate experimentally that our method not only outperforms previous state-of-the-art approaches in terms of precision, but also significantly decreases processing time and memory requirements. Lastly, we release a large open-access annotated dataset as a benchmark for future work on satellite image time series.

[1]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[2]  Yun Shi,et al.  3D Convolutional Neural Networks for Crop Classification with Multi-Temporal Remote Sensing Images , 2018, Remote. Sens..

[3]  Geoffrey I. Webb,et al.  Temporal Convolutional Neural Network for the Classification of Satellite Image Time Series , 2018, Remote. Sens..

[4]  C. Qi Deep Learning on Point Sets for 3 D Classification and Segmentation , 2016 .

[5]  Maria Vakalopoulou,et al.  Detecting Urban Changes with Recurrent Neural Networks from Multitemporal Sentinel-2 Data , 2019, IGARSS 2019 - 2019 IEEE International Geoscience and Remote Sensing Symposium.

[6]  Jun-Sik Kim,et al.  Pixel-Level Matching for Video Object Segmentation Using Convolutional Neural Networks , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[7]  Clement Atzberger,et al.  How much does multi-temporal Sentinel-2 data improve crop type classification? , 2018, Int. J. Appl. Earth Obs. Geoinformation.

[8]  R. Reulke,et al.  Remote Sensing and Spatial Information Sciences , 2005 .

[9]  Marc Rußwurm,et al.  Self-Attention for Raw Optical Satellite Time Series Classification , 2019, ArXiv.

[10]  Marco Körner,et al.  Temporal Vegetation Modelling Using Long Short-Term Memory Networks for Crop Identification from Medium-Resolution Multi-spectral Satellite Images , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[11]  Hui Lin,et al.  Mapping Paddy Rice Using a Convolutional Neural Network (CNN) with Landsat 8 Datasets in the Dongting Lake Area, China , 2018, Remote. Sens..

[12]  B. Wardlow,et al.  Large-area crop mapping using time-series MODIS 250 m NDVI data: An assessment for the U.S. Central Great Plains , 2008 .

[13]  Yoshua Bengio,et al.  Convolutional networks for images, speech, and time series , 1998 .

[14]  Zachary Chase Lipton A Critical Review of Recurrent Neural Networks for Sequence Learning , 2015, ArXiv.

[15]  Dino Ienco,et al.  Land Cover Classification via Multitemporal Spatial Data by Deep Recurrent Neural Networks , 2017, IEEE Geoscience and Remote Sensing Letters.

[16]  Mariana Belgiu,et al.  Sentinel-2 cropland mapping using pixel-based and object-based time-weighted dynamic time warping analysis , 2018 .

[17]  Hang Zhou,et al.  Deep learning based multi-temporal crop classification , 2019, Remote Sensing of Environment.

[18]  Nesrine Chehata,et al.  Time-Space Tradeoff in Deep Learning Models for Crop Classification on Satellite Multi-Spectral Image Time Series , 2019, IGARSS 2019 - 2019 IEEE International Geoscience and Remote Sensing Symposium.

[19]  C. Tucker Red and photographic infrared linear combinations for monitoring vegetation , 1979 .

[20]  Andrew Zisserman,et al.  Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Marc Rußwurm,et al.  Convolutional LSTMs for Cloud-Robust Segmentation of Remote Sensing Imagery , 2018, NIPS 2018.

[22]  Giorgos Mallinis,et al.  A Hidden Markov Models Approach for Crop Classification: Linking Crop Phenology to Time Series of Multi-Sensor Remote Sensing Data , 2015, Remote. Sens..

[23]  Ross B. Girshick,et al.  Focal Loss for Dense Object Detection , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Himanshu Sharma,et al.  A Deep Learning Hybrid CNN Framework Approach for Vegetation Cover Mapping Using Deep Features , 2017, 2017 13th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS).

[25]  Dit-Yan Yeung,et al.  Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting , 2015, NIPS.

[26]  Gérard Dedieu,et al.  Assessment of an Operational System for Crop Type Map Production Using High Temporal and Spatial Resolution Satellite Optical Imagery , 2015, Remote. Sens..

[27]  Geoffrey E. Hinton,et al.  Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[28]  Marc Rußwurm,et al.  Multi-Temporal Land Cover Classification with Sequential Recurrent Encoders , 2018, ISPRS Int. J. Geo Inf..

[29]  Andrew Zisserman,et al.  Convolutional Two-Stream Network Fusion for Video Action Recognition , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  P. Gong,et al.  Efficient corn and soybean mapping with temporal extendability: A multi-year experiment using Landsat imagery , 2014 .

[31]  Guido Lemoine,et al.  Parcel based classification for agricultural mapping and monitoring using multi-temporal satellite image sequences , 2015, 2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[32]  Leonidas J. Guibas,et al.  PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[33]  Lukasz Kaiser,et al.  Attention is All you Need , 2017, NIPS.

[34]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[35]  Matthias Drusch,et al.  Sentinel-2: ESA's Optical High-Resolution Mission for GMES Operational Services , 2012 .

[36]  Nesrine Chehata,et al.  Crop-Rotation Structured Classification using Multi-Source Sentinel Images and LPIS for Crop Type Mapping , 2018, IGARSS 2018 - 2018 IEEE International Geoscience and Remote Sensing Symposium.

[37]  Yoshua Bengio,et al.  Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.

[38]  Nataliia Kussul,et al.  Deep Learning Classification of Land Cover and Crop Types Using Remote Sensing Data , 2017, IEEE Geoscience and Remote Sensing Letters.

[39]  Kaiming He,et al.  Focal Loss for Dense Object Detection , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[40]  Emmanuel Christophe,et al.  ORFEO TOOLBOX : A COMPLETE SOLUTION FOR MAPPING FROM HIGH RESOLUTION SATELLITE IMAGES , 2008 .

[41]  Emile Ndikumana,et al.  Deep Recurrent Neural Network for Agricultural Classification using multitemporal SAR Sentinel-1 for Camargue, France , 2018, Remote. Sens..

[42]  M. J. Pringle,et al.  SUPPORT VECTOR MACHINE CLASSIFICATION OF OBJECT-BASED DATA FOR CROP MAPPING, USING MULTI-TEMPORAL LANDSAT IMAGERY , 2012 .

[43]  Soe W. Myint,et al.  A support vector machine to identify irrigated crop types using time-series Landsat NDVI data , 2015, Int. J. Appl. Earth Obs. Geoinformation.

[44]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[45]  A. Skidmore,et al.  Vegetation phenology from Sentinel-2 and field cameras for a Dutch barrier island , 2018, Remote Sensing of Environment.

[46]  Matthias Bethge,et al.  ImageNet-trained CNNs are biased towards texture; increasing shape bias improves accuracy and robustness , 2018, ICLR.