Exploring Google Earth Engine Platform for Big Data Processing: Classification of Multi-Temporal Satellite Imagery for Crop Mapping

Many applied problems in agricultural monitoring and food security require reliable crop maps at national or global scale. Large-scale crop mapping requires processing and managing large amounts of heterogeneous satellite imagery acquired by various sensors, which leads to a “Big Data” problem. The main objective of this study is to explore the efficiency of the Google Earth Engine (GEE) platform for classifying multi-temporal satellite imagery, with the potential to apply the platform at a larger scale (e.g. country level) and to multiple sensors (e.g. Landsat-8 and Sentinel-2). In particular, multiple state-of-the-art classifiers available in the GEE platform are compared to produce a high-resolution (30 m) crop classification map for a large territory (~28,100 km² and 1.0 M ha of cropland). Though this study does not involve large volumes of data, it does address how efficiently the GEE platform executes the complex satellite data processing workflows required by large-scale applications such as crop mapping. The study discusses the strengths and weaknesses of the classifiers, assesses the accuracies that can be achieved with different classifiers for the Ukrainian landscape, and compares them to a benchmark neural network classifier developed in our previous studies. The study is carried out for the Joint Experiment of Crop Assessment and Monitoring (JECAM) test site in Ukraine, covering the Kyiv region (north of Ukraine), in 2013. We found that GEE performs very well in providing access to remote sensing products through the cloud platform and in pre-processing; however, in terms of classification accuracy, the neural network based approach outperformed the support vector machine (SVM), decision tree and random forest classifiers available in GEE.
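
Comparing classifiers by accuracy, as described above, is commonly done with a confusion matrix built from reference (ground) samples. The following is a minimal sketch of that calculation in plain Python, not the authors' code: the function name `accuracy_metrics`, the two-class layout and the sample counts are all hypothetical and chosen only for illustration.

```python
def accuracy_metrics(matrix):
    """Compute accuracy measures from a square confusion matrix.

    Assumed layout: matrix[i][j] is the number of reference samples of
    class i that the classifier assigned to class j.
    """
    n = len(matrix)
    total = sum(sum(row) for row in matrix)
    correct = sum(matrix[i][i] for i in range(n))
    row_sums = [sum(row) for row in matrix]                      # reference totals
    col_sums = [sum(matrix[i][j] for i in range(n)) for j in range(n)]  # predicted totals

    overall = correct / total
    # Producer's accuracy: share of reference samples of a class that were found.
    producers = [matrix[i][i] / row_sums[i] if row_sums[i] else float("nan")
                 for i in range(n)]
    # User's accuracy: share of samples mapped to a class that truly belong to it.
    users = [matrix[j][j] / col_sums[j] if col_sums[j] else float("nan")
             for j in range(n)]
    # Cohen's kappa: agreement corrected for chance agreement.
    expected = sum(r * c for r, c in zip(row_sums, col_sums)) / (total * total)
    kappa = (overall - expected) / (1 - expected) if expected != 1 else float("nan")

    return {"overall": overall, "producers": producers,
            "users": users, "kappa": kappa}


# Hypothetical counts for two classes (e.g. "cropland" and "other"); not data from the study.
example = [[50, 10],
           [5, 35]]
metrics = accuracy_metrics(example)
print(metrics)
```

Running the same function on the confusion matrix of each classifier gives directly comparable figures; overall accuracy alone can hide poor performance on rare crop classes, which is why the per-class values are returned as well.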
