A Time Series Mining Approach for Agricultural Area Detection

Acquiring meaningful data to be employed in building training sets for classification models is a costly task, both in terms of difficult to find suitable samples as well as their quantity. In this sense, Active Learning (AL) improves the training set building by providing an efficient way to select only essential data to be attached to the training set, consequently reducing its size and even enhancing model's accuracy, when compared to random sample selection. In this paper, we proposed a framework for time series classification in order to monitor sugarcane area in São Paulo, Brazil. The AL approach consisted of selecting seasonal time series information from less than 1 percent of each class’ pixels to build the training set and evaluate this selection by an expert user supported by distance measurements, repeating this process until both distance measurement thresholds were satisfied. In most years, the classification results presented about 90 percent of correlation with official estimates based on both traditional and satellite image analysis methods. This framework can then help Land Use Change (LUC) monitoring as it produced similar results compared to other methods that demands more human and financial resources to be adopted.

[1]  Mikhail F. Kanevski,et al.  A Survey of Active Learning Algorithms for Supervised Remote Sensing Image Classification , 2011, IEEE Journal of Selected Topics in Signal Processing.

[2]  Zhaohui Xue,et al.  Phenology-Driven Land Cover Classification and Trend Analysis Based on Long-term Remote Sensing Image Series , 2014, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[3]  Timothy C. Coburn J. de Gruijter, D. Brus, M. Bierkens, and M. Knotters: Sampling for Natural Resource Monitoring , 2008 .

[4]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[5]  Hugo Carrão,et al.  Contribution of multispectral and multitemporal information from MODIS images to land cover classification , 2008 .

[6]  Shuwen Zhang,et al.  Regional land cover classification from MODIS time-series and geographical data using support vetor machine , 2010, 2010 IEEE Youth Conference on Information, Computing and Telecommunications.

[7]  Irma J. Terpenning,et al.  STL : A Seasonal-Trend Decomposition Procedure Based on Loess , 1990 .

[8]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[9]  André Stumpf,et al.  Spatio–Temporal Clustering and Active Learning for Change Classification in Satellite Image Time Series , 2016, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[10]  Gabriele Moser,et al.  Partially Supervised classification of remote sensing images through SVM-based probability density estimation , 2005, IEEE Transactions on Geoscience and Remote Sensing.

[11]  Francesca Bovolo,et al.  Updating Land-Cover Maps by Classification of Image Time Series: A Novel Change-Detection-Driven Transfer Learning Approach , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[12]  William J. Emery,et al.  SVM Active Learning Approach for Image Classification Using Spatial Information , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[13]  E. Ulian,et al.  Elevated CO2 increases photosynthesis, biomass and productivity, and modifies gene expression in sugarcane. , 2008, Plant, cell & environment.

[14]  William J. Emery,et al.  Using active learning to adapt remote sensing image classifiers , 2011 .

[15]  B. Datt,et al.  On the relationship between training sample size and data dimensionality: Monte Carlo analysis of broadband multi-temporal classification , 2005 .

[16]  Li Wang,et al.  Feature Selection of Time Series MODIS Data for Early Crop Classification Using Random Forest: A Case Study in Kansas, USA , 2015, Remote. Sens..

[17]  Werner G. Müller,et al.  Collecting Spatial Data: Optimum Design of Experiments for Random Fields , 1998 .

[18]  David G. Stork,et al.  Pattern Classification , 1973 .

[19]  Melba M. Crawford,et al.  Active Learning: Any Value for Classification of Remotely Sensed Data? , 2013, Proceedings of the IEEE.

[20]  P. Pinter,et al.  Elevated atmospheric CO2 improved Sorghum plant water status by ameliorating the adverse effects of drought , 2001 .

[21]  Jungho Im,et al.  Support vector machines in remote sensing: A review , 2011 .

[22]  Daniel Alves Aguiar,et al.  Studies on the Rapid Expansion of Sugarcane for Ethanol Production in São Paulo State (Brazil) Using Landsat Data , 2010, Remote. Sens..

[23]  I. Jolliffe Discarding Variables in a Principal Component Analysis. Ii: Real Data , 1973 .

[24]  Cibele Hummel do Amaral,et al.  Principal component analysis applied to a time series of MODIS images: the spatio-temporal variability of the Pantanal wetland, Brazil , 2015, Wetlands Ecology and Management.

[25]  Lorenzo Bruzzone,et al.  A Fast Cluster-Assumption Based Active-Learning Technique for Classification of Remote Sensing Images , 2011, IEEE Transactions on Geoscience and Remote Sensing.

[26]  Heather McNairn,et al.  Multiple kernels learning for classification of agricultural time series data , 2014, 2014 The Third International Conference on Agro-Geoinformatics.

[27]  Ratih Kartika Dewi,et al.  Feature extraction for identification of sugarcane rust disease , 2014, Proceedings of International Conference on Information, Communication Technology and System (ICTS) 2014.

[28]  Y. L. Everingham,et al.  Advanced satellite imagery to classify sugarcane crop characteristics , 2007, Agronomy for Sustainable Development.

[29]  S. Wand,et al.  Responses of wild C4 and C3 grass (Poaceae) species to elevated atmospheric CO2 concentration: a meta‐analytic test of current theories and perceptions , 1999 .

[30]  Ratih Kartika Dewi,et al.  Sugarcane leaf disease detection and severity estimation based on segmented spots image , 2014, Proceedings of International Conference on Information, Communication Technology and System (ICTS) 2014.

[31]  A. Rogers,et al.  Photosynthesis, Productivity, and Yield of Maize Are Not Affected by Open-Air Elevation of CO2 Concentration in the Absence of Drought1[OA] , 2006, Plant Physiology.

[32]  J. A. Marengo Mudanças climáticas globais e seus efeitos sobre a biodiversidade: caracterização do clima atual e definição das alterações climáticas para o território brasileiro ao longo do século XXI , 2006 .

[33]  Giles M. Foody,et al.  Toward intelligent training of supervised image classifications: directing training data acquisition for SVM classification , 2004 .

[34]  M. Canty Image Analysis, Classification, and Change Detection in Remote Sensing , 2006 .

[35]  Taskin Kavzoglu,et al.  A kernel functions analysis for support vector machines for land cover classification , 2009, Int. J. Appl. Earth Obs. Geoinformation.

[36]  D. J. Brus,et al.  Sampling for Natural Resource Monitoring , 2006 .

[37]  Bernardo Friedrich Theodor Rudorff,et al.  Multi‐temporal analysis of MODIS data to classify sugarcane crop , 2006 .

[38]  R. Lunetta,et al.  Land-cover change detection using multi-temporal MODIS NDVI data , 2006 .

[39]  Jiang Li Remote sensing image information mining with HPC cluster and DryadLINQ , 2011, ACM-SE '11.

[40]  Lorenzo Bruzzone,et al.  Batch-Mode Active-Learning Methods for the Interactive Classification of Remote Sensing Images , 2011, IEEE Transactions on Geoscience and Remote Sensing.

[41]  Ian T. Jolliffe,et al.  Discarding Variables in a Principal Component Analysis. I: Artificial Data , 1972 .

[42]  Peijun Sun,et al.  Autumn crop identification using high-spatial-temporal resolution time series data generated by MODIS and Landsat remote sensing images , 2014, 2014 IEEE Geoscience and Remote Sensing Symposium.

[43]  Yang Shao,et al.  An evaluation of time-series smoothing algorithms for land-cover classifications using MODIS-NDVI multi-temporal data , 2016 .

[44]  Fengmei Yao,et al.  Improved maize cultivated area estimation over a large scale combining MODIS–EVI time series data and crop phenological information , 2014 .

[45]  Antonio J. Plaza,et al.  This article has been accepted for inclusion in a future issue of this journal. Content is final as presented, with the exception of pagination. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING 1 Spectral–Spatial Classification of Hyperspectral Data Usi , 2022 .

[46]  William J. Emery,et al.  Active Learning Methods for Remote Sensing Image Classification , 2009, IEEE Transactions on Geoscience and Remote Sensing.

[47]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[48]  P. Beck,et al.  Improved monitoring of vegetation dynamics at very high latitudes: A new method using MODIS NDVI , 2006 .

[49]  Safa Rejichi,et al.  Feature extraction using PCA for VHR satellite image time series spatio-temporal classification , 2015, 2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).