Classification Analysis of NDVI Time Series in Metric Spaces for Sugarcane Identification

In Brazil, agribusiness is an important task for the economy, since it provides a substantial part of the country’s Gross Domestic Product. Besides that, interest in biofuels has grown, considering they make viable the use of renewable energy. Brazil is the world’s largest producer of sugarcane, which enables a large ethanol production. Thus, to monitor agricultural areas is important to support decision making. However, the amount of generated and stored data about these areas has been increasing in such a way that far exceeds the human capacity to manually analyze and extract information from it. That is why automatic and scalable data mining approaches are necessary. This work focuses on the sugarcane classification task, taking as input NDVI time series extracted from remote sensing images. Existing related works propose to analyze non-metric features spaces using the DTW distance function as a basis. Here we demonstrate that analyzing the multidimensional space with Minkowski distance provides better results, considering a variety of classifiers. kNN using L2 distance performed similarly or better than using DTW. We also demonstrate a data configuration with geolocation for training XGBoost, with results better than state-of-the-art.

[1]  Christophe Rigotti,et al.  Unsupervised Spatiotemporal Mining of Satellite Image Time Series Using Grouped Frequent Sequential Patterns , 2011, IEEE Transactions on Geoscience and Remote Sensing.

[2]  Hamideh Afsarmanesh,et al.  Semi-supervised self-training for decision tree classifiers , 2017, Int. J. Mach. Learn. Cybern..

[3]  Henri Maître,et al.  A Method of Clustering Combination Applied to Satellite Image Analysis , 2007, 14th International Conference on Image Analysis and Processing (ICIAP 2007).

[4]  Nuno Constantino Castro,et al.  Time Series Data Mining , 2009, Encyclopedia of Database Systems.

[5]  Wesley W. Chu,et al.  An index-based approach for similarity search supporting time warping in large sequence databases , 2001, Proceedings 17th International Conference on Data Engineering.

[6]  Shuliang Wang,et al.  Data Mining and Knowledge Discovery , 2005, Mathematical Principles of the Internet.

[7]  J. C. Price,et al.  Estimating leaf area index from satellite data , 1993, IEEE Trans. Geosci. Remote. Sens..

[8]  Andrew R. Post,et al.  Temporal data mining. , 2008, Clinics in laboratory medicine.

[9]  Jurandir Zullo,et al.  SITS for estimating sugarcane production , 2017, 2017 9th International Workshop on the Analysis of Multitemporal Remote Sensing Images (MultiTemp).

[10]  B. Matthews Comparison of the predicted and observed secondary structure of T4 phage lysozyme. , 1975, Biochimica et biophysica acta.

[11]  Chokri Ben Amar,et al.  MLP neural network using constructive training algorithm: application to face recognition and facial expression recognition , 2017, Int. J. Intell. Syst. Technol. Appl..

[12]  Marcela Xavier Ribeiro,et al.  A New Approach to Classify Sugarcane Fields Based on Association Rules , 2018 .

[13]  Eamonn J. Keogh,et al.  Everything you know about Dynamic Time Warping is Wrong , 2004 .

[14]  J. Zullo,et al.  Clustering analysis applied to NDVI/NOAA multitemporal images to improve the monitoring process of sugarcane crops , 2011, 2011 6th International Workshop on the Analysis of Multi-temporal Remote Sensing Images (Multi-Temp).

[15]  Renata Ribeiro do Valle Gonçalves,et al.  Agricultural monitoring using clustering techniques on satellite image time series of low spatial resolution , 2017, 2017 9th International Workshop on the Analysis of Multitemporal Remote Sensing Images (MultiTemp).

[16]  Xi Liu,et al.  Pivot selection for metric-space indexing , 2016, Int. J. Mach. Learn. Cybern..

[17]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[18]  Kim Larsen,et al.  Generalized Naive Bayes Classifiers , 2005, SKDD.

[19]  J. Zullo,et al.  Feature extraction for NDVI AVHRR/NOAA time series classification , 2011, 2011 6th International Workshop on the Analysis of Multi-temporal Remote Sensing Images (Multi-Temp).