Data Mining in Precision Agriculture: Management of Spatial Information

Precision Agriculture is the application of state-of-the-art GPS technology in connection with site-specific, sensor-based treatment of the crop. It can also be described as a data-driven approach to agriculture, which is strongly connected with a number of data mining problems. One of those is also an inherently important task in agriculture: yield prediction. The question is: can a field's yield be predicted in-season using available geo-coded data sets?. In the past, a number of approaches have been proposed towards this problem. Often, a broad variety of regression models for non-spatial data have been used, like regression trees, neural networks and support vector machines. But in a cross-validation learning approach, issues with the assumption of the data records' statistical independence keep emerging. Hence, the geographical location of data records should clearly be considered while establishing a regression model and assessing its predictive performance. This paper gives a short overview of the available data, points out in detail the main issue with the classical learning approaches and presents a novel spatial cross-validation technique to overcome the problems with the classical approach towards the aforementioned yield prediction task.

[1]  Georg Ruß,et al.  Data Mining of Agricultural Yield Data: A Comparison of Regression Models , 2009, ICDM.

[2]  A. Brenning Spatial prediction models for landslide hazards: review, comparison and evaluation , 2005 .

[3]  J. V. Stafford,et al.  Mapping and interpreting the yield variation in cereal crops , 1996 .

[4]  Rudolf Kruse,et al.  Estimation of Neural Network Parameters for Wheat Yield Prediction , 2008, IFIP AI.

[5]  Noel A Cressie,et al.  Statistics for Spatial Data. , 1992 .

[6]  Roger Woodard,et al.  Interpolation of Spatial Data: Some Theory for Kriging , 1999, Technometrics.

[7]  Petra Perner,et al.  Advances in Data Mining , 2002, Lecture Notes in Computer Science.

[8]  D. Griffith Spatial Autocorrelation and Spatial Filtering , 2003 .

[9]  Limin Yang,et al.  A STRATEGY FOR ESTIMATING TREE CANOPY DENSITY USING LANDSAT 7 ETM+ AND HIGH RESOLUTION IMAGES OVER LARGE AREAS , 2001 .

[10]  U. Meier,et al.  Growth stages of mono- and dicotyledonous plants = Entwicklungsstadien mono- und dikotyler Pflanzen = Estadios de las plantas mono- y dicotiledóneas = Stades phénologiques des mono- et dicotylédones cultivées , 1997 .

[11]  Mike Rees,et al.  5. Statistics for Spatial Data , 1993 .

[12]  Berthold Lausen,et al.  Estimating error rates in the classification of paired organs , 2008, Statistics in medicine.

[13]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[14]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[15]  Sven F. Crone,et al.  Forecasting with Computational Intelligence - An Evaluation of Support Vector Regression and Artificial Neural Networks for Time Series Prediction , 2006, The 2006 IEEE International Joint Conference on Neural Network Proceedings.

[16]  Rudolf Kruse,et al.  Data Mining with Neural Networks for Wheat Yield Prediction , 2008, ICDM.

[17]  J. I. Ortiz-Monasterio,et al.  Combining Field Surveys, Remote Sensing, and Regression Trees to Understand Yield Variations in an Irrigated Wheat Landscape , 2005, Agronomy Journal.

[18]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[19]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[20]  Alexei Pozdnoukhov,et al.  Data-driven topo-climatic mapping with machine learning methods , 2009 .

[21]  Mario A. Maggioni,et al.  Learning, Innovation and Growth Within Interconected Clusters: An Agent-Based Approach , 2009 .

[22]  P. Bühlmann Bootstraps for Time Series , 2002 .

[23]  Ojs Jki,et al.  Entwicklungsstadien mono- und dikotyler Pflanzen , 2010 .

[24]  P. Moran Notes on continuous stochastic phenomena. , 1950, Biometrika.

[25]  H. J. Heege,et al.  Prospects and results for optical systems for site-specific on-the-go control of nitrogen-top-dressing in Germany , 2008, Precision Agriculture.