Exploratory Hierarchical Clustering for Management Zone Delineation in Precision Agriculture

Precision agriculture has emerged as a research topic over the last ten years. It is concerned with the integration of information technology into agricultural processes, most visibly in the ongoing and growing collection of data in agriculture. Novel ground-based sensors, aerial and satellite imagery, and soil sampling provide large georeferenced data sets with high spatial resolution. These data sets in turn pose the data mining problem of finding novel and useful information in them. One of the key tasks in precision agriculture is management zone delineation: given a data set of georeferenced data records with high spatial resolution, we would like to discover spatially mostly contiguous zones on the field that exhibit similar characteristics within each zone and different characteristics between zones. From a data mining point of view, this task comes down to a variant of spatial clustering with the constraint that the resulting clusters remain spatially mostly contiguous. This article presents a novel approach tailored to the specifics of the available data, which do not allow an existing algorithm to be used. It presents a variant of hierarchical agglomerative clustering in conjunction with a spatial constraint, together with results on the available multi-variate data sets and subsets.
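To make the idea of agglomerative clustering under a spatial constraint concrete, the following is a minimal sketch and not the algorithm developed in the article. It assumes that two records are neighbours when they lie within a distance `max_dist` of each other, that cluster dissimilarity is the squared difference of cluster mean attribute vectors, and that merging stops at a requested number of zones. The function name `delineate_zones` and all of its parameters are hypothetical.

```python
from itertools import combinations


def delineate_zones(coords, values, n_zones, max_dist=1.0):
    """Greedy agglomerative clustering restricted to spatially adjacent clusters.

    coords: list of (x, y) positions; values: list of attribute vectors.
    Two points count as neighbours if they lie within max_dist (an assumption).
    Returns one zone label per point, numbered in order of first appearance.
    """
    n = len(coords)
    members = {i: [i] for i in range(n)}     # cluster id -> point indices
    adjacent = {i: set() for i in range(n)}  # cluster id -> neighbouring cluster ids
    for a, b in combinations(range(n), 2):
        sq_dist = sum((p - q) ** 2 for p, q in zip(coords[a], coords[b]))
        if sq_dist <= max_dist ** 2:
            adjacent[a].add(b)
            adjacent[b].add(a)

    def mean(cluster):
        pts = members[cluster]
        dims = range(len(values[0]))
        return [sum(values[i][d] for i in pts) / len(pts) for d in dims]

    def dissimilarity(c1, c2):
        # Squared Euclidean distance between cluster means (an assumed criterion).
        return sum((x - y) ** 2 for x, y in zip(mean(c1), mean(c2)))

    while len(members) > n_zones:
        # Only spatially adjacent cluster pairs are merge candidates.
        candidates = [(dissimilarity(c1, c2), c1, c2)
                      for c1 in adjacent for c2 in adjacent[c1] if c1 < c2]
        if not candidates:
            break  # remaining clusters are spatially disconnected
        _, keep, drop = min(candidates)
        members[keep].extend(members.pop(drop))
        # Rewire the neighbours of the dropped cluster to the kept one.
        for other in adjacent.pop(drop):
            adjacent[other].discard(drop)
            if other != keep:
                adjacent[other].add(keep)
                adjacent[keep].add(other)

    labels = [None] * n
    for cluster, pts in members.items():
        for i in pts:
            labels[i] = cluster
    relabel = {}
    return [relabel.setdefault(c, len(relabel)) for c in labels]


# Five points on a line: two low-valued stretches separated by one high value.
line_coords = [(0, 0), (1, 0), (2, 0), (3, 0), (4, 0)]
line_values = [[1.0], [1.0], [5.0], [1.0], [1.0]]
print(delineate_zones(line_coords, line_values, n_zones=3))  # prints [0, 0, 1, 2, 2]
```

In the toy example, the two low-valued stretches have identical attributes but end up in different zones because they are not spatially adjacent. An unconstrained clustering of the attribute values alone would group them together, which is the behaviour the contiguity constraint is meant to prevent.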
