Analysis of time series data on agroecosystem vegetation using predictive clustering trees

Abstract We present an approach to modelling interdependent types of vegetation that support different functions in a managed ecosystem. For optimal management, plants that provide economic output (e.g., crops) and those that support ecological functions (e.g., wild plants or ‘weeds’) should coexist in an agroecosystem. To make progress with understanding how such plant communities interact over time, we analyse paired time series data about the cover of crop and weed vegetation in oilseed rape fields. The percentage crop and weed cover were measured every 7–14 days at 128 sites in the UK, covering a wide range of localities and management regimes. To analyze the data, we first cluster the time course profiles of crop cover (using the k -medoids clustering algorithm and the dynamic time warping distance between time series). The clustering revealed five typical clusters of crop cover profiles that differed in terms of rate of increase, lag phase and maximum value, but were largely independent of the type of crop (winter/spring oil seed rape) and the weed management regime. Cluster membership for each crop cover profile was used as an additional independent variable (attribute) in the predictive modelling analysis that followed. We then constructed predictive clustering trees (a generalized form of decision trees) that predict the weed cover profile (time series) from independent (input) variables that include the crop cover cluster, other crops descriptors and environmental variables. The crop cover cluster was more informative in predicting the weed cover profile than any other input variable, including the type of crop and the crop transgenic status (conventional or genetically modified / herbicide tolerant). The approach was successful in identifying the interdependencies between the two types of vegetation. We envisage that it will have plentiful further practical use in relating and interpreting ecological or environmental time series.

[1]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[2]  Cathy Hawes,et al.  Effects on weed and invertebrate abundance and diversity of herbicide management in genetically modified herbicide-tolerant winter-sown oilseed rape , 2005, Proceedings of the Royal Society B: Biological Sciences.

[3]  Christopher Potter,et al.  Revealing Land Cover Change in California With Satellite Data , 2007 .

[4]  H. Wang,et al.  Scale invariance of daily runoff time series in agricultural watersheds , 2005 .

[5]  T. Warren Liao,et al.  Clustering of time series data - a survey , 2005, Pattern Recognit..

[6]  L. K. Ward,et al.  The role of weeds in supporting biological diversity within crop fields , 2003 .

[7]  Luc De Raedt,et al.  Top-Down Induction of Clustering Trees , 1998, ICML.

[8]  Mike Dale,et al.  Building Models of Ecological Dynamics Using HMM Based Temporal Data Clustering - A Preliminary Study , 2001, IDA.

[9]  David S. Stoffer,et al.  Time series analysis and its applications , 2000 .

[10]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[11]  Jean-François Mari,et al.  Temporal and spatial data mining with second-order hidden markov models , 2006, Soft Comput..

[12]  C. Hawes,et al.  Cumulative impact of GM herbicide-tolerant cropping on arable plants assessed through species-based and functional taxonomies , 2009, Environmental science and pollution research international.

[13]  Nicolas Viovy,et al.  Automatic Classification of Time Series (ACTS): A new clustering method for remote sensing time series , 2000 .

[14]  M. Hill,et al.  Weeds in fields with contrasting conventional and genetically modified herbicide-tolerant crops. I. Effects on abundance and diversity. , 2003, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[15]  Cathy Hawes,et al.  Functional approaches for assessing plant and invertebrate abundance patterns in arable systems , 2009 .

[16]  L G Firbank,et al.  Crop management and agronomic context of the Farm Scale Evaluations of genetically modified herbicide-tolerant crops. , 2003, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[17]  Mike J. May,et al.  Management of genetically modified herbicide–tolerant sugar beet for spring and autumn environmental benefit , 2005, Proceedings of the Royal Society B: Biological Sciences.

[18]  Cathy Hawes,et al.  Arable plant communities as indicators of farming practice , 2010 .

[19]  Cathy Hawes,et al.  Design, analysis and statistical power of the Farm-Scale Evaluations of genetically modified herbicide-tolerant crops , 2003 .

[20]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[21]  Sašo Džeroski,et al.  Relations between the oilseed rape volunteer seedbank, and soil factors, weed functional groups and geographical location in the UK , 2008 .