Spatio-Temporal Prediction of the Epidemic Spread of Dangerous Pathogens Using Machine Learning Methods

Real-time identification of the occurrence of dangerous pathogens is of crucial importance for the rapid execution of countermeasures. For this purpose, spatial and temporal predictions of the spread of such pathogens are indispensable. The R package papros developed by the authors offers an environment in which both spatial and temporal predictions can be made, based on local data using various deterministic, geostatistical regionalisation, and machine learning methods. The approach is presented using the example of a crops infection by fungal pathogens, which can substantially reduce the yield if not treated in good time. The situation is made more difficult by the fact that it is particularly difficult to predict the behaviour of wind-dispersed pathogens, such as powdery mildew (Blumeria graminis f. sp. tritici). To forecast pathogen development and spatial dispersal, a modelling process scheme was developed using the aforementioned R package, which combines regionalisation and machine learning techniques. It enables the prediction of the probability of yield- relevant infestation events for an entire federal state in northern Germany at a daily time scale. To run the models, weather and climate information are required, as is knowledge of the pathogen biology. Once fitted to the pathogen, only weather and climate information are necessary to predict such events, with an overall accuracy of 68% in the case of powdery mildew at a regional scale. Thereby, 91% of the observed powdery mildew events are predicted.

[1]  Ángeles Saavedra,et al.  Weather datasets generated using kriging techniques to calibrate building thermal simulations with TRNSYS , 2016 .

[2]  D. Altman,et al.  Statistics Notes: Diagnostic tests 1: sensitivity and specificity , 1994 .

[3]  D. Shepard A two-dimensional interpolation function for irregularly-spaced data , 1968, ACM National Conference.

[4]  A modified drought index for WMO RA VI , 2011 .

[5]  V. Rossi,et al.  A dynamic simulation model for powdery mildew epidemics on winter wheat , 2003 .

[6]  Ole Winther,et al.  Bayesian Inference for Spatio-temporal Spike-and-Slab Priors , 2015, J. Mach. Learn. Res..

[7]  J. Zadoks A decimal code for the growth stages of cereals , 1974 .

[8]  F van den Bosch,et al.  Disease-weather relationships for powdery mildew and yellow rust on winter wheat. , 2008, Phytopathology.

[9]  Yong Luo,et al.  Dynamics in concentrations of Blumeria graminis f. sp tritici conidia and its relationship to local weather conditions and disease index in wheat , 2011, European Journal of Plant Pathology.

[10]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[11]  P. Fiener,et al.  Comparison and evaluation of spatial interpolation schemes for daily rainfall in data scarce regions , 2012 .

[12]  G. Matheron Principles of geostatistics , 1963 .

[13]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[15]  Stephen R. Delwiche,et al.  Multiple view image analysis of freefalling U.S. wheat grains for damage assessment , 2013 .

[16]  S. Chander,et al.  InfoCrop: A dynamic simulation model for the assessment of crop yields, losses due to pests, and environmental impact of agro-ecosystems in tropical environments. I. Model description , 2006 .

[17]  S. Zhang,et al.  Plant disease recognition based on plant leaf image. , 2015 .

[18]  J. Verreet,et al.  Regional Monitoring for Disease Prediction and Optimization of Plant Protection Measuares: The IPM Wheat Model. , 2000, Plant disease.

[19]  Reza Ehsani,et al.  Field detection of anthracnose crown rot in strawberry using spectroscopy technology , 2017, Comput. Electron. Agric..

[20]  Kristin K. Nicodemus,et al.  Letter to the Editor: On the stability and ranking of predictors from random forest variable importance measures , 2011, Briefings Bioinform..

[21]  Michael Biehl,et al.  Prototype-Based Classification for Image Analysis and Its Application to Crop Disease Diagnosis , 2016, WSOM.

[22]  N. Cressie The origins of kriging , 1990 .

[23]  Agustín Rubio,et al.  Geostatistical modelling of air temperature in a mountainous region of Northern Spain , 2007 .

[24]  Serge Savary,et al.  Simulating multiple pest damage in varying winter wheat production situations , 2008 .

[25]  C. Bernhofer,et al.  Comparison of spatial interpolation methods for the estimation of precipitation distribution in Distrito Federal, Brazil , 2015, Theoretical and Applied Climatology.