Application of spatio-temporal data in site-specific maize yield prediction with machine learning methods

In order to meet the requirements of sustainability and to determine yield drivers and limiting factors, it is now more likely that traditional yield modelling will be carried out using artificial intelligence (AI). The aim of this study was to predict maize yields using AI that uses spatio-temporal training data. The paper has advanced a new method of maize yield prediction, which is based on spatio-temporal data mining. To find the best solution, various models were used: counter-propagation artificial neural networks (CP-ANNs), XY-fused Querynetworks (XY-Fs), supervised Kohonen networks (SKNs), neural networks with Rectangular Linear Activations (ReLU), extreme gradient boosting (XGBoost), support-vector machine (SVM), and different subsets of the independent variables in five vegetation periods. Input variables for modelling included: soil parameters (pH, P2O5, K2O, Zn, clay content, ECa, draught force, Cone index), micro-relief averages, and meteorological parameters for the 63 treatment units in a 15.3 ha research field. The best performing method (XGBoost) reached 92.1% and 95.3% accuracy on the training and the test sets. Additionally, a novel method was introduced to treat individual units in a lattice system. The lattice-based smoothing performed an additional increase in Area under the curve (AUC) to 97.5% over the individual predictions of the XGBoost model. The models were developed using 48 different subsets of variables to determine which variables consistently contributed to prediction accuracy. By comparing the resulting models, it was shown that the best regression model was Extreme Gradient Boosting Trees, with 92.1% accuracy (on the training set). In addition, the method calculates the influence of the spatial distribution of site-specific soil fertility on maize grain yields. This paper provides a new method of spatio-temporal data analyses, taking the most important influencing factors on maize yields into account.

[1]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[2]  L. Buydens,et al.  Supervised Kohonen networks for classification problems , 2006 .

[3]  Mahdi Vasighi,et al.  A MATLAB toolbox for Self Organizing Maps and supervised neural network learning strategies , 2012 .

[4]  Rudolf Kruse,et al.  Feature Selection for Wheat Yield Prediction , 2009, SGAI Conf..

[5]  Yueling Zhao,et al.  The application of the spatio-temporal data mining algorithm in maize yield prediction , 2013, Math. Comput. Model..

[6]  M. Meirvenne,et al.  Optimum three-point linkage set up for improving the quality of soil spectra and the accuracy of soil phosphorus measured using an on-line visible and near infrared sensor , 2009 .

[7]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[8]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[9]  Miklós Neményi,et al.  An application of tillage force mapping as a cropping management tool , 2006 .

[10]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[11]  C. Folberth,et al.  Spatio-temporal downscaling of gridded crop model yield estimates based on machine learning , 2019, Agricultural and Forest Meteorology.

[12]  H. Ramon,et al.  On-line measurement of some selected soil properties using a VIS–NIR sensor , 2007 .

[13]  Teuvo Kohonen,et al.  Self-Organizing Maps, Third Edition , 2001, Springer Series in Information Sciences.

[14]  Vipin Kumar,et al.  Introduction to Data Mining , 2022, Data Mining and Machine Learning Applications.

[15]  Teuvo Kohonen,et al.  Self-organized formation of topologically correct feature maps , 2004, Biological Cybernetics.

[16]  T. Kohonen Self-organized formation of topographically correct feature maps , 1982 .

[17]  A. Benczúr,et al.  Prediction and characterization of human ageing-related proteins by using machine learning , 2018, Scientific Reports.

[18]  Yuxin Miao,et al.  Identifying important factors influencing corn yield and grain quality variability using artificial neural networks , 2006, Precision Agriculture.

[19]  Xanthoula Eirini Pantazi,et al.  Wheat yield prediction using machine learning and advanced sensing techniques , 2016, Comput. Electron. Agric..

[20]  Jing Liu,et al.  Neural networks for setting target corn yields , 2000 .

[21]  Georg Ruß,et al.  Data Mining of Agricultural Yield Data: A Comparison of Regression Models , 2009, ICDM.

[22]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[23]  T. Iizumi,et al.  Global crop yield forecasting using seasonal climate information from a multi-model ensemble , 2018, Climate Services.

[24]  Henrik Skov Midtiby,et al.  Plant species classification using deep convolutional neural network , 2016 .

[25]  Miklós Neményi,et al.  Effects of soil compaction on cereal yield , 2017 .

[26]  Andreas Kamilaris,et al.  Deep learning in agriculture: A survey , 2018, Comput. Electron. Agric..

[27]  Juan Frausto Solís,et al.  A New Method for Optimal Cropping Pattern , 2009, MICAI.

[28]  András A. Benczúr,et al.  Machine Learning Based Session Drop Prediction in LTE Networks and Its SON Aspects , 2015, 2015 IEEE 81st Vehicular Technology Conference (VTC Spring).

[29]  Minghua Zhang,et al.  Simulation and Prediction of Soybean Growth and Development under Field Conditions , 2010 .

[30]  John P. Fulton,et al.  Integration of high resolution remotely sensed data and machine learning techniques for spatial prediction of soil properties and corn yield , 2018, Comput. Electron. Agric..

[31]  J. Zupan,et al.  Neural Networks in Chemistry , 1993 .

[32]  J. Besag Spatial Interaction and the Statistical Analysis of Lattice Systems , 1974 .

[33]  Razvan Pascanu,et al.  On the Number of Linear Regions of Deep Neural Networks , 2014, NIPS.

[34]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[35]  Marcel Salathé,et al.  Using Deep Learning for Image-Based Plant Disease Detection , 2016, Front. Plant Sci..

[36]  Vladimir Vapnik,et al.  Support-vector networks , 2004, Machine Learning.

[37]  Junliang Fan,et al.  Comparison of Support Vector Machine and Extreme Gradient Boosting for predicting daily global solar radiation using temperature and precipitation in humid subtropical climates: A case study in China , 2018 .

[38]  F. Anctil,et al.  Site-specific early season potato yield forecast by neural network in Eastern Canada , 2011, Precision Agriculture.