Winter Wheat Yield Prediction Using Convolutional Neural Networks from Environmental and Phenological Data

Crop yield forecasting depends on many interactive factors including crop genotype, weather, soil, and management practices. This study analyzes the performance of machine learning and deep learning methods for winter wheat yield prediction using extensive datasets of weather, soil, and crop phenology. We propose a convolutional neural network (CNN) which uses the 1-dimentional convolution operation to capture the time dependencies of environmental variables. The proposed CNN, evaluated along with other machine learning models for winter wheat yield prediction in Germany, outperformed all other models tested. To address the seasonality, weekly features were used that explicitly take soil moisture and meteorological events into account. Our results indicated that nonlinear models such as deep learning models and XGboost are more effective in finding the functional relationship between the crop yield and input data compared to linear models and deep neural networks had a higher prediction accuracy than XGboost. One of the main limitations of machine learning models is their black box property. Therefore, we moved beyond prediction and performed feature selection, as it provides key results towards explaining yield prediction (variable importance by time). As such, our study indicates which variables have the most significant effect on winter wheat yield.

[1]  Volker Meyer,et al.  The effect of soil moisture anomalies on maize yield in Germany , 2017 .

[2]  Salah Sukkarieh,et al.  Machine learning approaches for crop yield prediction and nitrogen status estimation in precision agriculture: A review , 2018, Comput. Electron. Agric..

[3]  Sangram Ganguly,et al.  Data Science for Weather Impacts on Crop Yield , 2020, Frontiers in Sustainable Food Systems.

[4]  Trevor Hastie,et al.  An Introduction to Statistical Learning , 2013, Springer Texts in Statistics.

[5]  Mohsen Shahhosseini,et al.  Maize yield and nitrate loss prediction with machine learning algorithms , 2019, Environmental Research Letters.

[6]  Mladen Todorovic,et al.  Estimation of daily potato crop evapotranspiration using three different machine learning algorithms and four scenarios of available meteorological data , 2020 .

[7]  Hongchun Qu,et al.  Wild blueberry yield prediction using a combination of computer simulation and machine learning algorithms , 2020, Comput. Electron. Agric..

[8]  Lizhi Wang,et al.  Optimizing Selection and Mating in Genomic Selection with a Look-Ahead Approach: An Operations Research Framework , 2019, G3: Genes, Genomes, Genetics.

[9]  L. Shapley A Value for n-person Games , 1988 .

[10]  Özer Çinar,et al.  Comparison of artificial neural network and regression pedotransfer functions for prediction of soil water retention and saturated hydraulic conductivity , 2006 .

[11]  N. Meinshausen,et al.  The effects of climate extremes on global agricultural yields , 2019, Environmental Research Letters.

[12]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[13]  Chih-Fong Tsai,et al.  The distance function effect on k-nearest neighbor classification for medical datasets , 2016, SpringerPlus.

[14]  J. Palta,et al.  Heat Stress in Wheat during Reproductive and Grain-Filling Phases , 2011 .

[15]  I. Nyagumbo,et al.  Evaluating machine learning algorithms for predicting maize yield under conservation agriculture in Eastern and Southern Africa , 2020, SN Applied Sciences.

[16]  A. E. Hoerl,et al.  Ridge regression: biased estimation for nonorthogonal problems , 2000 .

[17]  Weijian Zhang,et al.  Nighttime warming increases winter-sown wheat yield across major Chinese cropping regions , 2017 .

[18]  Marco Bindi,et al.  Sensitivity of European wheat to extreme weather , 2017, Field Crops Research.

[19]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[20]  R. Q. Cannell,et al.  Effects of waterlogging at different stages of development on the growth and yield of winter wheat , 1980 .

[21]  Xiaoguang Chen,et al.  China feels the heat: negative impacts of high temperatures on China's rice sector† , 2018, Australian Journal of Agricultural and Resource Economics.

[22]  R. T. Walczak,et al.  Using Support Vector Machines to Develop Pedotransfer Functions for Water Retention of Soils in Poland , 2008 .

[23]  Chunyan Li,et al.  Design of an integrated climatic assessment indicator (ICAI) for wheat production: A case study in Jiangsu Province, China , 2019, Ecological Indicators.

[24]  Gorka Landeras,et al.  Comparison of artificial neural network models and empirical and semi-empirical equations for daily reference evapotranspiration estimation in the Basque Country (Northern Spain) , 2008 .

[25]  H. Weigel,et al.  Quantifying the response of wheat yields to heat stress: The role of the experimental setup , 2018 .

[26]  J. I. Ortiz-Monasterio,et al.  Extreme heat effects on wheat senescence in India , 2012 .

[27]  I. F. Wardlaw,et al.  A Comparison of the Effect of High Temperature on Grain Development in Wheat and Rice , 1989 .

[28]  Puyu Feng,et al.  Dynamic wheat yield forecasts are improved by a hybrid approach using a biophysical model and machine learning technique , 2020, Agricultural and Forest Meteorology.

[29]  N. Safaei,et al.  Gasoline prices and their relationship to the number of fatal crashes on U.S. roads , 2021 .

[30]  Lance R. Gibson,et al.  Yield Components of Wheat Grown under High Temperature Stress during Reproductive Growth , 1999 .

[31]  Ayalew Kassahun,et al.  Crop yield prediction using machine learning: A systematic literature review , 2020, Comput. Electron. Agric..

[32]  Lizhi Wang,et al.  Crop Yield Prediction Using Deep Neural Networks , 2019, Front. Plant Sci..

[33]  Yasaman Esfandiari,et al.  Applications of Deep Learning in Intelligent Transportation Systems , 2020, Journal of Big Data Analytics in Transportation.

[34]  Lizhi Wang,et al.  Predicting yield performance of parents in plant breeding: A neural collaborative filtering approach , 2020, PloS one.

[35]  Lizhi Wang,et al.  A CNN-RNN Framework for Crop Yield Prediction , 2019, Frontiers in Plant Science.

[36]  Martha C. Anderson,et al.  Comparative assessment of environmental variables and machine learning algorithms for maize yield prediction in the US Midwest , 2020, Environmental Research Letters.

[37]  Muhammed A. Hassan,et al.  Exploring the potential of tree-based ensemble methods in solar radiation modeling , 2017 .

[38]  S. Imandoust,et al.  Application of K-Nearest Neighbor (KNN) Approach for Predicting Economic Events: Theoretical Background , 2013 .

[39]  Frank Ewert,et al.  The implication of irrigation in climate change impact assessment: a European‐wide study , 2015, Global change biology.