FramePests: A Comprehensive Framework for Crop Pests Modeling and Forecasting

Crop pests are among the greatest threats to food security, generating broad economic, social, and environmental impacts. These pests interact with their hosts and the environment through complex pathways, and it is increasingly common to find professionals from different areas gathering into projects that attempt to deal with this complexity. We propose a framework called FramePests guiding steps and activities for crop pest modeling and forecasting. From theoretical references about carrying out mappings and systematic reviews of the literature, the framework proposes a series of steps leading to a state of science as a knowledge base for modeling tasks. Then, two modeling solutions, based on data and knowledge are used. Finally, the model outputs and performances are compared. The application of the proposed framework was demonstrated for coffee leaf rust modeling, for which we obtained a data-based model built using a gradient boosting algorithm (XGBoost) with a mean absolute error of 7.19% and a knowledge-based model represented by a hierarchical multi-criteria decision structure with an accuracy of 56.03%. A complementary study for our case study allowed us to explore how elements of a data-based model can improve a knowledge-based model, improving its accuracy by 7.07%. and showed that knowledge-based modeling can be an alternative to data-based modeling when the available dataset has approximately 60 instances. Data-based models tend to have better performance, but their replicability is conditioned by the diversity in the dataset used. Knowledge-based models may be simpler but allow expert supervision, and these models are not usually tied to specific sites.

[1]  Jacques Avelino,et al.  The intensity of a coffee rust epidemic is dependent on production situations , 2006 .

[2]  R. Rayner Germination and penetration studies on coffee rust (Hemileia vastatrix B. & Br.). , 1961 .

[3]  Jean-Noël Aubertot,et al.  Injury Profile SIMulator, a Qualitative Aggregative Modelling Framework to Predict Crop Injury Profile as a Function of Cropping Practices, and the Abiotic and Biotic Environment. I. Conceptual Bases , 2013, PloS one.

[4]  Albertus Eskes,et al.  Advances in Coffee Rust Research , 1989 .

[5]  P. Tixier,et al.  Unraveling the complexity of coffee leaf rust behavior and development in different Coffea arabica agro-ecosystems. , 2020, Phytopathology.

[6]  Juan Carlos Corrales,et al.  Estimation of coffee rust infection and growth through two-level classifier ensembles based on expert knowledge , 2018 .

[7]  A. Cristancho La roya del cafeto en Colombia : Impacto manejo y costos del control , 2011 .

[8]  Juan Carlos Corrales,et al.  Rule-based expert system for detection of coffee rust warnings in colombian crops , 2019, J. Intell. Fuzzy Syst..

[9]  S. Savary,et al.  Effects of crop management patterns on coffee rust epidemics , 2004 .

[10]  M. Kogan,et al.  Integrated pest management: historical perspectives and contemporary developments. , 1998, Annual review of entomology.

[11]  Marco Cristancho,et al.  The coffee rust crises in Colombia and Central America (2008–2013): impacts, plausible causes and proposed solutions , 2015, Food Security.

[12]  J. Vandermeer,et al.  Qualitative Dynamics of the Coffee Rust Epidemic: Educating Intuition with Theoretical Ecology , 2014 .

[13]  Xiaojin Zhu,et al.  Introduction to Semi-Supervised Learning , 2009, Synthesis Lectures on Artificial Intelligence and Machine Learning.

[14]  Leo Stroosnijder,et al.  Evaluating quantitative and qualitative models: An application for nationwide water erosion assessment in Ethiopia , 2011, Environ. Model. Softw..

[15]  J. Stenberg,et al.  A Conceptual Framework for Integrated Pest Management. , 2017, Trends in plant science.

[16]  Luiz Henrique Antunes Rodrigues,et al.  Análise da epidemia da ferrugem do cafeeiro com árvore de decisão , 2008 .

[17]  Douglas G. Altman,et al.  Practical statistics for medical research , 1990 .

[18]  J. Waller Coffee rust—epidemiology and control , 1982 .

[19]  Scott Lundberg,et al.  A Unified Approach to Interpreting Model Predictions , 2017, NIPS.

[20]  The future of food and agriculture: Trends and challenges , 2016 .

[21]  R. Rice,et al.  Shade Effects on the Dispersal of Airborne Hemileia vastatrix Uredospores. , 2016, Phytopathology.

[22]  Thomas Reinartz,et al.  CRISP-DM 1.0: Step-by-step data mining guide , 2000 .

[23]  Marion Robert,et al.  CMFDM: A methodology to guide the design of a conceptual model of farmers' decision-making processes , 2016 .

[24]  Laurence V. Madden,et al.  The study of plant disease epidemics , 2007 .

[25]  S. Coakley,et al.  Predicting Stripe Rust Severity on Winter Wheat Using an Improved Method for Analyzing Meteorological and Rust Data , 1988 .

[26]  Dharmendra Naidu,et al.  A comparison of qualitative and quantitative methods of detecting earnings management: evidence from two Fijian private and two Fijian state-owned entities , 2013 .

[27]  Juan Carlos Corrales,et al.  Towards Detecting Crop Diseases and Pest by Supervised Learning , 2015 .

[28]  Browne,et al.  Cross-Validation Methods. , 2000, Journal of mathematical psychology.

[29]  J. Avelino,et al.  Shade tree Chloroleucon eurycyclum promotes coffee leaf rust by reducing uredospore wash-off by rain , 2020 .

[30]  S. McCook,et al.  Global rust belt: Hemileia vastatrix and the ecological integration of world coffee production since 1850 , 2006 .

[31]  J. Avelino,et al.  Shade is conducive to coffee rust as compared to full sun exposure under standardized fruit load conditions , 2012 .

[32]  Juan Carlos Corrales,et al.  Discovering weather periods and crop properties favorable for coffee rust incidence from feature selection approaches , 2020, Comput. Electron. Agric..

[33]  P. Tixier,et al.  Forecast models of coffee leaf rust symptoms and signs based on identified microclimatic combinations in coffee-based agroforestry systems in Costa Rica , 2020 .

[34]  Alexander Herr,et al.  The uncertain impact of climate change on forest ecosystems - How qualitative modelling can guide future research for quantitative model development , 2016, Environ. Model. Softw..

[35]  L. Zambolim,et al.  Current status and management of coffee leaf rust in Brazil , 2016, Tropical Plant Pathology.

[36]  R. Clarke,et al.  Studies on the biology of Hemileia vastatrix Berk. & Br , 1963 .

[37]  Glauco de Souza Rolim,et al.  Machine learning algorithms for forecasting the incidence of Coffea arabica pests and diseases , 2020, International Journal of Biometeorology.

[38]  Juan Carlos Corrales,et al.  Feature selection for classification tasks: Expert knowledge or traditional methods? , 2018, J. Intell. Fuzzy Syst..

[39]  David M. W. Powers,et al.  Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation , 2011, ArXiv.

[40]  Yosef Jabareen,et al.  Building a Conceptual Framework: Philosophy, Definitions, and Procedure , 2009 .

[41]  C. A. A. Meira,et al.  Weather-based coffee leaf rust apparent infection rate modeling , 2018, International Journal of Biometeorology.

[42]  Juan Carlos Corrales,et al.  From Theory to Practice: A Data Quality Framework for Classification Tasks , 2018, Symmetry.

[43]  J. C. Sutton,et al.  Monitoring weather factors in relation to plant disease , 1984 .

[44]  S. Wolfert,et al.  Big Data in Smart Farming – A review , 2017 .

[45]  A. Kushalappa,et al.  Application of Survival Ratio for Monocyclic Process ofHemileia vastatrixin Predicting Coffee Rust Infection Rates , 1983 .

[46]  J. Waller,et al.  Coffee Pests, Diseases and their Management , 2007 .

[47]  James Joseph Biundo,et al.  Analysis of Contingency Tables , 1969 .

[48]  C. Staver,et al.  Coffee agroecosystem performance under full sun, shade, conventional and organic management regimes in Central America , 2011, Agroforestry Systems.

[49]  Cyril Ruckebusch,et al.  Statistical tests for comparison of quantitative and qualitative models developed with near infrared spectral data , 2003 .

[50]  Y. G. Prasad,et al.  Pest monitoring and forecasting. , 2012 .

[51]  Kai Petersen,et al.  Systematic Mapping Studies in Software Engineering , 2008, EASE.

[52]  Massimo Aria,et al.  bibliometrix: An R-tool for comprehensive science mapping analysis , 2017, J. Informetrics.

[53]  Jacob Cohen,et al.  The Equivalence of Weighted Kappa and the Intraclass Correlation Coefficient as Measures of Reliability , 1973 .

[54]  Barbara Kitchenham,et al.  Procedures for Performing Systematic Reviews , 2004 .

[55]  Juan Carlos Corrales,et al.  How to Address the Data Quality Issues in Regression Models: A Guided Process for Data Cleaning , 2018, Symmetry.