Treating stochasticity of olive-fruit fly's outbreaks via machine learning algorithms

Abstract Olive fruit fly trap measurements are used as one of the indicators for olive grove infestation, and therefore, as a consultation tool on spraying parameters. In this paper, machine learning techniques are used to predict the next olive fruit fly trap measurement, given input knowledge of previous trap measurements as well as an attribute that acts as a correlation model between the temperature and the development of a pest’s population, known as the Degree Day model. This is the first time the Degree Day model is utilized as input in classification algorithms for the prediction of olive fruit fly trap measurements. Various classification algorithms are employed and applied to different environmental settings, in extensive comparative experiments, in order to detect the impact of the latter on olive fruit fly population prediction.

[1]  Xin Gao,et al.  A protein-dependent side-chain rotamer library , 2011, BMC Bioinformatics.

[2]  José del Sagrado,et al.  Olive Fly Infestation Prediction Using Machine Learning Techniques , 2007, CAEPIA.

[3]  Ana L. C. Bazzan,et al.  Balancing Training Data for Automated Annotation of Keywords: a Case Study , 2003, WOB.

[4]  Tom Fawcett,et al.  Robust Classification for Imprecise Environments , 2000, Machine Learning.

[5]  H. Comins,et al.  Simulation of fruit fly population dynamics, with particular reference to the olive fruit fly, Dacus oleae , 1988 .

[6]  Markos Avlonitis,et al.  Complex networks and simulation strategies: An application to olive fruit fly dispersion , 2015, 2015 6th International Conference on Information, Intelligence, Systems and Applications (IISA).

[7]  Herna L. Viktor,et al.  Learning from imbalanced data sets with boosting and data generation: the DataBoost-IM approach , 2004, SKDD.

[8]  Gustavo E. A. P. A. Batista,et al.  A study of the behavior of several methods for balancing machine learning training data , 2004, SKDD.

[9]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[10]  Luigi Ponti,et al.  Effects of climate warming on Olive and olive fly (Bactrocera oleae (Gmelin)) in California and Italy , 2009 .

[11]  Pietro Brunetti,et al.  FlySim: A Cellular Automata Model of BactroceraOleae (Olive Fruit Fly) Infestation and First Simulations , 2006, ACRI.

[12]  Stan Matwin,et al.  Addressing the Curse of Imbalanced Training Sets: One-Sided Selection , 1997, ICML.

[13]  Peter E. Hart,et al.  The condensed nearest neighbor rule (Corresp.) , 1968, IEEE Trans. Inf. Theory.

[14]  A. Kalra,et al.  Estimating soil moisture using remote sensing data: A machine learning approach , 2010 .

[15]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[16]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[17]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[18]  J. Platt Sequential Minimal Optimization : A Fast Algorithm for Training Support Vector Machines , 1998 .

[19]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[20]  J. Ross Quinlan,et al.  Bagging, Boosting, and C4.5 , 1996, AAAI/IAAI, Vol. 1.

[21]  Robert E. Schapire,et al.  The Boosting Approach to Machine Learning An Overview , 2003 .

[22]  Tsuyoshi Murata,et al.  {m , 1934, ACML.

[23]  Zheng Fang,et al.  Identification of microRNA precursors based on random forest with network-level representation method of stem-loop structure , 2011, BMC Bioinformatics.

[24]  Pedro M. Domingos MetaCost: a general method for making classifiers cost-sensitive , 1999, KDD '99.

[25]  I. Tomek,et al.  Two Modifications of CNN , 1976 .

[26]  Shahram Jafari,et al.  An Expert System for Detection of Breast Cancer Using Data Preprocessing and Bayesian Network , 2011 .

[28]  Nathalie Japkowicz,et al.  The Class Imbalance Problem: Significance and Strategies , 2000 .

[29]  Stan Matwin,et al.  Machine Learning for the Detection of Oil Spills in Satellite Radar Images , 1998, Machine Learning.

[30]  David A. Cieslak,et al.  Combating imbalance in network intrusion datasets , 2006, 2006 IEEE International Conference on Granular Computing.

[31]  Andreas Stolcke,et al.  A study in machine learning from imbalanced data for sentence boundary detection in speech , 2006, Comput. Speech Lang..

[32]  L. Torres,et al.  The use of the cumulative degree-days to predict olive fly, Bactrocera oleae (Rossi), activity in traditional olive groves from the northeast of Portugal , 2011, Journal of Pest Science.

[33]  Salvatore Di Gregorio,et al.  Temperature Effects on Olive Fruit Fly Infestation in the FlySim Cellular Automata Model , 2009, IWNC.

[34]  Ernest Fraenkel,et al.  Sequence analysis A hypothesis-based approach for identifying the binding specificity of regulatory proteins from chromatin immunoprecipitation data , 2006 .

[35]  Mark Reynolds,et al.  A simulation modelling approach to forecast establishment and spread of Bactrocera fruit flies , 2012 .

[36]  David Murray,et al.  Estimating invertebrate pest losses in six major Australian grain crops , 2013 .

[37]  Anant Madabhushi,et al.  An active learning based classification strategy for the minority class problem: application to histopathology annotation , 2011, BMC Bioinformatics.

[38]  S. Voulgaris,et al.  Stochastic Modeling and Simulation of Olive Fruit Fly Outbreaks , 2013 .

[39]  F. Zalom,et al.  Olive fruit fly. , 2009 .

[40]  Héctor Corrada Bravo,et al.  Automated classification of bird and amphibian calls using machine learning: A comparison of methods , 2009, Ecol. Informatics.

[41]  Markos Avlonitis,et al.  The Role of Olive Trees Distribution and Fruit Bearing in Olive Fruit Fly Infestation , 2015, HAICTA.

[42]  M. Pappas,et al.  Effect of Relative Humidity on Longevity, Ovarian Maturation, and Egg Production in the Olive Fruit Fly (Diptera: Tephritidae) , 2009 .

[43]  Mark Reynolds,et al.  Web-based simulation of fruit fly to support biosecurity decision-making , 2012, Ecol. Informatics.

[44]  Mantao Xu,et al.  Classification of Imbalanced Data by Using the SMOTE Algorithm and Locally Linear Embedding , 2006, 2006 8th international Conference on Signal Processing.

[45]  Sara Pasquali,et al.  Use of individual-based models for population parameters estimation , 2007 .

[46]  Robert J. McQueen,et al.  Applying machine learning to agricultural data , 1995 .

[47]  George E. Haniotakis,et al.  Olive pest control: present status and prospects. , 2005 .

[48]  Nitesh V. Chawla,et al.  Species distribution modeling and prediction: A class imbalance problem , 2012, 2012 Conference on Intelligent Data Understanding.

[49]  J. Orbach Principles of Neurodynamics. Perceptrons and the Theory of Brain Mechanisms. , 1962 .

[50]  Valeria V. Krzhizhanovskaya,et al.  Machine learning methods for environmental monitoring and flood protection , 2011 .

[51]  Lloyd A. Smith,et al.  An investigation into the use of machine learning for determining oestrus in cows , 1996 .

[52]  Markos Avlonitis,et al.  On the problem of early detection of users interaction outbreaks via stochastic differential models , 2016, Eng. Appl. Artif. Intell..

[53]  A. Stamou,et al.  Stochastic processes and insect outbreak systems: application to olive fruit fly. , 2007 .

[54]  Vasile Palade,et al.  microPred: effective classification of pre-miRNAs for human miRNA gene prediction , 2009, Bioinform..

[55]  Markos Avlonitis,et al.  Environmental Impact on Predicting Olive Fruit Fly Population Using Trap Measurements , 2016, AIAI.

[56]  Lloyd T. Wilson,et al.  Degree-days: An aid in crop and pest management , 1983 .

[57]  Wei Fan,et al.  Bagging , 2009, Encyclopedia of Machine Learning.