The feasibility of co-existence between conventional and genetically modified crops: Using machine learning to analyse the output of simulation models

Simulation models are a commonly used tool for the study of the co-existence of conventional and genetically modified (GM) crops. Among other things, they allow us to investigate the effects of using different crop varieties, cropping systems and farming practices on the levels of adventitious presence of GM material in conventional crops. We propose to use machine learning methods to analyse the output of simulation models to learn co-existence rules that directly link the above mentioned causes and effects. The outputs of the GENESYS model, designed to study the co-existence of conventional and GM oilseed rape crops, were analysed by using the machine learning methods of regression tree induction and relational decision tree induction. Co-existence and adventitious presence of GM material were studied in several contexts, including gene flow between pairs of fields, the interactions of this process with farming practices (cropping systems), and gene flow in the context of an entire field plan. Accurate models were learned, which also make use of the relational aspects of a field plan, using information on the neighboring fields of a field, and the farming practices applied in it. The use of relational decision tree induction to analyse the results of simulation models is a novel approach and holds the promise of learning more general co-existence rules by allowing us to vary the target field within a chosen field plan, as well as to consider completely different field plans at the same time.

[1]  Ian H. Witten,et al.  Induction of model trees for predicting continuous classes , 1996 .

[2]  Stefan Kramer,et al.  Inducing classification and regression trees in first order logic , 2001 .

[3]  Ivan Bratko,et al.  Prolog Programming for Artificial Intelligence , 1986 .

[4]  Antoine Messéan,et al.  Spatial aspects of gene flow between rapeseed varieties and volunteers , 2005 .

[5]  Nathalie Colbach,et al.  GeneSys: a model of the influence of cropping system on gene escape from herbicide tolerant rapeseed crops to rape volunteers: II. Genetic exchanges among volunteer and cropped populations in a small region , 2001 .

[6]  Luc De Raedt,et al.  Attribute-Value Learning Versus Inductive Logic Programming: The Missing Links (Extended Abstract) , 1998, ILP.

[7]  Antoine Messéan,et al.  New case studies on the coexistence of GM and non-GM crops in European agriculture. , 2006 .

[8]  Nathalie Colbach,et al.  GENESYS : a model of the influence of cropping system on gene escape from herbicide tolerant rapeseed crops to rape volunteers. I. Temporal evolution of a population of rapeseed volunteers in a field , 2001 .

[9]  L. D. Raedt,et al.  Three companions for data mining in first order logic , 2001 .

[10]  Sašo Džeroski,et al.  Relations between the oilseed rape volunteer seedbank, and soil factors, weed functional groups and geographical location in the UK , 2008 .

[11]  Stefan Kramer,et al.  Structural Regression Trees , 1996, AAAI/IAAI, Vol. 1.

[12]  Saso Dzeroski,et al.  Using Simulation Models and Data Mining to Study Co-Existence of GM/Non-GM Crops at Regional Level , 2006, EnviroInfo.

[13]  Nathalie Colbach,et al.  Evaluation and use of a spatio-temporal model of cropping system effects on gene escape from transgenic oilseed rape varieties: Example of the GeneSys model applied to three co-existing herbicide tolerance transgenes , 2005 .

[14]  D. Hemming Gene flow and agriculture: relevance for transgenic crops. , 1999 .

[15]  Saso Dzeroski,et al.  First order random forests: Learning relational classifiers with complex aggregates , 2006, Machine Learning.

[16]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[17]  Hendrik Blockeel,et al.  Top-Down Induction of First Order Logical Decision Trees , 1998, AI Commun..

[18]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[19]  Nathalie Colbach,et al.  Sensitivity analyses for a model simulating demography and genotype evolutions with time: Application to GeneSys modelling gene flow between rape seed varieties and volunteers , 2004 .

[20]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[21]  Nathalie Colbach,et al.  GENESYS : A model of the effects of cropping system on gene flow from transgenic rapeseed , 1999 .