Environmental noise improves epistasis models of genetic data discovered using a computational evolution system

Common human diseases likely result from nonlinear interactions between multiple DNA sequence variations. One goal of human genetics is to use data mining and machine learning methods to identify combinations of genetic variations that are predictive of discrete measures of health in human population data. "Artificial evolution" approaches loosely based on real biological processes have been developed and applied in this domain, but it has recently been suggested that "computational evolution" approaches which incorporate additional biological and evolutionary complexity into existing algorithms will be more likely to solve problems of interest to biologists and biomedical researchers. Here we introduce a method to evolve compact solutions by adding environmental noise to a dataset during fitness evaluation. In ecological systems a highly specialized organism can fail to thrive as the environment changes. By introducing numerous small changes into training data, i.e. the environment, during evolution we similarly drive selection towards more general solutions. We show that this improves the power of the computational evolution system when modest amounts of noise are used. Furthermore, this method of changing the environment in which fitness is evaluated with small perturbations fits within the computational evolution framework and is an effective method of controlling solution size for problems where the data are likely to be noisy.

[1]  Jason H. Moore,et al.  Symbolic discriminant analysis of microarray data in autoimmune disease , 2002, Genetic epidemiology.

[2]  Jason H. Moore,et al.  The Ubiquitous Nature of Epistasis in Determining Susceptibility to Common Human Diseases , 2003, Human Heredity.

[3]  Jiang Gui,et al.  Symbolic Modeling of Epistasis , 2007, Human Heredity.

[4]  M. Huynen Exploring phenotype space through neutral evolution , 1996, Journal of Molecular Evolution.

[5]  Bill C. White,et al.  Does Complexity Matter? Artificial Evolution, Computational Evolution and the Genetic Analysis of Epistasis in Common Human Diseases. , 2009 .

[6]  M. Huynen,et al.  Smoothness within ruggedness: the role of neutrality in adaptation. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[7]  Jason H. Moore,et al.  Development and Evaluation of an Open-Ended Computational Evolution System for the Genetic Analysis of Susceptibility to Common Human Diseases , 2008, EvoBIO.

[8]  J. Miller,et al.  Guidelines: From artificial evolution to computational evolution: a research agenda , 2006, Nature Reviews Genetics.

[9]  Jason H. Moore,et al.  Genome-Wide Genetic Analysis Using Genetic Programming: The Critical Need for Expert Knowledge , 2007 .

[10]  Marc Ebner,et al.  How neutral networks influence evolvability , 2001, Complex..

[11]  Marc Ebner,et al.  On neutral networks and evolvability , 2001, Proceedings of the 2001 Congress on Evolutionary Computation (IEEE Cat. No.01TH8546).

[12]  C. Wilke,et al.  Robustness and Evolvability in Living Systems , 2006 .