Can Neural Network Constraints in GP Provide Power to Detect Genes Associated with Human Disease?

A major goal of human genetics is the identification of susceptibility genes associated with common, complex diseases. Identifying gene-gene and gene-environment interactions which comprise the genetic architecture for a majority of common diseases is a difficult challenge. To this end, novel computational approaches have been applied to studies of human disease. Previously, a GP neural network (GPNN) approach was employed. Although the GPNN method has been quite successful, a clear comparison of GPNN and GP alone to detect genetic effects has not been made. In this paper, we demonstrate that using NN evolved by GP can be more powerful than GP alone. This is most likely due to the confined search space of the GPNN approach, in comparison to a free form GP. This study demonstrates the utility of using GP to evolve NN in studies of the genetics of common, complex human disease.

[1]  Jason H. Moore,et al.  Multifactor dimensionality reduction software for detecting gene-gene and gene-environment interactions , 2003, Bioinform..

[2]  Jason H. Moore,et al.  Power of multifactor dimensionality reduction for detecting gene‐gene interactions in the presence of genotyping error, missing data, phenocopy, and genetic heterogeneity , 2003, Genetic epidemiology.

[3]  William Shannon,et al.  Detecting epistatic interactions contributing to quantitative traits , 2004, Genetic epidemiology.

[4]  John R. Koza,et al.  Genetic programming 2 - automatic discovery of reusable programs , 1994, Complex Adaptive Systems.

[5]  Jason H. Moore,et al.  Symbolic discriminant analysis of microarray data in autoimmune disease , 2002, Genetic epidemiology.

[6]  John R. Koza,et al.  Genetic generation of both the weights and architecture for a neural network , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.

[7]  Jason H. Moore,et al.  Genetic Programming Neural Networks as a Bioinformatics Tool for Human Genetics , 2004, GECCO.

[8]  Jason H. Moore,et al.  Symbolic Discriminant Analysis for Mining Gene Expression Patterns , 2001, ECML.

[9]  C Kooperberg,et al.  Sequence Analysis Using Logic Regression , 2001, Genetic epidemiology.

[10]  Melanie Mitchell,et al.  An introduction to genetic algorithms , 1996 .

[11]  Riccardo Poli,et al.  Genetic and Evolutionary Computation – GECCO 2004 , 2004, Lecture Notes in Computer Science.

[12]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[13]  Daniel E. Weeks,et al.  The Complexity of Linkage Analysis with Neural Networks , 2001, Human Heredity.

[14]  Jason H. Moore,et al.  Cross Validation Consistency for the Assessment of Genetic Programming Results in Microarray Studies , 2003, EvoWorkshops.

[15]  J. Ott,et al.  Neural networks and disease association studies. , 2001, American journal of medical genetics.

[16]  T. Hastie,et al.  Classification of gene microarrays by penalized logistic regression. , 2004, Biostatistics.

[17]  D. Tregouet,et al.  Automated detection of informative combined effects in genetic association studies of complex traits. , 2003, Genome research.

[18]  C. Sing,et al.  A combinatorial partitioning method to identify multilocus genotypic partitions that predict quantitative trait variation. , 2001, Genome research.

[19]  T. Reich,et al.  A perspective on epistasis: limits of models displaying no main effect. , 2002, American journal of human genetics.

[20]  J. H. Moore,et al.  Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer. , 2001, American journal of human genetics.

[21]  Luc De Raedt,et al.  Machine Learning: ECML 2001 , 2001, Lecture Notes in Computer Science.

[22]  Jason H. Moore,et al.  Application Of Genetic Algorithms To The Discovery Of Complex Models For Simulation Studies In Human Genetics , 2002, GECCO.

[23]  Scott M. Williams,et al.  New strategies for identifying gene-gene interactions in hypertension , 2002, Annals of medicine.

[24]  Sara A. Solla,et al.  Multi-Locus Nonparametric Linkage Analysis of Complex Trait Loci with Neural Networks , 1998, Human Heredity.

[25]  S. Kardia,et al.  Context-dependent genetic effects in hypertension , 2000, Current hypertension reports.

[26]  Bill C White,et al.  Optimization of neural network architecture using genetic programming improves detection and modeling of gene-gene interactions in studies of human diseases , 2003, BMC Bioinformatics.