Identifying Compound Risk Factors of Disease by Evolutionary Learning of SNP Combinatorial Features

Most diseases arise from complex interactions among multiple factors. Although previous research has tried to identify the causes of disease, existing approaches remain limited in their ability to clarify these complex factors. Here, we present a disease classification model based on evolutionary learning of combinatorial features, using data sets from genetic and cohort studies. We implemented a system for finding combinatorial risk factors and visualizing the results. Our results show that the proposed method not only improves classification accuracy but also identifies biologically meaningful sets of risk factors.
