Exploration of gene–gene interaction effects using entropy-based methods

Gene–gene interaction may play important roles in complex disease studies, in which interaction effects coupled with single-gene effects are active. Many interaction models have been proposed since the beginning of the last century. However, the existing approaches including statistical and data mining methods rarely consider genetic interaction models, which make the interaction results lack biological or genetic meaning. In this study, we developed an entropy-based method integrating two-locus genetic models to explore such interaction effects. We performed our method to simulated and real data for evaluation. Simulation results show that this method is effective to detect gene–gene interaction and, furthermore, it is able to identify the best-fit model from various interaction models. Moreover, our method, when applied to malaria data, successfully revealed negative epistatic effect between sickle cell anemia and α+-thalassemia against malaria.

[1]  Wentian Li,et al.  A Complete Enumeration and Classification of Two-Locus Disease Models , 1999, Human Heredity.

[2]  D. Hunter Gene–environment interactions in human diseases , 2005, Nature Reviews Genetics.

[3]  K. Weiss,et al.  How many diseases does it take to map a gene with SNPs? , 2000, Nature Genetics.

[4]  Todd Holden,et al.  A flexible computational framework for detecting, characterizing, and interpreting statistical patterns of epistasis in genetic studies of human disease susceptibility. , 2006, Journal of theoretical biology.

[5]  Russell L. Malmberg,et al.  Epistasis for Fitness-Related Quantitative Traits in Arabidopsis thaliana Grown in the Field and in the Greenhouse , 2005, Genetics.

[6]  Robert Tibshirani,et al.  An Introduction to the Bootstrap , 1994 .

[7]  J. Ott,et al.  Trimming, weighting, and grouping SNPs in human case-control association studies. , 2001, Genome research.

[8]  C. Sing,et al.  A combinatorial partitioning method to identify multilocus genotypic partitions that predict quantitative trait variation. , 2001, Genome research.

[9]  K. Lunetta,et al.  Identifying SNPs predictive of phenotype using random forests , 2005, Genetic epidemiology.

[10]  William Shannon,et al.  Detecting epistatic interactions contributing to quantitative traits , 2004, Genetic epidemiology.

[11]  A Merry,et al.  A two‐locus model for the inheritance of a familial disease , 1979, Annals of human genetics.

[12]  Jason H. Moore,et al.  A global view of epistasis , 2005, Nature Genetics.

[13]  H. Cordell Epistasis: what it means, what it doesn't mean, and statistical methods to detect it in humans. , 2002, Human molecular genetics.

[14]  Jason H. Moore,et al.  Routine discovery of complex genetic models using genetic algorithms , 2004, Appl. Soft Comput..

[15]  R. Fisher XV.—The Correlation between Relatives on the Supposition of Mendelian Inheritance. , 1919, Transactions of the Royal Society of Edinburgh.

[16]  G. Church,et al.  Modular epistasis in yeast metabolism , 2005, Nature Genetics.

[17]  L. Penrose,et al.  THE CORRELATION BETWEEN RELATIVES ON THE SUPPOSITION OF MENDELIAN INHERITANCE , 2022 .

[18]  D. Clayton,et al.  Genome-wide association studies: theoretical and practical concerns , 2005, Nature Reviews Genetics.

[19]  Jason H. Moore,et al.  STUDENTJAMA. The challenges of whole-genome approaches to common diseases. , 2004, JAMA.

[20]  Chris S. Haley,et al.  Epistasis: too often neglected in complex trait studies? , 2004, Nature Reviews Genetics.

[21]  Yanbin Jia,et al.  Association study of an SNP combination pattern in the dopaminergic pathway in paranoid schizophrenia: a novel strategy for complex disorders , 2004, Molecular Psychiatry.

[22]  T. Reich,et al.  A perspective on epistasis: limits of models displaying no main effect. , 2002, American journal of human genetics.

[23]  J. H. Moore,et al.  Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer. , 2001, American journal of human genetics.

[24]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[25]  David M. Evans,et al.  Two-Stage Two-Locus Models in Genome-Wide Association , 2006, PLoS genetics.

[26]  Wei-Shiung Yang,et al.  A genome-wide scan using tree-based association analysis for candidate loci related to fasting plasma glucose levels , 2003, BMC Genetics.

[27]  M. Xiong,et al.  Test for interaction between two unlinked loci. , 2006, American journal of human genetics.

[28]  Aravinda Chakravarti,et al.  Phenotype variation in two-locus mouse models of Hirschsprung disease: Tissue-specific interaction between Ret and Ednrb , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[29]  W. Bateson Mendel's Principles of Heredity , 1910, Nature.

[30]  D. Clayton,et al.  A unified stepwise regression procedure for evaluating the relative effects of polymorphisms within a gene using case/control or family data: application to HLA in type 1 diabetes. , 2002, American journal of human genetics.

[31]  P. Donnelly,et al.  Genome-wide strategies for detecting multiple loci that influence complex diseases , 2005, Nature Genetics.

[32]  Christian Gieger,et al.  A common genetic variant in the NOS1 regulator NOS1AP modulates cardiac repolarization , 2006, Nature Genetics.

[33]  Mario Recker,et al.  Negative epistasis between the malaria-protective effects of α+-thalassemia and the sickle cell trait , 2005, Nature Genetics.

[34]  Jason H. Moore,et al.  Multifactor dimensionality reduction software for detecting gene-gene and gene-environment interactions , 2003, Bioinform..

[35]  David V Conti,et al.  A testing framework for identifying susceptibility genes in the presence of epistasis. , 2006, American journal of human genetics.

[36]  L. Cardon,et al.  Association study designs for complex diseases , 2001, Nature Reviews Genetics.