Hypergraph Supervised Search for Inferring Multiple Epistatic Interactions with Different Orders

Nonlinear interaction effects among single nucleotide polymorphisms (SNPs), known as epistatic interactions, are receiving increasing attention as a way to understand the mechanisms underlying susceptibility to complex diseases. Although many methods have been proposed to detect them, most focus only on pairwise epistatic interactions. In this study, a Hypergraph Supervised Search (HgSS) is developed, based on the co-information measure, to infer multiple epistatic interactions of different orders at a substantially reduced time cost. The co-information measure is first used to exhaustively quantify the interaction effects of low-order SNP combinations, as well as the main effects of individual SNPs. The most highly suspected SNP combinations and SNPs are then used to construct a hypergraph. Analyzing this hypergraph in depth may reveal clues to the genetic architecture of complex diseases. Experiments are performed on both simulated and real data sets, and the results suggest that HgSS is promising for inferring multiple epistatic interactions of different orders.
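To make the first two steps concrete, the following is a minimal Python sketch of scoring low-order SNP combinations with co-information and keeping the strongest ones as hyperedges. It is an illustration under stated assumptions, not the HgSS implementation: the function names, the absolute-value threshold, and the toy data are all hypothetical, and the abstract does not specify how "highly suspected" combinations are selected. Co-information is computed here as the alternating sum of joint entropies over all non-empty subsets, a convention under which a purely synergistic (XOR-like) interaction scores negative.

```python
from collections import Counter
from itertools import combinations
from math import log2


def entropy(columns):
    """Shannon entropy (in bits) of the joint distribution of the columns."""
    n = len(columns[0])
    counts = Counter(zip(*columns))
    return -sum((c / n) * log2(c / n) for c in counts.values())


def co_information(columns):
    """Alternating sum of joint entropies over all non-empty subsets.

    For two columns this reduces to mutual information:
    H(X) + H(Y) - H(X, Y).
    """
    total = 0.0
    for k in range(1, len(columns) + 1):
        for subset in combinations(columns, k):
            total -= (-1) ** k * entropy(subset)
    return total


def build_hypergraph(genotypes, phenotype, max_order=2, threshold=0.1):
    """Score every SNP combination up to max_order against the phenotype.

    Returns a dict mapping frozenset(SNP names) -> co-information score.
    Keeping combinations with |score| >= threshold is an assumption made
    for this sketch, not a rule taken from the paper.
    """
    hyperedges = {}
    for order in range(1, max_order + 1):
        for combo in combinations(sorted(genotypes), order):
            cols = [genotypes[name] for name in combo] + [phenotype]
            score = co_information(cols)
            if abs(score) >= threshold:
                hyperedges[frozenset(combo)] = score
    return hyperedges


# Toy data (hypothetical): the phenotype is the XOR of snp_a and snp_b,
# so neither SNP has a main effect; snp_c is unrelated to the phenotype.
genotypes = {
    "snp_a": [0, 0, 0, 0, 1, 1, 1, 1],
    "snp_b": [0, 0, 1, 1, 0, 0, 1, 1],
    "snp_c": [0, 1, 0, 1, 0, 1, 0, 1],
}
phenotype = [a ^ b for a, b in zip(genotypes["snp_a"], genotypes["snp_b"])]

hypergraph = build_hypergraph(genotypes, phenotype)
for snps, score in hypergraph.items():
    print(sorted(snps), round(score, 3))
```

On this toy data only the pair of `snp_a` and `snp_b` survives the threshold, even though neither SNP shows a main effect on its own, which is the kind of signal a single-SNP scan would miss. Exhaustive scoring grows combinatorially with `max_order`, which is why the sketch, like the abstract, restricts it to low-order combinations.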
