Computing Empirical P-Values for Estimating Gene-Gene Interactions in Genome-Wide Association Studies: A Parallel Computing Approach

In complex phenotypes (e.g., psychiatric diseases) single locus tests, commonly performed with genome-wide association studies, have proven to be limited in discovering strong gene associations. A growing body of evidence suggests that epistatic non-linear effects may be responsible for complex phenotypes arising from the interaction of different biological factors. A major issue in epistasis analysis is the computational burden due to the huge number of statistical tests to be performed when considering all the potential genotype combinations. In this work, we developed a computational efficient approach to compute empirical p-values concerning the presence of epistasis at a genome-wide scale in bipolar disorder, which is a typical example of complex phenotype with a relevant but unexplained genetic background. By running our approach we were able to identify 13 epistasis interactions between variants located in genes potentially involved in biological processes associated with the analyzed phenotype.

[1]  Arnaud Le Rouzic,et al.  Estimating directional epistasis , 2014, Front. Genet..

[2]  Jianxin Shi,et al.  Developing and evaluating polygenic risk prediction models for stratified disease prevention , 2016, Nature Reviews Genetics.

[3]  Ioannis Xenarios,et al.  FastEpistasis: a high performance computing solution for quantitative trait epistasis , 2010, Bioinform..

[4]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[5]  Carson C Chow,et al.  Second-generation PLINK: rising to the challenge of larger and richer datasets , 2014, GigaScience.

[6]  Emanuele Danovaro,et al.  Heterogeneous architectures for computational intensive applications: A cost-effectiveness analysis , 2014, J. Comput. Appl. Math..

[7]  Manuel A. R. Ferreira,et al.  Large-scale genome-wide association analysis of bipolar disorder identifies a new susceptibility locus near ODZ4 , 2011, Nature Genetics.

[8]  Patrick F Sullivan,et al.  Genome-wide association studies in psychiatry: what have we learned? , 2013, British Journal of Psychiatry.

[9]  N. Craddock,et al.  Genetics of bipolar disorder. , 2010, Journal of medical genetics.

[10]  Chris S. Haley,et al.  Detecting epistasis in human complex traits , 2014, Nature Reviews Genetics.

[11]  G. Rocheleau,et al.  A survey about methods dedicated to epistasis detection , 2015, Front. Genet..

[12]  Andrea Clematis,et al.  Image-Based Surface Matching Algorithm Oriented to Structural Biology , 2011, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[13]  T. Mackay,et al.  Erratum to: Why epistasis is important for tackling complex human disease genetics , 2015, Genome Medicine.