Eigen-Epistasis for detecting gene-gene interactions

BackgroundA large amount of research has been devoted to the detection and investigation of epistatic interactions in genome-wide association studies (GWASs). Most of the literature focuses on low-order interactions between single-nucleotide polymorphisms (SNPs) with significant main effects.ResultsIn this paper we propose an original approach for detecting epistasis at the gene level, without systematically filtering on significant genes. We first compute interaction variables for each gene pair by finding its Eigen-Epistasis component, defined as the linear combination of Gene SNPs having the highest correlation with the phenotype. The selection of significant effects is done using a penalized regression method based on Group Lasso controlling the False Discovery Rate.ConclusionThe method is tested against two recent alternative proposals from the literature using synthetic data, and shows good performances in different settings. We demonstrate the power of our approach by detecting new gene-gene interactions on three genome-wide association studies.

[1]  M. Yuan,et al.  Model selection and estimation in regression with grouped variables , 2006 .

[2]  M. J. Cabello,et al.  Epistatic interaction between TLR4 and NOD2 in patients with Crohn's Disease: relation with risk and phenotype in a Spanish cohort. , 2016, Immunobiology.

[3]  C J Eastmond,et al.  HLA B27 and the genetics of ankylosing spondylitis. , 1978, Annals of the rheumatic diseases.

[4]  Christophe Ambroise,et al.  Beyond support in two-stage variable selection , 2015, Stat. Comput..

[5]  Tao Wang,et al.  A partial least‐square approach for modeling gene‐gene and gene‐environment interactions when multiple markers are genotyped , 2009, Genetic epidemiology.

[6]  E. L. Persons,et al.  Ankylosing Spondylitis , 1955, GP.

[7]  G. D'Angelo,et al.  Combining least absolute shrinkage and selection operator (LASSO) and principal-components analysis for detection of gene-gene interactions in genome-wide association studies , 2009, BMC proceedings.

[8]  N. Chatterjee,et al.  Powerful multilocus tests of genetic association in the presence of gene-gene and gene-environment interactions. , 2006, American journal of human genetics.

[9]  Alfonso Valencia,et al.  An Epistatic Interaction between the PAX8 and STK17B Genes in Papillary Thyroid Cancer Susceptibility , 2013, PloS one.

[10]  Cristina Y. González,et al.  Identification of epistatic interactions through genome-wide association studies in sporadic medullary and juvenile papillary thyroid carcinomas , 2015, BMC Medical Genomics.

[11]  Fengyu Zhang,et al.  An approach to incorporate linkage disequilibrium structure into genomic association analysis. , 2008, Journal of genetics and genomics = Yi chuan xue bao.

[12]  A. Boonen,et al.  Ankylosing spondylitis: an overview , 2002, Annals of the rheumatic diseases.

[13]  Nicholas B Larson,et al.  Kernel canonical correlation analysis for assessing gene–gene interactions and application to ovarian cancer , 2013, European Journal of Human Genetics.

[14]  Kristel Van Steen,et al.  Travelling the world of gene-gene interactions , 2012, Briefings Bioinform..

[15]  Trevor J. Hastie,et al.  Genome-wide association analysis by lasso penalized logistic regression , 2009, Bioinform..

[16]  David Haig,et al.  Does heritability hide in epistasis between linked SNPs? , 2011, European Journal of Human Genetics.

[17]  Jing He,et al.  Gene-based interaction analysis by incorporating external linkage disequilibrium information , 2010, European Journal of Human Genetics.

[18]  Fuzhong Xue,et al.  Detection for gene-gene co-association via kernel canonical correlation analysis , 2012, BMC Genetics.

[19]  Judy H. Cho,et al.  Finding the missing heritability of complex diseases , 2009, Nature.

[20]  Chris S. Haley,et al.  Detecting epistasis in human complex traits , 2014, Nature Reviews Genetics.

[21]  Peter Donnelly,et al.  Identification of multiple risk variants for ankylosing spondylitis through high-density genotyping of immune-related loci , 2013, Nature Genetics.

[22]  Xin Wang,et al.  Pathway‐Guided Identification of Gene‐Gene Interactions , 2014, Annals of human genetics.

[23]  Michael M. Ward,et al.  Genome-wide association study of ankylosing spondylitis identifies non-MHC susceptibility loci , 2010, Nature Genetics.

[24]  R. Inman,et al.  The Application of Clinical Genetics Dovepress the Genetic Basis of Ankylosing Spondylitis: New Insights into Disease Pathogenesis , 2022 .

[25]  Yuehua Cui,et al.  Gene-centric gene–gene interaction: A model-based kernel machine method , 2012, 1209.6502.

[26]  Qianqian Peng,et al.  A gene-based method for detecting gene–gene co-association in a case–control association study , 2010, European Journal of Human Genetics.

[27]  M. Epstein,et al.  Analysis of Gene-Gene Interactions Using Gene-Trait Similarity Regression , 2014, Human Heredity.

[28]  P I Terasaki,et al.  High association of an HL-A antigen, W27, with ankylosing spondylitis. , 1973, The New England journal of medicine.

[29]  Mariza de Andrade,et al.  Identification of gene-gene interaction using principal components , 2009, BMC proceedings.

[30]  M. Brown,et al.  Genetics and genomics of ankylosing spondylitis , 2010, Immunological reviews.

[31]  E. Lander,et al.  The mystery of missing heritability: Genetic interactions create phantom heritability , 2012, Proceedings of the National Academy of Sciences.

[32]  M. Perlman,et al.  Multivariate Detection of Gene‐Gene Interactions , 2012, Genetic epidemiology.

[33]  Hossein Baharvand,et al.  Genetics and genomics , 1998, Nature.

[34]  G. Rocheleau,et al.  A survey about methods dedicated to epistasis detection , 2015, Front. Genet..