TS: a powerful truncated test to detect novel disease associated genes using publicly available GWAS summary data

Background: In the last decade, genome-wide association studies (GWASs) have identified a large number of common variants underlying complex diseases. Summary data from these GWASs are freely and publicly available and are usually obtained through single-marker analysis. Gene-based analysis offers a useful alternative and complement to single-marker analysis, and results from gene-level association tests can be more readily integrated with downstream functional and pathogenic investigations. Most existing gene-based methods fall into two categories: burden tests and quadratic tests. Burden tests are usually powerful when the effects of the causal variants are in the same direction, but they may lose statistical power when the causal variants have effects in different directions. Quadratic tests are not affected by the directions of effects, but they can be less powerful for other reasons, such as a large number of degrees of freedom. These drawbacks of existing gene-based methods motivated us to develop a new, powerful method to identify disease-associated genes using existing GWAS summary data.

Methods and Results: In this paper, we propose a new truncated statistic method (TS), which uses truncation to find the genes that truly contribute to the genetic association. Extensive simulation studies demonstrate that the proposed test outperforms other comparable tests. We applied TS and other comparable methods to schizophrenia GWAS data and to type 2 diabetes (T2D) GWAS meta-analysis summary data. TS identified more disease-associated genes than the comparable methods, and many of the significant genes identified by TS may act through mechanisms relevant to the associated traits. TS is implemented in a C program, also named TS, which is freely and publicly available online.

Conclusions: The proposed truncated statistic outperformed the comparable existing methods in our simulations and real-data applications. It can be employed to detect novel trait-associated genes using GWAS summary data.
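The contrast between burden and quadratic tests described in the Background can be illustrated with a minimal Python sketch that works on per-variant Z-scores from GWAS summary data. Everything here is an assumption made for illustration: the function names, the toy Z-scores, the identity LD matrix, and the cutoff are hypothetical, and `truncated_quadratic` is only a generic truncation of a sum of squares, not the definition of the TS statistic, which the abstract does not give.

```python
from typing import Sequence


def burden_statistic(z: Sequence[float], ld: Sequence[Sequence[float]]) -> float:
    """Burden-type statistic (sum of Z)^2 / (1' R 1), with R the LD matrix.

    Under the null it is chi-square with 1 degree of freedom.
    """
    m = len(z)
    total = sum(z)
    variance = sum(ld[i][j] for i in range(m) for j in range(m))
    return total * total / variance


def quadratic_statistic(z: Sequence[float]) -> float:
    """Quadratic-type statistic: sum of squared Z-scores.

    Its null distribution depends on the LD matrix (a mixture of
    chi-squares); only the statistic itself is computed here.
    """
    return sum(v * v for v in z)


def truncated_quadratic(z: Sequence[float], cutoff: float) -> float:
    """Hypothetical truncation: keep only variants with |Z| above the cutoff.

    This is an illustrative assumption, not the paper's TS statistic.
    """
    return sum(v * v for v in z if abs(v) > cutoff)


# Toy example: four independent variants (identity LD matrix, an assumption).
identity = [[1.0 if i == j else 0.0 for j in range(4)] for i in range(4)]
same_direction = [2.0, 2.0, 2.0, 2.0]
mixed_direction = [2.0, -2.0, 2.0, -2.0]

print(burden_statistic(same_direction, identity))   # 16.0
print(burden_statistic(mixed_direction, identity))  # 0.0, the signal cancels
print(quadratic_statistic(same_direction))          # 16.0
print(quadratic_statistic(mixed_direction))         # 16.0, sign-invariant
print(truncated_quadratic([2.0, -2.0, 0.5, 0.1], cutoff=1.5))  # 8.0
```

With opposite effect directions the burden statistic collapses to zero while the quadratic statistic is unchanged, which is the loss of power the Background refers to. The truncated variant drops weak signals, which hints at how truncation can reduce the noise contributed by many null variants.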
