Statistical power and utility of meta-analysis methods for cross-phenotype genome-wide association studies

Advances in recent genome wide association studies (GWAS) suggest that pleiotropic effects on human complex traits are widespread. A number of classic and recent meta-analysis methods have been used to identify genetic loci with pleiotropic effects, but the overall performance of these methods is not well understood. In this work, we use extensive simulations and case studies of GWAS datasets to investigate the power and type-I error rates of ten meta-analysis methods. We specifically focus on three conditions commonly encountered in the studies of multiple traits: (1) extensive heterogeneity of genetic effects; (2) characterization of trait-specific association; and (3) inflated correlation of GWAS due to overlapping samples. Although the statistical power is highly variable under distinct study conditions, we found the superior power of several methods under diverse heterogeneity. In particular, classic fixed-effects model showed surprisingly good performance when a variant is associated with more than a half of study traits. As the number of traits with null effects increases, ASSET performed the best along with competitive specificity and sensitivity. With opposite directional effects, CPASSOC featured the first-rate power. However, caution is advised when using CPASSOC for studying genetically correlated traits with overlapping samples. We conclude with a discussion of unresolved issues and directions for future research.

[1]  David M Nathan,et al.  TCF7L2 polymorphisms and progression to diabetes in the Diabetes Prevention Program. , 2006, The New England journal of medicine.

[2]  Z Chen,et al.  Is the weighted z‐test the best method for combining probabilities from independent tests? , 2011, Journal of evolutionary biology.

[3]  Christopher S. Poultney,et al.  Meta-analysis of GWAS of over 16,000 individuals with autism spectrum disorder highlights a novel locus at 10q24.32 and a significant overlap with schizophrenia , 2017, Molecular Autism.

[4]  Patrick F Sullivan,et al.  Modifiers and Subtype-Specific Analyses in Whole-Genome Association Studies: A Likelihood Framework , 2011, Human Heredity.

[5]  Constantin Polychronakos,et al.  Comparative genetic analysis of inflammatory bowel disease and type 1 diabetes implicates multiple loci with opposite effects. , 2010, Human molecular genetics.

[6]  Miles Parkes,et al.  Genetic insights into common pathways and complex relationships among immune-mediated diseases , 2013, Nature Reviews Genetics.

[7]  Robert M. Plenge,et al.  Meta-Analysis of Genome-Wide Association Studies in Celiac Disease and Rheumatoid Arthritis Identifies Fourteen Non-HLA Shared Loci , 2011, PLoS genetics.

[8]  Peter Kraft,et al.  Genome-Wide Meta-Analyses of Breast, Ovarian, and Prostate Cancer Association Studies Identify Multiple New Susceptibility Loci Shared by at Least Two Cancer Types. , 2016, Cancer discovery.

[9]  Eleazar Eskin,et al.  Random-effects model aimed at discovering associations in meta-analysis of genome-wide association studies. , 2011, American journal of human genetics.

[10]  M. Daly,et al.  Identification of risk loci with shared effects on five major psychiatric disorders: a genome-wide analysis , 2013, The Lancet.

[11]  Nick C Fox,et al.  Analysis of shared heritability in common disorders of the brain , 2018, Science.

[12]  P. Visscher,et al.  Multi-trait analysis of genome-wide association summary statistics using MTAG , 2017, Nature Genetics.

[13]  Saralees Nadarajah,et al.  On the optimally weighted z-test for combining probabilities from independent studies , 2014, Comput. Stat. Data Anal..

[14]  Xiaofeng Zhu,et al.  Meta-analysis of correlated traits via summary statistics from GWASs with an application in hypertension. , 2015, American journal of human genetics.

[15]  Simon C. Potter,et al.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls , 2007, Nature.

[16]  L. Liang,et al.  Shared Genetic Architecture between Asthma and Allergic Diseases: A Genome-Wide Cross Trait Analysis of 112,000 Individuals from UK Biobank , 2017, bioRxiv.

[17]  M. O’Donovan,et al.  What have we learned from the Psychiatric Genomics Consortium , 2015, World psychiatry : official journal of the World Psychiatric Association.

[18]  S. Purcell,et al.  Pleiotropy in complex traits: challenges and strategies , 2013, Nature Reviews Genetics.

[19]  Kasper Lage,et al.  Pervasive Sharing of Genetic Effects in Autoimmune Disease , 2011, PLoS genetics.

[20]  N. Craddock,et al.  Evidence for genetic heterogeneity between clinical subtypes of bipolar disorder , 2017, Translational Psychiatry.

[21]  M. Daly,et al.  An Atlas of Genetic Correlations across Human Diseases and Traits , 2015, Nature Genetics.

[22]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[23]  C. Sudlow,et al.  Shared genetic aetiology between cognitive functions and physical and mental health in UK Biobank (N=112 151) and 24 GWAS consortia , 2015, Molecular Psychiatry.

[24]  M. Kendall Statistical Methods for Research Workers , 1937, Nature.

[25]  F. Agakov,et al.  Abundant pleiotropy in human complex diseases and traits. , 2011, American journal of human genetics.

[26]  S Greenland,et al.  Quantitative methods in the review of epidemiologic literature. , 1987, Epidemiologic reviews.

[27]  Meta-Analysis of Genome-Wide Association Studies in Celiac Disease and Rheumatoid Arthritis Identifies Fourteen Non-HLA Shared Loci , 2011, PLoS genetics.

[28]  Joseph K. Pickrell,et al.  Detection and interpretation of shared genetic influences on 42 human traits , 2015, Nature Genetics.

[29]  Jing Li,et al.  Goodness-of-fit test for meta-analysis , 2015, Scientific Reports.

[30]  Nilanjan Chatterjee,et al.  A subset-based approach improves power and interpretation for the combined analysis of genetic association studies of heterogeneous traits. , 2012, American journal of human genetics.

[31]  C. Spencer,et al.  Biological Insights From 108 Schizophrenia-Associated Genetic Loci , 2014, Nature.

[32]  N. Laird,et al.  Meta-analysis in clinical trials. , 1986, Controlled clinical trials.

[33]  J. Murabito,et al.  Shared genetic aetiology of puberty timing between sexes and with health-related outcomes , 2015, Nature Communications.

[34]  Eleazar Eskin,et al.  Interpreting Meta-Analyses of Genome-Wide Association Studies , 2012, PLoS genetics.

[35]  John S Witte,et al.  Turning of COGS moves forward findings for hormonally mediated cancers , 2013, Nature Genetics.

[36]  Constantin Polychronakos,et al.  Meta-analysis of shared genetic architecture across ten pediatric autoimmune diseases , 2016 .

[37]  Jakob Grove,et al.  Discovery of the first genome-wide significant risk loci for ADHD , 2017, bioRxiv.

[38]  Søren Brunak,et al.  Analysis of five chronic inflammatory diseases identifies 27 new associations and highlights disease-specific patterns at shared loci , 2016, Nature Genetics.

[39]  Dmitri V Zaykin,et al.  P‐value based analysis for shared controls design in genome‐wide association studies , 2010, Genetic epidemiology.

[40]  G. Tseng,et al.  Comprehensive literature review and statistical considerations for GWAS meta-analysis , 2012, Nucleic acids research.

[41]  Qingzhong Liu,et al.  A new statistical approach to combining p-values using gamma distribution and its application to genome-wide association study , 2014, BMC Bioinformatics.