BAYESIAN METHODS FOR GENETIC ASSOCIATION ANALYSIS WITH HETEROGENEOUS SUBGROUPS: FROM META-ANALYSES TO GENE-ENVIRONMENT INTERACTIONS.

Genetic association analyses often involve data from multiple potentially-heterogeneous subgroups. The expected amount of heterogeneity can vary from modest (e.g. a typical meta-analysis), to large (e.g. a strong gene-environment interaction). However, existing statistical tools are limited in their ability to address such heterogeneity. Indeed, most genetic association meta-analyses use a "fixed effects" analysis, which assumes no heterogeneity. Here we develop and apply Bayesian association methods to address this problem. These methods are easy to apply (in the simplest case, requiring only a point estimate for the genetic effect, and its standard error, from each subgroup), and effectively include standard frequentist meta-analysis methods, including the usual "fixed effects" analysis, as special cases. We apply these tools to two large genetic association studies: one a meta-analysis of genome-wide association studies from the Global Lipids consortium, and the second a cross-population analysis for expression quantitative trait loci (eQTLs). In the Global Lipids data we find, perhaps surprisingly, that effects are generally quite homogeneous across studies. In the eQTL study we find that eQTLs are generally shared among different continental groups, and discuss consequences of this for study design.

[1]  Art B. Owen,et al.  Karl Pearson’s meta analysis revisited , 2009, 0911.3531.

[2]  Andrew T. A. Wood,et al.  Laplace approximations for hypergeometric functions with matrix argument , 2002 .

[3]  D. Koller,et al.  Population genomics of human gene expression , 2007, Nature Genetics.

[4]  Ross D. Shachter,et al.  A Bayesian Method for Synthesizing Evidence: The Confidence Profile Method , 1990, International Journal of Technology Assessment in Health Care.

[5]  A Whitehead,et al.  A general parametric approach to the meta-analysis of randomized clinical trials. , 1991, Statistics in medicine.

[6]  Stephen Burgess,et al.  Bayesian methods for meta‐analysis of causal relationships estimated using genetic instrumental variables , 2010, Statistics in medicine.

[7]  K R Abrams,et al.  Bayesian methods in meta-analysis and evidence synthesis. , 2001, Statistical methods in medical research.

[8]  Donald A. Berry,et al.  Meta-Analysis in Medicine and Health Policy , 2000 .

[9]  Yongtao Guan,et al.  Variation in Human Recombination Rates and Its Genetic Determinants , 2011, PloS one.

[10]  I Olkin,et al.  Simple Pooling versus Combining in Meta-Analysis , 2001, Evaluation & the health professions.

[11]  Jon Wakefield,et al.  Bayes factors for genome‐wide association studies: comparison with P‐values , 2009, Genetic epidemiology.

[12]  Scott A. Rifkin,et al.  Revealing the architecture of gene regulation: the promise of eQTL studies. , 2008, Trends in genetics : TIG.

[13]  Eleazar Eskin,et al.  Random-effects model aimed at discovering associations in meta-analysis of genome-wide association studies. , 2011, American journal of human genetics.

[14]  R. Butler SADDLEPOINT APPROXIMATIONS WITH APPLICATIONS. , 2007 .

[15]  D. Altshuler,et al.  A map of human genome variation from population-scale sequencing , 2010, Nature.

[16]  Valen E. Johnson,et al.  Properties of Bayes Factors Based on Test Statistics , 2008 .

[17]  Claudio J. Verzilli,et al.  Bayesian semiparametric meta‐analysis for genetic association studies , 2011, Genetic epidemiology.

[18]  Yoav Gilad,et al.  Sex-specific genetic architecture of human disease , 2008, Nature Reviews Genetics.

[19]  P. Deloukas,et al.  Common Regulatory Variation Impacts Gene Expression in a Cell Type–Dependent Manner , 2009, Science.

[20]  Joseph K. Pickrell,et al.  Understanding mechanisms underlying human gene expression variation with RNA sequencing , 2010, Nature.

[21]  Valen E. Johnson,et al.  Bayes factors based on test statistics , 2005 .

[22]  M. Stephens A Unified Framework for Association Analysis with Multiple Related Phenotypes , 2013, PloS one.

[23]  Jeffrey E. Harris,et al.  Bayes Methods for Combining the Results of Cancer Studies in Humans and other Species , 1983 .

[24]  Tanya M. Teslovich,et al.  Biological, Clinical, and Population Relevance of 95 Loci for Blood Lipids , 2010, Nature.

[25]  Barbara E. Stranger,et al.  Gene Expression Levels Are a Target of Recent Natural Selection in the Human Genome , 2008, Molecular biology and evolution.

[26]  Kari Stefansson,et al.  Sequence Variants in the RNF212 Gene Associate with Genome-Wide Recombination Rate , 2008, Science.

[27]  Yongtao Guan,et al.  Practical Issues in Imputation-Based Association Mapping , 2008, PLoS genetics.

[28]  Yun Li,et al.  METAL: fast and efficient meta-analysis of genomewide association scans , 2010, Bioinform..

[29]  R. Tweedie,et al.  Publication Bias in Meta-Analysis: A Bayesian Data-Augmentation Approach to Account for Issues Exempli(cid:12)ed in the Passive Smoking Debate , 1997 .

[30]  Christopher D. Brown,et al.  Integrative Modeling of eQTLs and Cis-Regulatory Elements Suggests Mechanisms Underlying Cell Type Specificity of eQTLs , 2012, PLoS genetics.

[31]  M. McCarthy,et al.  Meta-analysis of genome-wide association data and large-scale replication identifies additional susceptibility loci for type 2 diabetes , 2008, Nature Genetics.

[32]  M. Stephens,et al.  Imputation-Based Analysis of Association Studies: Candidate Regions and Quantitative Traits , 2007, PLoS genetics.

[33]  Theo Stijnen,et al.  Statistical Applications in Genetics and Molecular Biology Dealing with Heterogeneity between Cohorts in Genomewide SNP Association Studies , 2011 .

[34]  Claudio J. Verzilli,et al.  Bayesian meta-analysis of genetic association studies with different sets of markers. , 2007, American journal of human genetics.

[35]  M. Stephens,et al.  High-Resolution Mapping of Expression-QTLs Yields Insight into Human Gene Regulation , 2008, PLoS genetics.

[36]  Colin B. Begg,et al.  Random Effects Models for Combining Results from Controlled and Uncontrolled Studies in a Meta-Analysis , 1994 .

[37]  M. Stephens,et al.  A Statistical Framework for Joint eQTL Analysis in Multiple Tissues , 2012, PLoS genetics.

[38]  Xiaoquan Wen,et al.  Bayesian analysis of genetic association data, accounting for heterogeneity , 2011 .

[39]  A. Mila,et al.  A Bayesian approach to meta-analysis of plant pathology studies. , 2011, Phytopathology.