Joint Fine Mapping of Gwas and Eqtl Detects Target Gene and Relevant Tissue

The vast majority of genome-wide association studies (GWAS) risk loci fall in non-coding regions of the genome. One possible hypothesis is that these GWAS risk loci alter the disease risk through their effect on gene expression in different tissues. In order to understand the mechanisms driving a GWAS risk locus, it is helpful to determine which gene is affected in specific tissue types. If the same variant responsible for a GWAS locus also affects gene expression, the relevant gene and tissue may play a role in the disease mechanism. Identifying whether or not the same variant is causal in both GWAS and eQTL studies is challenging due to the uncertainty induced by linkage disequilibrium (LD) and the fact that some loci harbor multiple causal variants. However, current methods that address this problem assume that each locus contains a single causal variant. In this paper, we present a new method, eCAVIAR, that is capable of accounting for LD while computing the quantity we refer to as the colocalization posterior probability (CLPP). The CLPP is the probability that the same variant is responsible for both the GWAS and eQTL signal. eCAVIAR has several key advantages. First, our method can account for more than one causal variant in any loci. Second, it can leverage summary statistics without accessing the individual genotype data. We use both simulated and real datasets to demonstrate the utility of our method. Utilizing data from the Genotype-Tissue Expression (GTEx) project, we demonstrate that computing CLPP can prioritize likely relevant tissues and target genes for a set of Glucose and Insulin-related traits loci. eCAVIAR is available at http://genetics.cs.ucla.edu/caviar/

[1]  Jong Wha J. Joo,et al.  Multiple testing correction in linear mixed models , 2016, Genome Biology.

[2]  Mark I. McCarthy,et al.  Transcript Expression Data from Human Islets Links Regulatory Signals from Genome-Wide Association Studies for Type 2 Diabetes and Glycemic Traits to Their Downstream Effectors , 2015, PLoS genetics.

[3]  E. Dermitzakis,et al.  Alternative Splicing QTLs in European and African Populations , 2015, American journal of human genetics.

[4]  Matti Pirinen,et al.  FINEMAP: efficient variable selection using summary data from genome-wide association studies , 2015, bioRxiv.

[5]  Kaanan P. Shah,et al.  A gene-based association method for mapping traits using reference transcriptome data , 2015, Nature Genetics.

[6]  María M. Abad-Grau,et al.  A functional variant that affects exon-skipping and protein expression of SP140 as genetic mechanism predisposing to multiple sclerosis. , 2015, Human molecular genetics.

[7]  L. Liang,et al.  iGWAS: Integrative Genome‐Wide Association Studies of Genetic and Genomic Data for Disease Susceptibility Using Mediation Analysis , 2015, Genetic epidemiology.

[8]  Eleazar Eskin,et al.  Identification of causal genes for complex traits , 2015, Bioinform..

[9]  Eleazar Eskin,et al.  Accurate and fast multiple-testing correction in eQTL studies. , 2015, American journal of human genetics.

[10]  Jun S. Liu,et al.  The Genotype-Tissue Expression (GTEx) pilot analysis: Multitissue gene regulation in humans , 2015, Science.

[11]  Gregory A. Poland,et al.  Fine Mapping Causal Variants with an Approximate Bayesian Method Using Marginal Test Statistics , 2015, Genetics.

[12]  Roderic Guigó,et al.  Identification of genetic variants associated with alternative splicing using sQTLseekeR , 2014, Nature Communications.

[13]  Eleazar Eskin,et al.  Identifying Causal Variants at Loci with Multiple Signals of Association , 2014, Genetics.

[14]  M. Gobbi,et al.  Analysis of hundreds of cis-regulatory landscapes at high resolution in a single, high-throughput experiment , 2014, Nature Genetics.

[15]  Peggy Hall,et al.  The NHGRI GWAS Catalog, a curated resource of SNP-trait associations , 2013, Nucleic Acids Res..

[16]  C. Wallace,et al.  Bayesian Test for Colocalisation between Pairs of Genetic Association Studies Using Summary Statistics , 2013, PLoS genetics.

[17]  Simon C. Potter,et al.  Genome-wide Association Analysis Identifies 14 New Risk Loci for Schizophrenia , 2013, Nature Genetics.

[18]  Jonathan P. Beauchamp,et al.  GWAS of 126,559 Individuals Identifies Genetic Variants Associated with Educational Attainment , 2013, Science.

[19]  Xia Yang,et al.  Sherlock: detecting gene-disease associations by matching patterns of expression QTL and GWAS. , 2013, American journal of human genetics.

[20]  Jake K. Byrnes,et al.  Bayesian refinement of association signals for 14 loci in 3 common diseases , 2012, Nature Genetics.

[21]  Kenny Q. Ye,et al.  An integrated map of genetic variation from 1,092 human genomes , 2012, Nature.

[22]  Tanya M. Teslovich,et al.  Large-scale association analyses identify new loci influencing glycemic traits and provide insight into the underlying biological pathways , 2012, Nature Genetics.

[23]  Eleazar Eskin,et al.  Incorporating prior information into association studies , 2012, Bioinform..

[24]  John A. Todd,et al.  Statistical colocalization of monocyte gene expression and genetic risk variants for type 1 diabetes , 2012, Human molecular genetics.

[25]  P. Visscher,et al.  Five years of GWAS discovery. , 2012, American journal of human genetics.

[26]  E. Oetjen,et al.  Genome-Wide Association Identifies Nine Common Variants Associated With Fasting Proinsulin Levels and Provides New Insights Into the Pathophysiology of Type 2 Diabetes , 2011, Diabetes.

[27]  Peter Donnelly,et al.  HAPGEN2: simulation of multiple disease SNPs , 2011, Bioinform..

[28]  Eleazar Eskin,et al.  Increasing Power of Genome-Wide Association Studies by Collecting Additional Single-Nucleotide Polymorphisms , 2011, Genetics.

[29]  Eleazar Eskin,et al.  An Optimal Weighted Aggregated Association Test for Identification of Rare Variants Involved in Common Diseases , 2011, Genetics.

[30]  D. Altshuler,et al.  A map of human genome variation from population-scale sequencing , 2010, Nature.

[31]  Christian Gieger,et al.  Edinburgh Research Explorer Common variants at 10 genomic loci influence hemoglobin A(C) levels via glycemic and nonglycemic pathways , 2010 .

[32]  Sharon R Grossman,et al.  Integrating common and rare genetic variation in diverse human populations , 2010, Nature.

[33]  E. Dermitzakis,et al.  Candidate Causal Regulatory Effects by Integration of Expression QTLs with Complex Trait Genetic Associations , 2010, PLoS genetics.

[34]  N. Cox,et al.  Trait-Associated SNPs Are More Likely to Be eQTLs: Annotation to Enhance Discovery from GWAS , 2010, PLoS genetics.

[35]  Alex Doney,et al.  Genetic variation in GIPR influences the glucose and insulin responses to an oral glucose challenge , 2010, Nature Genetics.

[36]  Christian Gieger,et al.  New genetic loci implicated in fasting glucose homeostasis and their impact on type 2 diabetes risk , 2010, Nature Genetics.

[37]  Eran Halperin,et al.  Leveraging genetic variability across populations for the identification of causal variants. , 2010, American journal of human genetics.

[38]  Eleazar Eskin,et al.  Rapid and Accurate Multiple Testing Correction and Power Estimation for Millions of Correlated Markers , 2009, PLoS genetics.

[39]  John A. Todd,et al.  Statistical independence of the colocalized association signals for type 1 diabetes and RPS26 gene expression on chromosome 12q13 , 2008, Biostatistics.

[40]  Eleazar Eskin,et al.  Increasing Power in Association Studies by Using Linkage Disequilibrium Structure and Molecular Function as Prior Information , 2008, RECOMB.

[41]  Joseph K. Pickrell,et al.  Power of genome‐wide association studies in the presence of interacting loci , 2007, Genetic epidemiology.

[42]  Simon C. Potter,et al.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls , 2007, Nature.

[43]  J. Pritchard,et al.  Linkage disequilibrium in humans: models and data. , 2001, American journal of human genetics.

[44]  L R Cardon,et al.  Extent and distribution of linkage disequilibrium in three genomic regions. , 2001, American journal of human genetics.

[45]  E Lai,et al.  The extent of linkage disequilibrium in four populations with distinct demographic histories. , 2000, American journal of human genetics.

[46]  L. Kruglyak Prospects for whole-genome linkage disequilibrium mapping of common disease genes , 1999, Nature Genetics.