Trait-Associated SNPs Are More Likely to Be eQTLs: Annotation to Enhance Discovery from GWAS

Although genome-wide association studies (GWAS) of complex traits have yielded more reproducible associations than had been discovered using any other approach, the loci characterized to date do not account for much of the heritability of such traits and, in general, have not led to improved understanding of the biology underlying complex phenotypes. Using a web site we developed to serve results of expression quantitative trait locus (eQTL) studies in lymphoblastoid cell lines from HapMap samples (http://www.scandb.org), we show that single nucleotide polymorphisms (SNPs) associated with complex traits (from http://www.genome.gov/gwastudies/) are significantly more likely to be eQTLs than minor-allele-frequency–matched SNPs chosen from high-throughput GWAS platforms. These findings are robust across a range of thresholds for establishing eQTLs (p-values from 10⁻⁴ to 10⁻⁸) and across a broad spectrum of human complex traits. Analyses of GWAS data from the Wellcome Trust studies confirm that annotating SNPs with a score reflecting the strength of the evidence that the SNP is an eQTL can improve the ability to discover true associations and clarify the nature of the mechanism driving the associations. Together, these results show that trait-associated SNPs are more likely to be eQTLs and that applying this information can enhance discovery of trait-associated SNPs for complex phenotypes. They raise the possibility that eQTL annotation can be used both to increase the heritability explained by identifiable genetic factors and to gain a better understanding of the biology underlying complex traits.
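The comparison against minor-allele-frequency–matched SNPs can be illustrated with a small simulation. The sketch below is not the authors' code: the function names, the ten equal-width MAF bins, sampling with replacement, and the empirical p-value formula are all assumptions chosen for illustration. It counts how many trait-associated SNPs are eQTLs, then repeatedly draws random platform SNPs from the same MAF bins to build a null distribution for that count.

```python
import random
from collections import defaultdict


def maf_bin(maf, n_bins=10):
    """Map a minor allele frequency in [0, 0.5] to a bin index (hypothetical binning)."""
    return min(int(maf / 0.5 * n_bins), n_bins - 1)


def eqtl_enrichment(trait_snps, platform_snps, maf, eqtl_snps,
                    n_sim=1000, n_bins=10, seed=0):
    """Compare the eQTL count among trait SNPs with MAF-matched random draws.

    trait_snps    -- list of trait-associated SNP ids
    platform_snps -- list of SNP ids available on the genotyping platform
    maf           -- dict mapping SNP id -> minor allele frequency
    eqtl_snps     -- set of SNP ids called eQTLs at some chosen threshold
    Returns (observed count, mean null count, empirical p-value).
    Assumes every MAF bin used by a trait SNP holds at least one platform SNP.
    """
    rng = random.Random(seed)

    # Group platform SNPs by MAF bin so matched draws are cheap.
    pools = defaultdict(list)
    for snp in platform_snps:
        pools[maf_bin(maf[snp], n_bins)].append(snp)

    observed = sum(snp in eqtl_snps for snp in trait_snps)
    trait_bins = [maf_bin(maf[snp], n_bins) for snp in trait_snps]

    # Each simulation draws one platform SNP per trait SNP from the same bin
    # (with replacement, an assumption made for simplicity).
    null_counts = []
    for _ in range(n_sim):
        draw = [rng.choice(pools[b]) for b in trait_bins]
        null_counts.append(sum(snp in eqtl_snps for snp in draw))

    # Empirical one-sided p-value with the usual +1 correction.
    exceed = sum(count >= observed for count in null_counts)
    p_value = (exceed + 1) / (n_sim + 1)
    return observed, sum(null_counts) / n_sim, p_value
```

Re-running such a function with `eqtl_snps` defined at different eQTL p-value thresholds would mirror the robustness check described above, though the thresholds and data sources would need to come from the actual study resources.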
