Heritability and Genomics of Gene Expression in Peripheral Blood

We assessed gene expression profiles in 2,752 twins, using a classic twin design to quantify expression heritability and quantitative trait loci (eQTLs) in peripheral blood. The most highly heritable genes (∼777) were grouped into distinct expression clusters, enriched in gene-poor regions, associated with specific gene function or ontology classes, and strongly associated with disease designation. The design enabled a comparison of twin-based heritability to estimates based on dizygotic identity-by-descent sharing and distant genetic relatedness. Consideration of sampling variation suggests that previous heritability estimates have been upwardly biased. Genotyping of 2,494 twins enabled powerful identification of eQTLs, which we further examined in a replication set of 1,895 unrelated subjects. A large number of non-redundant local eQTLs (6,756) met replication criteria, whereas a relatively small number of distant eQTLs (165) met quality control and replication standards. Our results provide a new resource toward understanding the genetic control of transcription.

[1]  H. Grüneberg,et al.  Introduction to quantitative genetics , 1960 .

[2]  L. Buchanan Mental retardation. , 1972, The Medical journal of Australia.

[3]  Nature Genetics , 1991, Nature.

[4]  Robin Thompson,et al.  Average information REML: An efficient algorithm for variance parameter estimation in linear mixed models , 1995 .

[5]  J. Rashbass Online Mendelian Inheritance in Man. , 1995, Trends in genetics : TIG.

[6]  The phenotypic difference discards sib-pair QTL linkage information. , 1997 .

[7]  F. Wright The phenotypic difference discards sib-pair QTL linkage information. , 1997, American journal of human genetics.

[8]  David J. Groggel,et al.  Practical Nonparametric Statistics , 2000, Technometrics.

[9]  G. Abecasis,et al.  Merlin—rapid analysis of dense genetic maps using sparse gene flow trees , 2002, Nature Genetics.

[10]  M. Daly,et al.  PGC-1α-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes , 2003, Nature Genetics.

[11]  C. Pál,et al.  The evolutionary dynamics of eukaryotic gene order , 2004, Nature Reviews Genetics.

[12]  Cameron S. Osborne,et al.  Active genes dynamically colocalize to shared sites of ongoing transcription , 2004, Nature Genetics.

[13]  Jennifer K Inlow,et al.  Molecular and comparative genetics of mental retardation. , 2004, Genetics.

[14]  Timothy B Sackton,et al.  A Scan for Positively Selected Genes in the Genomes of Humans and Chimpanzees , 2005, PLoS biology.

[15]  Andrew B. Nobel,et al.  Significance analysis of functional categories in gene expression studies: a structured permutation approach , 2005, Bioinform..

[16]  Nick Gilbert,et al.  The role of chromatin structure in regulating the expression of clustered genes , 2005, Nature Reviews Genetics.

[17]  Manuel A. R. Ferreira,et al.  Assumption-Free Estimation of Heritability from Genome-Wide Identity-by-Descent Sharing between Full Siblings , 2006, PLoS genetics.

[18]  Danielle Posthuma,et al.  Netherlands Twin Register: From Twins to Twin Families , 2006, Twin Research and Human Genetics.

[19]  D. Reich,et al.  Principal components analysis corrects for stratification in genome-wide association studies , 2006, Nature Genetics.

[20]  J. Pritchard,et al.  A Map of Recent Positive Selection in the Human Genome , 2006, PLoS biology.

[21]  David Bryant,et al.  DAVID Bioinformatics Resources: expanded annotation database and novel algorithms to better extract biology from large gene lists , 2007, Nucleic Acids Res..

[22]  Kathryn E. Hentges,et al.  Regional Variation in the Density of Essential Genes in Mice , 2007, PLoS genetics.

[23]  V. McKusick Mendelian Inheritance in Man and Its Online Version, OMIM , 2007, The American Journal of Human Genetics.

[24]  R. Redon,et al.  Relative Impact of Nucleotide and Copy Number Variation on Gene Expression Phenotypes , 2007, Science.

[25]  P. Donnelly,et al.  New models of collaboration in genome-wide association studies: the Genetic Association Information Network , 2007, Nature Genetics.

[26]  Joshua T. Burdick,et al.  Common genetic variants account for differences in gene expression among ethnic groups , 2007, Nature Genetics.

[27]  Jeffrey T Leek,et al.  On the design and analysis of gene expression studies in human populations , 2007, Nature Genetics.

[28]  John D. Storey,et al.  Capturing Heterogeneity in Gene Expression Studies by Surrogate Variable Analysis , 2007, PLoS genetics.

[29]  P. Sullivan,et al.  Genome-wide association of major depression: description of samples for the GAIN Major Depressive Disorder Study: NTR and NESDA biobank projects , 2008, European Journal of Human Genetics.

[30]  P. Cuijpers,et al.  The Netherlands Study of Depression and Anxiety (NESDA): rationale, objectives and methods , 2008, International journal of methods in psychiatric research.

[31]  P. Chiurazzi,et al.  XLMR genes: update 2007 , 2008, European Journal of Human Genetics.

[32]  Mark D. Adams,et al.  Human PAML browser: a database of positive selection on human genes using phylogenetic methods , 2007, Nucleic Acids Res..

[33]  Manuel A. R. Ferreira,et al.  Practical aspects of imputation-driven meta-analysis of genome-wide association studies. , 2008, Human molecular genetics.

[34]  Elliott Kieff,et al.  Genetic Analysis of Human Traits In Vitro: Drug Response and Gene Expression in Lymphoblastoid Cell Lines , 2008, PLoS genetics.

[35]  W. G. Hill,et al.  Heritability in the genomics era — concepts and misconceptions , 2008, Nature Reviews Genetics.

[36]  John D. Rioux,et al.  Genome-wide association studies: a new window into immune-mediated diseases , 2008, Nature Reviews Immunology.

[37]  D. Reich,et al.  Effects of cis and trans Genetic Ancestry on Gene Expression in African Americans , 2008, PLoS genetics.

[38]  Benjamin M Neale,et al.  The Positives , Protocols , and Perils of Genome-Wide Association , 2008 .

[39]  Andrew B. Nobel,et al.  A statistical framework for testing functional categories in microarray data , 2008, 0803.3881.

[40]  H. Stefánsson,et al.  Genetics of gene expression and its effect on disease , 2008, Nature.

[41]  Cole Trapnell,et al.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome , 2009, Genome Biology.

[42]  Daniel Sinnett,et al.  Population genomics in a disease targeted primary cell model. , 2009, Genome research.

[43]  L. Liang,et al.  Mapping complex disease traits with global gene expression , 2009, Nature Reviews Genetics.

[44]  Steven J. M. Jones,et al.  Circos: an information aesthetic for comparative genomics. , 2009, Genome research.

[45]  Guy Sella,et al.  Pervasive Hitchhiking at Coding and Regulatory Sites in Humans , 2009, PLoS genetics.

[46]  Jeremiah D. Degenhardt,et al.  Targets of balancing selection in the human genome. , 2009, Molecular biology and evolution.

[47]  F. Real,et al.  Interaction between Hhex and SOX13 Modulates Wnt/TCF Activity , 2009, The Journal of Biological Chemistry.

[48]  F. Collins,et al.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits , 2009, Proceedings of the National Academy of Sciences.

[49]  Andrew Starkey,et al.  Evidence of uneven selective pressure on different subsets of the conserved human genome; implications for the significance of intronic and intergenic DNA , 2009, BMC Genomics.

[50]  N. Wray,et al.  Genomewide Association for Major Depressive Disorder: A possible role for the presynaptic protein Piccolo , 2008, Molecular Psychiatry.

[51]  Silke Szymczak,et al.  Genetics and Beyond – The Transcriptome of Human Monocytes and Disease Susceptibility , 2010, PloS one.

[52]  R. Guigó,et al.  Transcriptome genetics using second generation sequencing in a Caucasian population , 2010, Nature.

[53]  M. Eileen Dolan,et al.  Chemotherapeutic drug susceptibility associated SNPs are enriched in expression quantitative trait loci , 2010, Proceedings of the National Academy of Sciences.

[54]  David M. Simcha,et al.  Tackling the widespread and critical impact of batch effects in high-throughput data , 2010, Nature Reviews Genetics.

[55]  Ayellet V. Segrè,et al.  Hundreds of variants clustered in genomic loci and biological pathways affect human height , 2010, Nature.

[56]  A. Nobel,et al.  Heading Down the Wrong Pathway: on the Influence of Correlation within Gene Sets , 2010, BMC Genomics.

[57]  Or Zuk,et al.  A Composite of Multiple Signals Distinguishes Causal Variants in Regions of Positive Selection , 2010, Science.

[58]  N. Cox,et al.  Trait-Associated SNPs Are More Likely to Be eQTLs: Annotation to Enhance Discovery from GWAS , 2010, PLoS genetics.

[59]  Joseph K. Pickrell,et al.  Understanding mechanisms underlying human gene expression variation with RNA sequencing , 2010, Nature.

[60]  Tanya M. Teslovich,et al.  Association analyses of 249,796 individuals reveal 18 new loci associated with body mass index , 2010 .

[61]  D. Altshuler,et al.  A map of human genome variation from population-scale sequencing , 2010, Nature.

[62]  T. Lehner,et al.  The Netherlands Twin Register Biobank: A Resource for Genetic Epidemiological Studies , 2010, Twin Research and Human Genetics.

[63]  Sharon R Grossman,et al.  Integrating common and rare genetic variation in diverse human populations , 2010, Nature.

[64]  Daniel Rios,et al.  Bioinformatics Applications Note Databases and Ontologies Deriving the Consequences of Genomic Variants with the Ensembl Api and Snp Effect Predictor , 2022 .

[65]  Cisca Wijmenga,et al.  Analysis of SNPs with an effect on gene expression identifies UBE2L3 and BCL3 as potential new risk genes for Crohn's disease. , 2010, Human molecular genetics.

[66]  J. Ibrahim,et al.  Genomewide Multiple-Loci Mapping in Experimental Crosses by Iterative Adaptive Penalized Regression , 2010, Genetics.

[67]  Luigi Ferrucci,et al.  Abundant Quantitative Trait Loci Exist for DNA Methylation and Gene Expression in Human Brain , 2010, PLoS genetics.

[68]  Jacek Majewski,et al.  The study of eQTL variations by RNA-seq: from SNPs to phenotypes. , 2011, Trends in genetics : TIG.

[69]  M. Daly,et al.  Proteins Encoded in Genomic Regions Associated with Immune-Mediated Disease Physically Interact and Suggest Underlying Biology , 2011, PLoS genetics.

[70]  Christian Gieger,et al.  The Use of Genome-Wide eQTL Associations in Lymphoblastoid Cell Lines to Identify Novel Genetic Pathways Involved in Complex Traits , 2011, PloS one.

[71]  Emmanouil Collab A map of human genome variation from population-scale sequencing , 2011, Nature.

[72]  P. Visscher,et al.  GCTA: a tool for genome-wide complex trait analysis. , 2011, American journal of human genetics.

[73]  Albert J. Vilella,et al.  A high-resolution map of human evolutionary constraint using 29 mammals , 2011, Nature.

[74]  C. Betancur,et al.  Etiological heterogeneity in autism spectrum disorders: More than 100 genetic and genomic disorders and still counting , 2011, Brain Research.

[75]  F. Agakov,et al.  Abundant pleiotropy in human complex diseases and traits. , 2011, American journal of human genetics.

[76]  Christopher D. Brown,et al.  Identification, Replication, and Functional Fine-Mapping of Expression Quantitative Trait Loci in Primary Human Liver Tissue , 2011, PLoS genetics.

[77]  Gregory M. Cooper,et al.  A Copy Number Variation Morbidity Map of Developmental Delay , 2011, Nature Genetics.

[78]  Heping Zhang,et al.  Statistical Inference in Mixed Models and Analysis of Twin and Family Data , 2011, Biometrics.

[79]  Jingyuan Fu,et al.  Trans-eQTLs Reveal That Independent Genetic Variants Associated with a Complex Phenotype Converge on Intermediate Genes, with a Major Role for the HLA , 2011, PLoS genetics.

[80]  F. Vannberg,et al.  GENETICS OF GENE EXPRESSION IN PRIMARY IMMUNE CELLS IDENTIFIES CELL-SPECIFIC MASTER REGULATORS AND ROLES OF HLA ALLELES , 2012, Nature Genetics.

[81]  Patrick F. Sullivan,et al.  Genetic architectures of psychiatric disorders: the emerging picture and its implications , 2012, Nature Reviews Genetics.

[82]  Fred A. Wright,et al.  seeQTL: a searchable database for human eQTLs , 2011, Bioinform..

[83]  Joseph K. Pickrell,et al.  DNaseI sensitivity QTLs are a major determinant of human expression variation , 2011, Nature.

[84]  P. Deloukas,et al.  Patterns of Cis Regulatory Variation in Diverse Human Populations , 2012, PLoS genetics.

[85]  K. Hao,et al.  Bayesian method to predict individual SNP genotypes from gene expression data , 2012, Nature Genetics.

[86]  Simon C. Potter,et al.  Mapping cis- and trans-regulatory effects across multiple tissues in twins , 2012, Nature Genetics.

[87]  Yuan Tian,et al.  Genome-wide transcriptome profiling reveals the functional impact of rare de novo and recurrent CNVs in autism spectrum disorders. , 2012, American journal of human genetics.

[88]  Wiepke Cahn,et al.  Expression QTL analysis of top loci from GWAS meta-analysis highlights additional schizophrenia candidate genes , 2012, European Journal of Human Genetics.

[89]  Nathan C. Sheffield,et al.  The accessible chromatin landscape of the human genome , 2012, Nature.

[90]  P. Visscher,et al.  Genetic control of gene expression in whole blood and lymphoblastoid cell lines is largely independent. , 2012, Genome research.

[91]  Shane J. Neph,et al.  Systematic Localization of Common Disease-Associated Variation in Regulatory DNA , 2012, Science.

[92]  Andrey A. Shabalin,et al.  Matrix eQTL: ultra fast eQTL analysis via large matrix operations , 2011, Bioinform..

[93]  Dorret I. Boomsma,et al.  The continuing value of twin studies in the omics era , 2012, Nature Reviews Genetics.

[94]  M. Peters,et al.  Systematic identification of trans eQTLs as putative drivers of known disease associations , 2013, Nature Genetics.

[95]  Psychiatric genetics: are we there yet? , 2013, JAMA psychiatry.

[96]  M. Stephens,et al.  A Statistical Framework for Joint eQTL Analysis in Multiple Tissues , 2012, PLoS genetics.

[97]  Eric S. Lander,et al.  Identifying Recent Adaptations in Large-Scale Genomic Data , 2013, Cell.

[98]  Joseph E. Powell,et al.  Congruence of Additive and Non-Additive Effects on Gene Expression Estimated from Pedigree and SNP Data , 2013, PLoS genetics.

[99]  T. Manolio,et al.  eXclusion: toward integrating the X chromosome in genome-wide association analyses. , 2013, American journal of human genetics.

[100]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[101]  G. Hong,et al.  Nucleic Acids Research , 2015, Nucleic Acids Research.