Genetic Variants Contribute to Gene Expression Variability in Humans

Expression quantitative trait loci (eQTL) studies have established convincing relationships between genetic variants and gene expression. Most of these studies have focused on the mean expression level rather than on the variance of expression (i.e., gene expression variability). In the present study, we systematically explore genome-wide associations between genetic variants and gene expression variability in humans. We adapt the double generalized linear model (dglm) to simultaneously fit the means and the variances of gene expression among the three possible genotypes of a biallelic SNP. The genomic loci showing significant association between the variances of gene expression and the genotypes are termed expression variability QTL (evQTL). Using a data set of gene expression in lymphoblastoid cell lines (LCLs) derived from 210 HapMap individuals, we identify cis-acting evQTL involving 218 distinct genes, among which 8 genes, ADCY1, CTNNA2, DAAM2, FERMT2, IL6, PLOD2, SNX7, and TNFRSF11B, are cross-validated using an extra expression data set of the same LCLs. We also identify ∼300 trans-acting evQTL between >13,000 common SNPs and 500 randomly selected representative genes. We consider two distinct scenarios, emphasizing single-SNP and multiple-SNP effects on expression variability, to explain the formation of evQTL. We argue that detecting evQTL may represent a novel method for effectively screening for genetic interactions, especially when a multiple-SNP influence on expression variability is implied. We discuss the implications of our results for revealing genetic mechanisms of gene expression variability.
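
The idea of testing whether expression variance differs among the three genotypes of a biallelic SNP can be illustrated with a small sketch. This is not the dglm fit used in the study: as a simplified stand-in, it runs a Gaussian likelihood-ratio test that allows a separate mean for each genotype in both models and asks whether a separate variance per genotype fits better than one shared variance. The function name `variance_lrt`, the simulated genotype frequencies, and the standard deviations are all hypothetical choices made for illustration.

```python
import numpy as np
from scipy.stats import chi2


def variance_lrt(expr, geno):
    """Gaussian likelihood-ratio test for unequal variances across genotypes.

    Both models give each genotype its own mean. The null model shares one
    variance; the alternative gives each genotype its own variance.
    Returns (test statistic, p-value). Simplified stand-in, not dglm.
    """
    expr = np.asarray(expr, dtype=float)
    geno = np.asarray(geno)
    groups = [expr[geno == g] for g in np.unique(geno)]
    n = expr.size

    # Maximum-likelihood variance of each genotype around its own mean.
    group_vars = [np.mean((x - x.mean()) ** 2) for x in groups]
    # Shared variance under the null: size-weighted average of group variances.
    pooled_var = sum(x.size * v for x, v in zip(groups, group_vars)) / n

    # -2 log likelihood ratio = n*log(pooled) - sum_g n_g*log(var_g)
    stat = n * np.log(pooled_var) - sum(
        x.size * np.log(v) for x, v in zip(groups, group_vars)
    )
    df = len(groups) - 1
    return stat, chi2.sf(stat, df)


# Simulated example (assumed values): 210 individuals, genotypes coded 0/1/2,
# equal means but a standard deviation that grows with the genotype code.
rng = np.random.default_rng(0)
geno = rng.choice([0, 1, 2], size=210, p=[0.25, 0.5, 0.25])
sd_by_genotype = np.array([0.5, 1.0, 2.0])
expr = rng.normal(loc=0.0, scale=sd_by_genotype[geno])

stat, p_value = variance_lrt(expr, geno)
print(f"LRT statistic = {stat:.2f}, p-value = {p_value:.3g}")
```

This test assumes normally distributed expression within each genotype and is sensitive to departures from that assumption, so a robust alternative such as the Brown-Forsythe test (`scipy.stats.levene` with `center="median"`) may be preferable on real data. A genome-wide scan would also need multiple-testing correction, which the sketch omits.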
