Quantifying the regulatory effect size of cis-acting genetic variation using allelic fold change

Mapping cis-acting expression quantitative trait loci (cis-eQTL) has become a popular approach for characterizing proximal genetic regulatory variants. However, measures used for quantifying the effect size of cis-eQTLs have been inconsistent and poorly defined. In this paper, we describe log allelic fold change (aFC) as a biologically interpretable and mathematically convenient unit that represents the magnitude of expression change associated with a given genetic variant. This measure is mathematically independent from expression level and allele frequency, applicable to multi-allelic variants, and generalizable to multiple independent variants. We provide tools and guidelines for estimating aFC from eQTL and allelic expression data sets, and apply it to GTEx data. We show that aFC estimates independently derived from eQTL and allelic expression data are highly consistent, and identify technical and biological correlates of eQTL effect size. We generalize aFC to analyze genes with two eQTLs in GTEx, and show that in nearly all cases these eQTLs are independent in their regulatory activity. In summary, aFC is a solid measure of cis-regulatory effect size that allows quantitative interpretation of cellular regulatory events from population data, and it is a valuable approach for investigating novel aspects of eQTL data sets.

[1]  Yi-Hui Zhou,et al.  Estimation of cis‐eQTL effect sizes using a log of linear model , 2018, Biometrics.

[2]  Nicola J. Rinaldi,et al.  Genetic effects on gene expression across human tissues , 2017, Nature.

[3]  William S Bush,et al.  Are Interactions between cis-Regulatory Variants Evidence for Biological Epistasis or Statistical Artifacts? , 2016, American journal of human genetics.

[4]  Eleazar Eskin,et al.  Local genetic effects on gene expression across 44 human tissues , 2016, bioRxiv.

[5]  Stephane E. Castel,et al.  Rare variant phasing and haplotypic expression from RNA sequencing with phASER , 2016, Nature Communications.

[6]  Neville E. Sanjana,et al.  CRISPR Screens to Discover Functional Noncoding Elements. , 2016, Trends in genetics : TIG.

[7]  Jacob C. Ulirsch,et al.  Systematic Functional Dissection of Common Genetic Variation Affecting Red Blood Cell Traits , 2016, Cell.

[8]  Eric S. Lander,et al.  Direct Identification of Hundreds of Expression-Modulating Variants using a Multiplexed Reporter Assay , 2016, Cell.

[9]  Andrew B. Nobel,et al.  Estimation of Interpretable eQTL Effect Sizes Using a Log of Linear Model , 2016, 1605.08799.

[10]  Daniel J. Gaffney,et al.  Fine-mapping cellular QTLs with RASQUAL and ATAC-seq , 2015, Nature Genetics.

[11]  J. Stinchcombe,et al.  Association mapping reveals the role of purifying selection in the maintenance of genomic variation in gene expression , 2015, Proceedings of the National Academy of Sciences.

[12]  J. Stinchcombe,et al.  Association mapping reveals the role of mutation-selection balance in the maintenance of genomic variation for gene expression , 2015, bioRxiv.

[13]  Eric Banks,et al.  Tools and best practices for data processing in allelic expression analysis , 2015, Genome Biology.

[14]  Jonathan K. Pritchard,et al.  WASP: allele-specific software for robust molecular quantitative trait locus discovery , 2015, Nature Methods.

[15]  Matthew C. Canver,et al.  BCL11A enhancer dissection by Cas9-mediated in situ saturating mutagenesis , 2015, Nature.

[16]  Emmanouil T. Dermitzakis,et al.  Fast and efficient QTL mapper for thousands of molecular phenotypes , 2015, bioRxiv.

[17]  William H. Majoros,et al.  Massively parallel quantification of the regulatory effects of noncoding genetic variation in a human cohort , 2015, Genome research.

[18]  Jung-Ying Tzeng,et al.  Proper Use of Allele-Specific Expression Improves Statistical Power for cis-eQTL Mapping with RNA-Seq Data , 2015, Journal of the American Statistical Association.

[19]  Kristin Reiche,et al.  Dissecting the genetics of the human transcriptome identifies novel trait-related trans-eQTLs and corroborates the regulatory relevance of non-protein coding loci† , 2015, Human molecular genetics.

[20]  G. Kempermann Faculty Opinions recommendation of Human genomics. The Genotype-Tissue Expression (GTEx) pilot analysis: multitissue gene regulation in humans. , 2015 .

[21]  Benjamin J. Strober,et al.  A method to predict the impact of regulatory variants from DNA sequence , 2015, Nature Genetics.

[22]  Emily K. Tsang,et al.  Effect of predicted protein-truncating genetic variants on the human transcriptome , 2015, Science.

[23]  Jun S. Liu,et al.  The Genotype-Tissue Expression (GTEx) pilot analysis: Multitissue gene regulation in humans , 2015, Science.

[24]  L. Kruglyak,et al.  The role of regulatory variation in complex traits and disease , 2015, Nature Reviews Genetics.

[25]  Alexis Battle,et al.  Impact of regulatory variation from RNA to protein , 2015, Science.

[26]  E. Dermitzakis,et al.  Tissue-Specific Effects of Genetic and Epigenetic Variation on Gene Regulation and Splicing , 2015, PLoS genetics.

[27]  Maria Gutierrez-Arcelus,et al.  Allelic mapping bias in RNA-sequencing is not a major confounder in eQTL studies , 2014, Genome Biology.

[28]  Ross M. Fraser,et al.  Defining the role of common variation in the genomic and biological architecture of adult human height , 2014, Nature Genetics.

[29]  Matthew Stephens,et al.  The genetic architecture of gene expression levels in wild baboons , 2014, bioRxiv.

[30]  David A. Knowles,et al.  Allelic Expression of Deleterious Protein-Coding Variants across Human Tissues , 2014, PLoS genetics.

[31]  Richard Durbin,et al.  Genetic interactions affecting human gene expression identified by variance association mapping , 2014, eLife.

[32]  P. Sullivan,et al.  Heritability and Genomics of Gene Expression in Peripheral Blood , 2014, Nature Genetics.

[33]  Joseph E. Powell,et al.  Detection and replication of epistasis influencing transcription in humans , 2014, Nature.

[34]  D. Koller,et al.  Characterizing the genetic basis of transcriptome diversity through RNA-sequencing of 922 individuals , 2013, Genome research.

[35]  Pedro G. Ferreira,et al.  Transcriptome and genome sequencing uncovers functional variation in humans , 2013, Nature.

[36]  D. Goldstein,et al.  Genic Intolerance to Functional Variation and the Interpretation of Personal Genomes , 2013, PLoS genetics.

[37]  E. Dermitzakis,et al.  Passive and active DNA methylation and the interplay with genetic variation in gene regulation , 2013, eLife.

[38]  Ellen T. Gelfand,et al.  The Genotype-Tissue Expression (GTEx) project , 2013, Nature Genetics.

[39]  Łukasz M. Boryń,et al.  Genome-Wide Quantitative Enhancer Activity Maps Identified by STARR-seq , 2013, Science.

[40]  M. Stephens,et al.  A Statistical Framework for Joint eQTL Analysis in Multiple Tissues , 2012, PLoS genetics.

[41]  Simon C. Potter,et al.  Mapping cis- and trans-regulatory effects across multiple tissues in twins , 2012, Nature Genetics.

[42]  R. Durbin,et al.  Using probabilistic estimation of expression residuals (PEER) to obtain increased power and interpretability of gene expression analyses , 2012, Nature Protocols.

[43]  Wei Sun,et al.  A Statistical Framework for eQTL Mapping Using RNA‐seq Data , 2012, Biometrics.

[44]  Andrey A. Shabalin,et al.  Matrix eQTL: ultra fast eQTL analysis via large matrix operations , 2011, Bioinform..

[45]  Joseph K. Pickrell,et al.  Understanding mechanisms underlying human gene expression variation with RNA sequencing , 2010, Nature.

[46]  W. Huber,et al.  which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. MAnorm: a robust model for quantitative comparison of ChIP-Seq data sets , 2011 .

[47]  A. Whitehead,et al.  Neutral and adaptive variation in gene expression. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[48]  Doree Sitkoff,et al.  models homology modeling : From sequence alignments to structural A comparative study of available software for high-accuracy , 2005 .

[49]  R. Stoughton,et al.  Genetics of gene expression surveyed in maize, mouse and man , 2003, Nature.

[50]  Y. Tu,et al.  Quantitative noise analysis for gene expression microarray experiments , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[51]  B. Efron Better Bootstrap Confidence Intervals , 1987 .