An empirical Bayes approach for multiple tissue eQTL analysis

SUMMARY Expression quantitative trait locus (eQTL) analyses identify genetic markers associated with the expression of a gene. Most up‐to‐date eQTL studies consider the connection between genetic variation and expression in a single tissue. Multi‐tissue analyses have the potential to improve findings in a single tissue, and elucidate the genotypic basis of differences between tissues. In this article, we develop a hierarchical Bayesian model (MT‐eQTL) for multi‐tissue eQTL analysis. MT‐eQTL explicitly captures patterns of variation in the presence or absence of eQTL, as well as the heterogeneity of effect sizes across tissues. We devise an efficient Expectation‐Maximization (EM) algorithm for model fitting. Inferences concerning eQTL detection and the configuration of eQTL across tissues are derived from the adaptive thresholding of local false discovery rates, and maximum a posteriori estimation, respectively. We also provide theoretical justification of the adaptive procedure. We investigate the MT‐eQTL model through an extensive analysis of a 9‐tissue data set from the GTEx initiative.

[1]  M. Stephens,et al.  A Statistical Framework for Joint eQTL Analysis in Multiple Tissues , 2012, PLoS genetics.

[2]  Matthew Stephens,et al.  BAYESIAN METHODS FOR GENETIC ASSOCIATION ANALYSIS WITH HETEROGENEOUS SUBGROUPS: FROM META-ANALYSES TO GENE-ENVIRONMENT INTERACTIONS. , 2011, The annals of applied statistics.

[3]  P. Sullivan,et al.  Heritability and Genomics of Gene Expression in Peripheral Blood , 2014, Nature Genetics.

[4]  Ellen T. Gelfand,et al.  The Genotype-Tissue Expression (GTEx) project , 2013, Nature Genetics.

[5]  H. Stefánsson,et al.  Genetics of gene expression and its effect on disease , 2008, Nature.

[6]  Andrey A. Shabalin,et al.  Matrix eQTL: ultra fast eQTL analysis via large matrix operations , 2011, Bioinform..

[7]  Wei Chen,et al.  Gene Expression in Skin and Lymphoblastoid Cells: Refined Statistical Method Reveals Extensive Overlap in Cis-eqtl Signals , 2022 .

[8]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[9]  Lude Franke,et al.  eQTL analysis in humans. , 2009, Methods in molecular biology.

[10]  Ping Wang,et al.  A review of statistical methods for expression quantitative trait loci mapping , 2006, Mammalian Genome.

[11]  Christopher D. Brown,et al.  Integrative Modeling of eQTLs and Cis-Regulatory Elements Suggests Mechanisms Underlying Cell Type Specificity of eQTLs , 2012, PLoS genetics.

[12]  P. Deloukas,et al.  Common Regulatory Variation Impacts Gene Expression in a Cell Type–Dependent Manner , 2009, Science.

[13]  Eleazar Eskin,et al.  Distant regulatory effects of genetic variation in multiple human tissues , 2016, bioRxiv.

[14]  Gordon K Smyth,et al.  Statistical Applications in Genetics and Molecular Biology Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments , 2011 .

[15]  Wenguang Sun,et al.  Simultaneous Testing of Grouped Hypotheses: Finding Needles in Multiple Haystacks , 2009 .

[16]  Christina Kendziorski,et al.  On Differential Variability of Expression Ratios: Improving Statistical Inference about Gene Expression Changes from Microarray Data , 2001, J. Comput. Biol..

[17]  Simon C. Potter,et al.  The Architecture of Gene Regulatory Variation across Multiple Human Tissues: The MuTHER Study , 2011, PLoS genetics.

[18]  John D. Storey,et al.  Statistical significance for genomewide studies , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[19]  John D. Storey,et al.  Empirical Bayes Analysis of a Microarray Experiment , 2001 .

[20]  Christina Kendziorski,et al.  An Empirical Bayesian Approach for Identifying Differential Coexpression in High‐Throughput Experiments , 2012, Biometrics.

[21]  M. Peters,et al.  Systematic identification of trans eQTLs as putative drivers of known disease associations , 2013, Nature Genetics.

[22]  Ivan Rusyn,et al.  Computational tools for discovery and interpretation of expression quantitative trait loci. , 2012, Pharmacogenomics.

[23]  Deepayan Sarkar,et al.  Detecting differential gene expression with a semiparametric hierarchical mixture method. , 2004, Biostatistics.

[24]  Matthias Heinig,et al.  New Insights into the Genetic Control of Gene Expression using a Bayesian Multi-tissue Approach , 2010, PLoS Comput. Biol..

[25]  Andrew B. Nobel,et al.  HT-eQTL: integrative expression quantitative trait loci analysis in a large number of human tissues , 2017, BMC Bioinformatics.

[26]  K. Shianna,et al.  Tissue-Specific Genetic Control of Splicing: Implications for the Study of Complex Traits , 2008, PLoS biology.

[27]  Andrew Kasarskis,et al.  Cross-species systems analysis identifies gene networks differentially altered by sleep loss and depression , 2018, Science Advances.

[28]  David J. Spiegelhalter,et al.  Microarrays, Empirical Bayes and the Two-Groups Model. Comment. , 2008 .

[29]  R. Ophoff,et al.  Unraveling the Regulatory Mechanisms Underlying Tissue-Dependent Genetic Variation of Gene Expression , 2012, PLoS genetics.

[30]  E. Stone,et al.  The genetics of quantitative traits: challenges and prospects , 2009, Nature Reviews Genetics.

[31]  Alan Winterbottom,et al.  A Note on the Derivation of Fisher's Transformation of the Correlation Coefficient , 1979 .

[32]  Wenguang Sun,et al.  Oracle and Adaptive Compound Decision Rules for False Discovery Rate Control , 2007 .

[33]  John D. Storey,et al.  Capturing Heterogeneity in Gene Expression Studies by Surrogate Variable Analysis , 2007, PLoS genetics.

[34]  Terence P. Speed,et al.  A comparison of normalization methods for high density oligonucleotide array data based on variance and bias , 2003, Bioinform..

[35]  B. Efron Size, power and false discovery rates , 2007, 0710.2245.

[36]  L. Kruglyak,et al.  Genetics of global gene expression , 2006, Nature Reviews Genetics.

[37]  Chun Jimmie Ye,et al.  Effectively Identifying eQTLs from Multiple Tissues by Combining Mixed Model and Meta-analytic Approaches , 2013, PLoS genetics.

[38]  Yoav Gilad,et al.  Expression quantitative trait loci detected in cell lines are often present in primary tissues. , 2009, Human molecular genetics.

[39]  Christopher R. Sibley,et al.  Identification of expression quantitative trait loci associated with schizophrenia and affective disorders in normal brain tissue , 2016, bioRxiv.

[40]  Fred A. Wright,et al.  seeQTL: a searchable database for human eQTLs , 2011, Bioinform..

[41]  Jun S. Liu,et al.  The Genotype-Tissue Expression (GTEx) pilot analysis: Multitissue gene regulation in humans , 2015, Science.

[42]  M. Stephens,et al.  Flexible statistical methods for estimating and testing effects in genomic studies with multiple conditions , 2016, Nature Genetics.

[43]  L. Liang,et al.  Mapping complex disease traits with global gene expression , 2009, Nature Reviews Genetics.

[44]  Michel G. Nivard,et al.  Characterizing the Relation Between Expression QTLs and Complex Traits: Exploring the Role of Tissue Specificity , 2018, Behavior Genetics.

[45]  Y. Benjamini,et al.  THE CONTROL OF THE FALSE DISCOVERY RATE IN MULTIPLE TESTING UNDER DEPENDENCY , 2001 .

[46]  C M Kendziorski,et al.  On parametric empirical Bayes methods for comparing multiple groups using replicated gene expression profiles , 2003, Statistics in medicine.

[47]  R. Durbin,et al.  Using probabilistic estimation of expression residuals (PEER) to obtain increased power and interpretability of gene expression analyses , 2012, Nature Protocols.

[48]  Rainer Breitling,et al.  Expression Quantitative Trait Loci Are Highly Sensitive to Cellular Differentiation State , 2009, PLoS genetics.