New Insights into the Genetic Control of Gene Expression using a Bayesian Multi-tissue Approach

The majority of expression quantitative trait locus (eQTL) studies have been carried out in single tissues or cell types, using methods that ignore information shared across tissues. Although global analysis of RNA expression in multiple tissues is now feasible, few integrated statistical frameworks have been developed to date that combine joint analysis of gene expression across tissues with simultaneous analysis of multiple genetic variants. Here, we propose Sparse Bayesian Regression models for mapping eQTLs within individual tissues and simultaneously across tissues. Testing these on a set of 2,000 genes in four tissues, we demonstrate that our methods are more powerful than traditional approaches in revealing the true complexity of the eQTL landscape at the systems level. Highlighting the power of our method, we identified a two-eQTL model (cis/trans) for the Hopx gene that was experimentally validated and was not detected by conventional approaches. We showed common genetic regulation of gene expression across the four tissues for ∼27% of transcripts, providing a >5-fold increase in eQTL detection compared with single-tissue analyses at a 5% FDR level. These findings provide a new opportunity to uncover complex genetic regulatory mechanisms controlling global gene expression. The generality of our modelling approach makes it adaptable to other model systems and to humans, with broad application to the analysis of multiple intermediate and whole-body phenotypes.
