Estimating the strength of expression conservation from high throughput RNA-seq data

MOTIVATION Evolution of gene across species is usually subject to the stabilizing selection to maintain the optimal expression level. While it is generally accepted that the resulting expression conservation may vary considerably among genes, statistically reliable estimation remains challenging, due to few species included in current comparative RNA-seq data with high number of unknown parameters. RESULTS In this paper, we develop a gamma distribution model to describe how the strength of expression conservation (denoted by W) varies among genes. Given the high throughput RNA-seq datasets from multiple species, we then formulate an empirical Bayesian procedure to estimate W for each gene. Our case studies showed that those W-estimates are useful to study the evolutionary pattern of expression conservation. AVAILABILITY AND IMPLEMENTATION Our method has been implemented in the R-package software, TreeExp 2.0, which is publically available at Github develop site https://github.com/hr1912/TreeExp. It involves three functions: estParaGamma, estParaQ, and estParaWBayesian. CONTACT AND SUPPLEMENT INFORMATION The manual for software TreeExp is available at https://github.com/hr1912/TreeExp/tree/master/vignettes. For any question, one may contact Dr. Hang Ruan (Hang.Ruan@uth.tmc.edu).

[1]  S. Pääbo,et al.  Parallel Patterns of Evolution in the Genomes and Transcriptomes of Humans and Chimpanzees , 2005, Science.

[2]  Peter W. Harrison,et al.  The evolution of gene expression and the transcriptome-phenotype relationship. , 2012, Seminars in cell & developmental biology.

[3]  N. Barkai,et al.  A genetic signature of interspecies variations in gene expression , 2006, Nature Genetics.

[4]  M. Gerstein,et al.  RNA-Seq: a revolutionary tool for transcriptomics , 2009, Nature Reviews Genetics.

[5]  S. Pääbo,et al.  Intra- and Interspecific Variation in Primate Gene Expression Patterns , 2002, Science.

[6]  U. Alon,et al.  Optimality and evolutionary tuning of the expression level of a protein , 2005, Nature.

[7]  Ben Lehner,et al.  Epigenetic epistatic interactions constrain the evolution of gene expression , 2013, Molecular systems biology.

[8]  A. Eyre-Walker,et al.  A Selection Index for Gene Expression Evolution and Its Application to the Divergence between Humans and Chimpanzees , 2012, PloS one.

[9]  C. Mathé,et al.  The THAP domain of THAP1 is a large C2CH module with zinc-dependent sequence-specific DNA-binding activity. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[10]  X. Gu,et al.  TreeExp1.0: R Package for Analyzing Expression Evolution Based on RNA-Seq Data. , 2016, Journal of experimental zoology. Part B, Molecular and developmental evolution.

[11]  S. Bergmann,et al.  The evolution of gene expression levels in mammalian organs , 2011, Nature.

[12]  Xun Gu,et al.  Understanding tissue expression evolution: from expression phylogeny to phylogenetic network , 2016, Briefings Bioinform..

[13]  Naama Barkai,et al.  Evolution of gene sequence and gene expression are not correlated in yeast. , 2008, Trends in genetics : TIG.

[14]  C. Pál,et al.  Dosage sensitivity and the evolution of gene families in yeast , 2003, Nature.

[15]  Ronald W. Davis,et al.  Mechanisms of Haploinsufficiency Revealed by Genome-Wide Profiling in Yeast , 2005, Genetics.

[16]  R. Lande NATURAL SELECTION AND RANDOM GENETIC DRIFT IN PHENOTYPIC EVOLUTION , 1976, Evolution; international journal of organic evolution.

[17]  Xun Gu,et al.  Statistical detection of differentially expressed genes based on RNA-seq: from biological to phylogenetic replicates , 2016, Briefings Bioinform..

[18]  Ben Lehner Genotype to phenotype: lessons from model organisms for human genetics , 2013, Nature Reviews Genetics.

[19]  S. Pääbo,et al.  A Neutral Model of Transcriptome Evolution , 2004, PLoS biology.

[20]  T. Hughes,et al.  Mapping pathways and phenotypes by systematic gene overexpression. , 2006, Molecular cell.

[21]  R. Nielsen,et al.  Phylogenetic ANOVA: The Expression Variance and Evolution Model for Quantitative Trait Evolution. , 2015, Systematic biology.

[22]  M. King,et al.  Evolution at two levels in humans and chimpanzees. , 1975, Science.

[23]  Scott A. Rifkin,et al.  Evolution of gene expression in the Drosophila melanogaster subgroup , 2003, Nature Genetics.

[24]  Terence P. Speed,et al.  Expression profiling in primates reveals a rapid evolution of human transcription factors , 2006, Nature.

[25]  X. Gu,et al.  Induced gene expression in human brain after the split from chimpanzee. , 2003, Trends in genetics : TIG.

[26]  S. Bergmann,et al.  Similarities and Differences in Genome-Wide Expression Data of Six Organisms , 2003, PLoS biology.

[27]  X. Gu,et al.  Tissue-driven hypothesis of genomic evolution and sequence-expression correlations , 2007, Proceedings of the National Academy of Sciences.

[28]  T. F. Hansen,et al.  TRANSLATING BETWEEN MICROEVOLUTIONARY PROCESS AND MACROEVOLUTIONARY PATTERNS: THE CORRELATION STRUCTURE OF INTERSPECIFIC DATA , 1996, Evolution; international journal of organic evolution.

[29]  D. Hartl,et al.  Optimization of gene expression by natural selection , 2009, Proceedings of the National Academy of Sciences.

[30]  Rasmus Nielsen,et al.  Modeling gene expression evolution with an extended Ornstein-Uhlenbeck process accounting for within-species variation. , 2014, Molecular biology and evolution.

[31]  B. Papp,et al.  No Evidence That Protein Noise-Induced Epigenetic Epistasis Constrains Gene Expression Evolution. , 2016, Molecular biology and evolution.

[32]  Mark D. Robinson,et al.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data , 2009, Bioinform..

[33]  Edwin Wang,et al.  MicroRNA regulation and interspecific variation of gene expression. , 2007, Trends in genetics : TIG.

[34]  X. Gu,et al.  Predominant gain of promoter TATA box after gene duplication associated with stress responses. , 2011, Molecular biology and evolution.

[35]  D. Hartl,et al.  RATES OF DIVERGENCE IN GENE EXPRESSION PROFILES OF PRIMATES, MICE, AND FLIES: STABILIZING SELECTION AND VARIABILITY AMONG FUNCTIONAL CATEGORIES , 2005, Evolution; international journal of organic evolution.