Negative Correlation between Expression Level and Evolutionary Rate of Long Intergenic Noncoding RNAs

Mammalian genomes contain numerous genes for long noncoding RNAs (lncRNAs). The functions of the lncRNAs remain largely unknown but their evolution appears to be constrained by purifying selection, albeit relatively weakly. To gain insights into the mode of evolution and the functional range of the lncRNA, they can be compared with much better characterized protein-coding genes. The evolutionary rate of the protein-coding genes shows a universal negative correlation with expression: highly expressed genes are on average more conserved during evolution than the genes with lower expression levels. This correlation was conceptualized in the misfolding-driven protein evolution hypothesis according to which misfolding is the principal cost incurred by protein expression. We sought to determine whether long intergenic ncRNAs (lincRNAs) follow the same evolutionary trend and indeed detected a moderate but statistically significant negative correlation between the evolutionary rate and expression level of human and mouse lincRNA genes. The magnitude of the correlation for the lincRNAs is similar to that for equal-sized sets of protein-coding genes with similar levels of sequence conservation. Additionally, the expression level of the lincRNAs is significantly and positively correlated with the predicted extent of lincRNA molecule folding (base-pairing), however, the contributions of evolutionary rates and folding to the expression level are independent. Thus, the anticorrelation between evolutionary rate and expression level appears to be a general feature of gene evolution that might be caused by similar deleterious effects of protein and RNA misfolding and/or other factors, for example, the number of interacting partners of the gene product.

[1]  Aleksey Y. Ogurtsov,et al.  Expression Patterns of Protein Kinases Correlate with Gene Architecture and Evolutionary Rates , 2008, PloS one.

[2]  C. Ponting,et al.  Functionality or transcriptional noise? Evidence for selection within long noncoding RNAs. , 2007, Genome research.

[3]  Sanghyuk Lee,et al.  Accurate quantification of transcriptome from RNA-Seq data by effective length normalization , 2010, Nucleic Acids Res..

[4]  Bruce G. Lindsay,et al.  Computer-assisted analysis of mixtures (C.A.MAN) statistical algorithms , 1992 .

[5]  Yi Zhang,et al.  Imprinting along the Kcnq1 domain on mouse chromosome 7 involves repressive histone methylation and recruitment of Polycomb group complexes , 2004, Nature Genetics.

[6]  J. Mattick,et al.  Long non-coding RNAs: insights into functions , 2009, Nature Reviews Genetics.

[7]  Laurent Duret,et al.  The Xist RNA Gene Evolved in Eutherians by Pseudogenization of a Protein-Coding Gene , 2006, Science.

[8]  Brian Charlesworth,et al.  Patterns of intron sequence evolution in Drosophila are dependent upon length and GC content , 2005, Genome Biology.

[9]  D. M. Krylov,et al.  Gene loss, protein sequence divergence, gene dispensability, expression level, and interactivity are correlated in eukaryotic evolution. , 2003, Genome research.

[10]  Aleksey Y. Ogurtsov,et al.  Analysis of internal loops within the RNA secondary structure in almost quadratic time , 2006, Bioinform..

[11]  J. Mcneil,et al.  XIST RNA paints the inactive X chromosome at interphase: evidence for a novel RNA involved in nuclear/chromosome structure , 1996, The Journal of cell biology.

[12]  L. Hurst The Ka/Ks ratio: diagnosing the form of sequence evolution. , 2002, Trends in genetics : TIG.

[13]  Ziheng Yang,et al.  PAML: a program package for phylogenetic analysis by maximum likelihood , 1997, Comput. Appl. Biosci..

[14]  L. Duret,et al.  Determinants of substitution rates in mammalian genes: expression pattern affects selection intensity but not mutation rate. , 2000, Molecular biology and evolution.

[15]  Ana Serra Barros,et al.  Repression of the human dihydrofolate reductase gene by a non-coding interfering transcript , 2007, Nature.

[16]  Jurg Ott,et al.  Nucleotide frequency variation across human genes. , 2003, Genome research.

[17]  J. Mattick,et al.  Rapid evolution of noncoding RNAs: lack of conservation does not mean lack of function. , 2006, Trends in genetics : TIG.

[18]  Eugene V Koonin,et al.  Evolutionary systems biology: links between gene evolution and function. , 2006, Current opinion in biotechnology.

[19]  D. Lipman,et al.  Relative Contributions of Intrinsic Structural–Functional Constraints and Translation Rate to the Evolution of Protein-Coding Genes , 2010, Genome biology and evolution.

[20]  Svetlana A. Shabalina,et al.  Differential Arginylation of Actin Isoforms Is Regulated by Coding Sequence–Dependent Degradation , 2010, Science.

[21]  S. Sunkin,et al.  Specific expression of long noncoding RNAs in the mouse brain , 2008, Proceedings of the National Academy of Sciences.

[22]  Daniel J. Blankenberg,et al.  Galaxy: A Web‐Based Genome Analysis Tool for Experimentalists , 2010, Current protocols in molecular biology.

[23]  J. Komorowski,et al.  Kcnq1ot1 antisense noncoding RNA mediates lineage-specific transcriptional silencing through chromatin-level regulation. , 2008, Molecular cell.

[24]  Arcadi Navarro,et al.  Patterns and rates of intron divergence between humans and chimpanzees , 2007, Genome Biology.

[25]  J. Rinn,et al.  A Large Intergenic Noncoding RNA Induced by p53 Mediates Global Gene Repression in the p53 Response , 2010, Cell.

[26]  C. Pál,et al.  Highly expressed genes in yeast evolve slowly. , 2001, Genetics.

[27]  Douglas Jb Computer-assisted analysis of mixtures. , 2000 .

[28]  L. Hurst,et al.  Human antisense genes have unusually short introns: evidence for selection for rapid transcription. , 2005, Trends in genetics : TIG.

[29]  M. Lynch The frailty of adaptive hypotheses for the origins of organismal complexity , 2007, Proceedings of the National Academy of Sciences.

[30]  Peter F Stadler,et al.  Fast and reliable prediction of noncoding RNAs , 2005, Proc. Natl. Acad. Sci. USA.

[31]  Howard Y. Chang,et al.  Genome-wide measurement of RNA secondary structure in yeast , 2010, Nature.

[32]  Michael Zuker,et al.  Mfold web server for nucleic acid folding and hybridization prediction , 2003, Nucleic Acids Res..

[33]  P. Stadler,et al.  RNA Maps Reveal New RNA Classes and a Possible Function for Pervasive Transcription , 2007, Science.

[34]  Ivo L. Hofacker,et al.  The RNAz web server: prediction of thermodynamically stable and evolutionarily conserved RNA structures , 2007, Nucleic Acids Res..

[35]  Aleksey Y. Ogurtsov,et al.  OWEN: aligning long collinear regions of genomes , 2002, Bioinform..

[36]  Thomas E. Royce,et al.  Global Identification of Human Transcribed Sequences with Genome Tiling Arrays , 2004, Science.

[37]  Michael F. Lin,et al.  Chromatin signature reveals over a thousand highly conserved large non-coding RNAs in mammals , 2009, Nature.

[38]  K. Semrad,et al.  Proteins with RNA Chaperone Activity: A World of Diverse Proteins with a Common Task—Impediment of RNA Misfolding , 2010, Biochemistry research international.

[39]  T. Hughes,et al.  Establishing legitimacy and function in the new transcriptome. , 2009, Briefings in functional genomics & proteomics.

[40]  C. Ponting,et al.  Transcribed dark matter: meaning or myth? , 2010, Human molecular genetics.

[41]  Jian-Rong Yang,et al.  Impact of translational error-induced and error-free misfolding on the rate of protein evolution , 2010, Molecular systems biology.

[42]  M. Lynch,et al.  The Origins of Genome Complexity , 2003, Science.

[43]  J. Mattick,et al.  Non-coding RNA. , 2006, Human molecular genetics.

[44]  M. Lazar,et al.  Inhibition of c-erbA mRNA splicing by a naturally occurring antisense RNA. , 1991, The Journal of biological chemistry.

[45]  Liran Carmel,et al.  Unifying measures of gene function and evolution , 2006, Proceedings of the Royal Society B: Biological Sciences.

[46]  Wen-Hsiung Li,et al.  Mammalian housekeeping genes evolve more slowly than tissue-specific genes. , 2004, Molecular biology and evolution.

[47]  Liran Carmel,et al.  Widespread positive selection in synonymous sites of mammalian genes. , 2007, Molecular biology and evolution.

[48]  K. Williams,et al.  Dendritic BC1 RNA in translational control mechanisms , 2005, The Journal of cell biology.

[49]  Ning Ma,et al.  BLAST+: architecture and applications , 2009, BMC Bioinformatics.

[50]  Aleksey Y. Ogurtsov,et al.  A periodic pattern of mRNA secondary structure created by the genetic code , 2006, Nucleic acids research.

[51]  T B Nesterova,et al.  Characterization of the genomic Xist locus in rodents reveals conservation of overall gene structure and tandem repeats but rapid evolution of unique sequence. , 2001, Genome research.

[52]  Eugene V Koonin,et al.  Universal distribution of protein evolution rates as a consequence of protein folding physics , 2010, Proceedings of the National Academy of Sciences.

[53]  Samuel S. Shepard,et al.  Critical association of ncRNA with introns , 2010, Nucleic acids research.

[54]  L. Hurst,et al.  How do synonymous mutations affect fitness? , 2007, BioEssays : news and reviews in molecular, cellular and developmental biology.

[55]  P. Avner,et al.  2-D Structure of the A Region of Xist RNA and Its Implication for PRC2 Association , 2010, PLoS biology.

[56]  E. Birney,et al.  Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs , 2002, Nature.

[57]  Jennifer A. Mitchell,et al.  The Air Noncoding RNA Epigenetically Silences Transcription by Targeting G9a to Chromatin , 2008, Science.

[58]  P. Calabresi,et al.  The Brain Cytoplasmic RNA BC1 Regulates Dopamine D2 Receptor-Mediated Transmission in the Striatum , 2007, The Journal of Neuroscience.

[59]  J. Goodrich,et al.  Non-coding-RNA regulators of RNA polymerase II transcription , 2006, Nature Reviews Molecular Cell Biology.

[60]  L Milanesi,et al.  Protein-coding regions prediction combining similarity searches and conservative evolutionary properties of protein-coding sequences. , 1999, Gene.

[61]  Brian S. Clark,et al.  The Evf-2 noncoding RNA is transcribed from the Dlx-5/6 ultraconserved region and functions as a Dlx-2 transcriptional coactivator. , 2006, Genes & development.

[62]  E. Koonin,et al.  Conservation and coevolution in the scale-free human gene coexpression network. , 2004, Molecular biology and evolution.

[63]  R. Russell RNA misfolding and the action of chaperones. , 2008, Frontiers in bioscience : a journal and virtual library.

[64]  Terrence S. Furey,et al.  The UCSC Table Browser data retrieval tool , 2004, Nucleic Acids Res..

[65]  A. G. de Herreros,et al.  A natural antisense transcript regulates Zeb2/Sip1 gene expression during Snail1-induced epithelial-mesenchymal transition. , 2008, Genes & development.

[66]  D. Haussler,et al.  Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. , 2005, Genome research.

[67]  J. Darlix,et al.  The ubiquitous nature of RNA chaperone proteins. , 2002, Progress in nucleic acid research and molecular biology.

[68]  J. Felsenstein Inferring phylogenies from protein sequences by parsimony, distance, and likelihood methods. , 1996, Methods in enzymology.

[69]  C. Ponting,et al.  Long noncoding RNA genes: conservation of sequence and brain expression among diverse amniotes , 2010, Genome Biology.

[70]  C. Ponting,et al.  Evolution and Functions of Long Noncoding RNAs , 2009, Cell.

[71]  M. Lynch The origins of eukaryotic gene structure. , 2006, Molecular biology and evolution.

[72]  Celso A. Espinoza,et al.  Human Alu RNA is a modular transacting repressor of mRNA transcription during heat shock. , 2008, Molecular cell.

[73]  P Schlattmann,et al.  Computer-assisted analysis of mixtures (C.A.MAM): statistical algorithms. , 1992, Biometrics.

[74]  C. Ponting,et al.  Catalogues of mammalian long noncoding RNAs: modest conservation and incompleteness , 2009, Genome Biology.

[75]  K. Shokat,et al.  Human Catechol-O-Methyltransferase Haplotypes Modulate Protein Expression by Altering mRNA Secondary Structure , 2006, Science.

[76]  Claus O. Wilke,et al.  Mistranslation-Induced Protein Misfolding as a Dominant Constraint on Coding-Sequence Evolution , 2008, Cell.

[77]  Tim R. Mercer,et al.  NRED: a database of long noncoding RNA expression , 2008, Nucleic Acids Res..

[78]  Celso A. Espinoza,et al.  Characterization of the structure, function, and mechanism of B2 RNA, an ncRNA repressor of RNA polymerase II transcription. , 2007, RNA.

[79]  Ewan Birney,et al.  Estimating the neutral rate of nucleotide substitution using introns. , 2006, Molecular biology and evolution.

[80]  Kazuho Ikeo,et al.  Transcriptional Interferences in cis Natural Antisense Transcripts of Humans and Mice , 2007, Genetics.

[81]  N. Brockdorff,et al.  A Dual Origin of the Xist Gene from a Protein-Coding Gene and a Set of Transposable Elements , 2008, PloS one.

[82]  T. Shibata,et al.  Stepwise chromatin remodelling by a cascade of transcription initiation of non-coding RNAs , 2008, Nature.

[83]  Alexey S Kondrashov,et al.  Classification of common conserved sequences in mammalian intergenic regions. , 2002, Human molecular genetics.

[84]  Eugene V. Koonin,et al.  Constraints and plasticity in genome and molecular-phenome evolution , 2010, Nature Reviews Genetics.

[85]  P Schlattmann,et al.  Recent developments in computer-assisted analysis of mixtures. , 1998, Biometrics.

[86]  A. Nekrutenko,et al.  Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences , 2010, Genome Biology.

[87]  Fred Winston,et al.  Intergenic transcription is required to repress the Saccharomyces cerevisiae SER3 gene , 2004, Nature.

[88]  Rick Russell,et al.  Kinetic redistribution of native and misfolded RNAs by a DEAD-box chaperone , 2007, Nature.

[89]  C. Wilke,et al.  The evolutionary consequences of erroneous protein synthesis , 2009, Nature Reviews Genetics.

[90]  S. Salzberg,et al.  The Transcriptional Landscape of the Mammalian Genome , 2005, Science.

[91]  Janet Kelso,et al.  Functionality of Intergenic Transcription: An Evolutionary Comparison , 2006, PLoS genetics.

[92]  R. Konrat,et al.  RNA Chaperones, RNA Annealers and RNA Helicases , 2007, RNA biology.