Secondary structure impacts patterns of selection in human lncRNAs

BackgroundMetazoans transcribe many long non-coding RNAs (lncRNAs) that are poorly conserved and whose function remains unknown. This has raised the questions of what fraction of the predicted lncRNAs is actually functional, and whether selection can effectively constrain lncRNAs in species with small effective population sizes such as human populations.ResultsHere we evaluate signatures of selection in human lncRNAs using inter-specific data and intra-specific comparisons from five major populations, as well as by assessing relationships between sequence variation and predictions of secondary structure. In all analyses we included a reference of functionally characterized lncRNAs. Altogether, our results show compelling evidence of recent purifying selection acting on both characterized and predicted lncRNAs. We found that RNA secondary structure constrains sequence variation in lncRNAs, so that polymorphisms are depleted in paired regions with low accessibility and tend to be neutral with respect to structural stability.ConclusionsImportant implications of our results are that secondary structure plays a role in the functionality of lncRNAs, and that the set of predicted lncRNAs contains a large fraction of functional ones that may play key roles that remain to be discovered.

[1]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[2]  R. Durbin,et al.  Inference of Human Population History From Whole Genome Sequence of A Single Individual , 2011, Nature.

[3]  Yong Zhang,et al.  CPC: assess the protein-coding potential of transcripts using sequence features and support vector machine , 2007, Nucleic Acids Res..

[4]  D. Bartel,et al.  lincRNAs: Genomics, Evolution, and Mechanisms , 2013, Cell.

[5]  R. Durbin,et al.  Inference of human population history from individual whole-genome sequences. , 2011, Nature.

[6]  Howard Y. Chang,et al.  Long noncoding RNA HOTAIR reprograms chromatin state to promote cancer metastasis , 2010, Nature.

[7]  J. Mattick,et al.  Genome-wide analysis of long noncoding RNA stability , 2012, Genome research.

[8]  J. Rinn,et al.  Many human large intergenic noncoding RNAs associate with chromatin-modifying complexes and affect gene expression , 2009, Proceedings of the National Academy of Sciences.

[9]  M. Karin,et al.  A large noncoding RNA is a marker for murine hepatocellular carcinomas and a spectrum of human carcinomas , 2007, Oncogene.

[10]  Wilfried Haerty,et al.  Mutations within lncRNAs are effectively selected against in fruitfly but not in human , 2013, Genome Biology.

[11]  D. Bartel,et al.  Long noncoding RNAs in C. elegans , 2012, Genome research.

[12]  J. Steitz,et al.  Formation of triple-helical structures by the 3′-end sequences of MALAT1 and MENβ noncoding RNAs , 2012, Proceedings of the National Academy of Sciences.

[13]  D. Bartel,et al.  Principles of long noncoding RNA evolution derived from direct comparison of transcriptomes in 17 species. , 2015, Cell reports.

[14]  Sudhir Kumar,et al.  Tree of Life Reveals Clock-Like Speciation and Diversification , 2014, Molecular biology and evolution.

[15]  Frank Grützner,et al.  The evolution of lncRNA repertoires and expression patterns in tetrapods , 2014, Nature.

[16]  Terrence S. Furey,et al.  The UCSC Table Browser data retrieval tool , 2004, Nucleic Acids Res..

[17]  Manolis Kellis,et al.  Evolutionary dynamics and tissue specificity of human long noncoding RNAs in six mammals , 2014, Genome research.

[18]  D. Haussler,et al.  Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. , 2005, Genome research.

[19]  David G. Knowles,et al.  The GENCODE v7 catalog of human long noncoding RNAs: Analysis of their gene structure, evolution, and expression , 2012, Genome research.

[20]  P. Stadler,et al.  Comparison of splice sites reveals that long noncoding RNAs are evolutionarily well conserved , 2015, RNA.

[21]  Y. Zhang,et al.  In vivo genome-wide profiling of RNA secondary structure reveals novel regulatory features , 2013, Nature.

[22]  Yunli Zhou,et al.  Maternally expressed gene 3 (MEG3) noncoding ribonucleic acid: isoform structure, expression, and functions. , 2010, Endocrinology.

[23]  J. Essigmann,et al.  Role of tautomerism in RNA biochemistry , 2015, RNA.

[24]  Karissa Y. Sanbonmatsu,et al.  Structural architecture of the human long non-coding RNA, steroid receptor RNA activator , 2012, Nucleic acids research.

[25]  M. Lercher,et al.  PopGenome: An Efficient Swiss Army Knife for Population Genomic Analyses in R , 2014, Molecular biology and evolution.

[26]  D. Reich,et al.  No evidence that selection has been less effective at removing deleterious mutations in Europeans than in Africans , 2014, Nature Genetics.

[27]  Saba Valadkhan,et al.  Computational analysis of functional long noncoding RNAs reveals lack of peptide-coding capacity and parallels with 3' UTRs. , 2012, RNA.

[28]  Francesc Calafell,et al.  Recombination gives a new insight in the effective population size and the history of the old world human populations. , 2012, Molecular biology and evolution.

[29]  P. Stadler,et al.  Widespread purifying selection on RNA structure in mammals , 2013, Nucleic acids research.

[30]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[31]  D. Haussler,et al.  Phylogenomic resources at the UCSC Genome Browser. , 2008, Methods in molecular biology.

[32]  Laurent Journot,et al.  H19 acts as a trans regulator of the imprinted gene network controlling growth in mice , 2009, Development.

[33]  Gonçalo R. Abecasis,et al.  The variant call format and VCFtools , 2011, Bioinform..

[34]  D. Reich,et al.  No evidence that natural selection has been less effective at removing deleterious mutations in Europeans than in West Africans , 2014, bioRxiv.

[35]  Aaron R. Quinlan,et al.  Bioinformatics Applications Note Genome Analysis Bedtools: a Flexible Suite of Utilities for Comparing Genomic Features , 2022 .

[36]  Bronwen L. Aken,et al.  GENCODE: The reference human genome annotation for The ENCODE Project , 2012, Genome research.

[37]  M. Nei,et al.  Mathematical model for studying genetic variation in terms of restriction endonucleases. , 1979, Proceedings of the National Academy of Sciences of the United States of America.

[38]  Jianzhi Zhang,et al.  Human long noncoding RNAs are substantially less folded than messenger RNAs. , 2015, Molecular biology and evolution.

[39]  Anna Marie Pyle,et al.  HOTAIR forms an intricate and modular secondary structure. , 2015, Molecular cell.

[40]  Chris P. Ponting,et al.  Rapid Turnover of Long Noncoding RNAs and the Evolution of Gene Expression , 2012, PLoS genetics.

[41]  Carolyn J. Brown,et al.  A gene from the region of the human X inactivation centre is expressed exclusively from the inactive X chromosome , 1991, Nature.

[42]  K. Pollard,et al.  Detection of nonneutral substitution rates on mammalian phylogenies. , 2010, Genome research.

[43]  Rob W. Ness,et al.  Assessing Recent Selection and Functionality at Long Noncoding RNA Loci in the Mouse Genome , 2015, Genome biology and evolution.

[44]  Michael F. Lin,et al.  Chromatin signature reveals over a thousand highly conserved large non-coding RNAs in mammals , 2009, Nature.

[45]  Ye Ding,et al.  A structural interpretation of the effect of GC-content on efficiency of RNA interference , 2009, BMC Bioinformatics.

[46]  M. Albà,et al.  Sequence shortening in the rodent ancestor. , 2012, Genome research.

[47]  S. Dhanasekaran,et al.  The landscape of long noncoding RNAs in the human transcriptome , 2015, Nature Genetics.

[48]  R. Hudson,et al.  A new statistic for detecting genetic differentiation. , 2000, Genetics.

[49]  K. Struhl Transcriptional noise and the fidelity of initiation by RNA polymerase II , 2007, Nature Structural &Molecular Biology.

[50]  Thomas D. Schmittgen,et al.  Ultraconserved regions encoding ncRNAs are altered in human leukemias and carcinomas. , 2007, Cancer cell.

[51]  Ivo L. Hofacker,et al.  Vienna RNA secondary structure server , 2003, Nucleic Acids Res..

[52]  K. Morris,et al.  Evolutionary conservation of long non-coding RNAs; sequence, structure, function. , 2014, Biochimica et biophysica acta.

[53]  C. Ponting,et al.  Unexpected selection to retain high GC content and splicing enhancers within exons of multiexonic lncRNA loci , 2015, RNA.

[54]  Cole Trapnell,et al.  Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses. , 2011, Genes & development.

[55]  Richard R. Hudson,et al.  Generating samples under a Wright-Fisher neutral model of genetic variation , 2002, Bioinform..

[56]  M. Esteller,et al.  CpG island hypermethylation-associated silencing of non-coding RNAs transcribed from ultraconserved regions in human cancer , 2010, Oncogene.

[57]  Laurence D. Hurst,et al.  Purifying Selection on Splice-Related Motifs, Not Expression Level nor RNA Folding, Explains Nearly All Constraint on Human lincRNAs , 2014, Molecular biology and evolution.

[58]  John S. Mattick,et al.  lncRNAdb: a reference database for long noncoding RNAs , 2010, Nucleic Acids Res..

[59]  F. Tajima Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. , 1989, Genetics.

[60]  Structural basis for molecular discrimination by a 3',3'-cGAMP sensing riboswitch. , 2015, Cell reports.

[61]  An-Yuan Guo,et al.  lncRNASNP: a database of SNPs in lncRNAs and their potential functions in human and mouse , 2014, Nucleic Acids Res..

[62]  Marcel E. Dinger,et al.  lncRNAdb v2.0: expanding the reference database for functional long noncoding RNAs , 2014, Nucleic Acids Res..

[63]  D. Bartel,et al.  Conserved Function of lincRNAs in Vertebrate Embryonic Development despite Rapid Sequence Evolution , 2011, Cell.

[64]  Hadley Wickham,et al.  ggplot2 - Elegant Graphics for Data Analysis (2nd Edition) , 2017 .

[65]  John N. Hutchinson,et al.  A screen for nuclear transcripts identifies two linked noncoding RNAs associated with SC35 splicing domains , 2007, BMC Genomics.

[66]  C. Ponting,et al.  Evolution and Functions of Long Noncoding RNAs , 2009, Cell.

[67]  Kenny Q. Ye,et al.  An integrated map of genetic variation from 1,092 human genomes , 2012, Nature.