A method for RNA structure prediction shows evidence for structure in lncRNAs

To compare the secondary structures of RNA molecules, we developed the CROSSalign method. CROSSalign combines the Computational Recognition Of Secondary Structure (CROSS) algorithm, which predicts RNA secondary structure at single-nucleotide resolution using sequence information only, with the Dynamic Time Warping (DTW) method, which aligns profiles of different lengths. We applied CROSSalign to investigate the structural conservation of long non-coding RNAs such as XIST and HOTAIR, as well as ssRNA viruses including HIV. The algorithm is able to find homologues among thousands of possible matches and to identify the exact regions of similarity between profiles of different lengths. CROSSalign is freely available at the webpage http://service.tartaglialab.com//new_submission/crossalign.
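To illustrate how DTW can align two per-nucleotide profiles of different lengths, the following is a minimal sketch of the textbook DTW recurrence. It is not the CROSSalign implementation: the function name `dtw_align`, the absolute-difference local cost, the symmetric step pattern, and the toy profile values are all assumptions chosen for illustration, and real CROSS profiles would come from the CROSS predictor rather than being typed in by hand.

```python
import math


def dtw_align(a, b):
    """Align two numeric profiles with classic dynamic time warping.

    Returns (distance, path), where path is a list of (i, j) index pairs
    matching positions of `a` to positions of `b`.
    Assumption: local cost is |a[i] - b[j]| and the allowed steps are
    match, insertion, and deletion with equal weight.
    """
    n, m = len(a), len(b)
    # cost[i][j] = minimal cumulative cost of aligning a[:i] with b[:j]
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j - 1],  # advance in both profiles
                cost[i - 1][j],      # advance in a only
                cost[i][j - 1],      # advance in b only
            )

    # Backtrack from the end to recover the warping path.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        candidates = [
            (cost[i - 1][j - 1], i - 1, j - 1),
            (cost[i - 1][j], i - 1, j),
            (cost[i][j - 1], i, j - 1),
        ]
        _, i, j = min(candidates)
    path.reverse()
    return cost[n][m], path


if __name__ == "__main__":
    # Hypothetical structure-propensity profiles of different lengths.
    profile_a = [0.0, 1.0, 0.0]
    profile_b = [0.0, 1.0, 1.0, 0.0]
    distance, path = dtw_align(profile_a, profile_b)
    print(distance)
    print(path)
```

The warping path is what makes it possible to say which positions in one profile correspond to which positions in the other. A distance normalised by profile length, or an open-ended variant that searches for a short profile inside a longer one, would be natural extensions, but whether and how CROSSalign applies them is not stated in this abstract.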
