Rsite2: an efficient computational method to predict the functional sites of noncoding RNAs

Noncoding RNAs (ncRNAs) represent a big class of important RNA molecules. Given the large number of ncRNAs, identifying their functional sites is becoming one of the most important topics in the post-genomic era, but available computational methods are limited. For the above purpose, we previously presented a tertiary structure based method, Rsite, which first calculates the distance metrics defined in Methods with the tertiary structure of an ncRNA and then identifies the nucleotides located within the extreme points in the distance curve as the functional sites of the given ncRNA. However, the application of Rsite is largely limited because of limited RNA tertiary structures. Here we present a secondary structure based computational method, Rsite2, based on the observation that the secondary structure based nucleotide distance is strongly positively correlated with that derived from tertiary structure. This makes it reasonable to replace tertiary structure with secondary structure, which is much easier to obtain and process. Moreover, we applied Rsite2 to three ncRNAs (tRNA (Lys), Diels-Alder ribozyme, and RNase P) and a list of human mitochondria transcripts. The results show that Rsite2 works well with nearly equivalent accuracy as Rsite but is much more feasible and efficient. Finally, a web-server, the source codes, and the dataset of Rsite2 are available at http://www.cuialb.cn/rsite2.

[1]  Janet M. Thornton,et al.  The Catalytic Site Atlas: a resource of catalytic sites and residues identified in enzymes using structural data , 2004, Nucleic Acids Res..

[2]  Martin Vingron,et al.  IntAct: an open source molecular interaction database , 2004, Nucleic Acids Res..

[3]  Q. Cui,et al.  An Analysis of Human MicroRNA and Disease Associations , 2008, PloS one.

[4]  Aleksandra Filipovska,et al.  Mapping of mitochondrial RNA-protein interactions by digital RNase footprinting. , 2013, Cell reports.

[5]  Yingyuan Mo MicroRNA regulatory networks and human disease , 2012, Cellular and Molecular Life Sciences.

[6]  Amos Bairoch,et al.  ScanProsite: detection of PROSITE signature matches and ProRule-associated functional and structural residues in proteins , 2006, Nucleic Acids Res..

[7]  S. Holbrook Structural principles from large RNAs. , 2008, Annual review of biophysics.

[8]  Jason A Greenbaum,et al.  Mapping nucleic acid structure by hydroxyl radical cleavage. , 2005, Current opinion in chemical biology.

[9]  Shaojie Zhang,et al.  Computational analysis of RNA structures with chemical probing data. , 2015, Methods.

[10]  Brian T. Sutch,et al.  Predicting protein functional sites with phylogenetic motifs , 2004, Proteins.

[11]  G. Helt,et al.  Transcriptional Maps of 10 Human Chromosomes at 5-Nucleotide Resolution , 2005, Science.

[12]  Dennis R. Livesay,et al.  MINER: software for phylogenetic motif identification , 2005, Nucleic Acids Res..

[13]  J. Doudna,et al.  Insights into RNA structure and function from genome-wide studies , 2014, Nature Reviews Genetics.

[14]  Kuo-Chen Chou,et al.  Pattern recognition methods for protein functional site prediction. , 2005, Current protein & peptide science.

[15]  K. Kinoshita,et al.  Identification of the ligand binding sites on the molecular surface of proteins , 2005, Protein science : a publication of the Protein Society.

[16]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[17]  Xing Chen,et al.  LncRNADisease: a database for long-non-coding RNA-associated diseases , 2012, Nucleic Acids Res..

[18]  P. Fraser,et al.  No-Nonsense Functions for Long Noncoding RNAs , 2011, Cell.

[19]  D. Baker,et al.  Automated de novo prediction of native-like RNA tertiary structures , 2007, Proceedings of the National Academy of Sciences.

[20]  Karin M. Verspoor,et al.  Text Mining Improves Prediction of Protein Functional Sites , 2012, PloS one.

[21]  Gil Amitai,et al.  Network analysis of protein structures identifies functional residues. , 2004, Journal of molecular biology.

[22]  Amos Bairoch,et al.  PROSITE, a protein domain database for functional characterization and annotation , 2009, Nucleic Acids Res..

[23]  Dennis R. Livesay,et al.  How accurate and statistically robust are catalytic site predictions based on closeness centrality? , 2007, BMC Bioinformatics.

[24]  Jianwei Li,et al.  Rsite: a computational method to identify the functional sites of noncoding RNAs , 2015, Scientific reports.

[25]  T. Nilsen RNase footprinting to map sites of RNA-protein interactions. , 2014, Cold Spring Harbor protocols.

[26]  J. Mattick,et al.  Structure and function of long noncoding RNAs in epigenetic regulation , 2013, Nature Structural &Molecular Biology.

[27]  Mary Jo Ondrechen,et al.  POOL server: machine learning application for functional site prediction in proteins , 2012, Bioinform..

[28]  Steven Busan,et al.  RNA motif discovery by SHAPE and mutational profiling (SHAPE-MaP) , 2014, Nature Methods.

[29]  John D. Westbrook,et al.  The Nucleic Acid Database: new features and capabilities , 2013, Nucleic Acids Res..

[30]  Terrence S. Furey,et al.  The UCSC Table Browser data retrieval tool , 2004, Nucleic Acids Res..

[31]  Kanti V. Mardia,et al.  A Probabilistic Model of RNA Conformational Space , 2009, PLoS Comput. Biol..

[32]  Ian M. Donaldson,et al.  BIND: the Biomolecular Interaction Network Database , 2001, Nucleic Acids Res..

[33]  Yang Li,et al.  HMDD v2.0: a database for experimentally supported human microRNA and disease associations , 2013, Nucleic Acids Res..

[34]  Jeannie T. Lee Epigenetic Regulation by Long Noncoding RNAs , 2012, Science.

[35]  Feng Ding,et al.  iFoldRNA: three-dimensional RNA structure prediction and folding , 2008, Bioinform..

[36]  Sven Diederichs,et al.  The four dimensions of noncoding RNA conservation. , 2014, Trends in genetics : TIG.

[37]  Ioannis Xenarios,et al.  DIP: the Database of Interacting Proteins , 2000, Nucleic Acids Res..

[38]  Walter Fontana,et al.  Fast folding and comparison of RNA secondary structures , 1994 .

[39]  Eric Westhof,et al.  Nucleic Acids and Molecular Biology , 1994, Nucleic Acids and Molecular Biology.

[40]  L. Holm,et al.  The Pfam protein families database , 2005, Nucleic Acids Res..

[41]  P. Wollenzien,et al.  Short-range RNA-RNA crosslinking methods to determine rRNA structure and interactions. , 2001, Methods.

[42]  R. Nussinov,et al.  Residue centrality, functionally important residues, and active site shape: Analysis of enzyme and non‐enzyme families , 2006, Protein science : a publication of the Protein Society.

[43]  Zhaolei Zhang,et al.  Computational Biology in microRNA , 2015, Wiley interdisciplinary reviews. RNA.

[44]  A. Ferré-D’Amaré,et al.  RNA quaternary structure and global symmetry. , 2015, Trends in biochemical sciences.

[45]  Thomas E. Royce,et al.  Global Identification of Human Transcribed Sequences with Genome Tiling Arrays , 2004, Science.

[46]  B KC Dukka,et al.  Structure-based Methods for Computational Protein Functional Site Prediction , 2013, Computational and structural biotechnology journal.

[47]  Craig L. Zirbel,et al.  Nonredundant 3D Structure Datasets for RNA Knowledge Extraction and Benchmarking , 2012 .

[48]  A. S. Krasilnikov,et al.  Footprinting analysis of interactions between the largest eukaryotic RNase P/MRP protein Pop1 and RNase P/MRP RNA components , 2015, RNA.

[49]  Anne Condon,et al.  RNA STRAND: The RNA Secondary Structure and Statistical Analysis Database , 2008, BMC Bioinformatics.

[50]  E. Birney,et al.  Pfam: the protein families database , 2013, Nucleic Acids Res..

[51]  Lin Yang,et al.  Long noncoding RNAs: fresh perspectives into the RNA world. , 2014, Trends in biochemical sciences.

[52]  C. Zwieb The principles of RNA structure architecture. , 2014, Methods in molecular biology.

[53]  Vladimir A. Ivanisenko,et al.  PDBSite: a database of the 3D structure of protein functional sites , 2004, Nucleic Acids Res..

[54]  R. Greaves,et al.  Active site identification through geometry-based and sequence profile-based calculations: burial of catalytic clefts. , 2005, Journal of molecular biology.