Non‐Coding RNA Analysis Using the Rfam Database

Rfam is a database of non‐coding RNA families in which each family is represented by a multiple sequence alignment, a consensus secondary structure, and a covariance model. Using a combination of manual and literature‐based curation and a custom software pipeline, Rfam converts descriptions of RNA families found in the scientific literature into computational models that can be used to annotate RNAs belonging to those families in any DNA or RNA sequence. Valuable research outputs that are often locked up in figures and supplementary information files are encapsulated in Rfam entries and made accessible through the Rfam Web site. The data produced by Rfam have a broad application, from genome annotation to providing training sets for algorithm development. This article gives an overview of how to search and navigate the Rfam Web site, and how to annotate sequences with RNA families. The Rfam database is freely available at http://rfam.org. © 2018 by John Wiley & Sons, Inc.

[1]  Zasha Weinberg,et al.  Detection of 224 candidate structured RNAs by comparative analysis of specific subsets of intergenic regions , 2017, Nucleic acids research.

[2]  R. Breaker,et al.  Riboswitch diversity and distribution , 2017, RNA.

[3]  Lars Barquist,et al.  Building non-coding RNA families , 2012, 1206.4087.

[4]  Scott Federhen,et al.  The NCBI Taxonomy database , 2011, Nucleic Acids Res..

[5]  A. Quinlan BEDTools: The Swiss‐Army Tool for Genome Feature Analysis , 2014, Current protocols in bioinformatics.

[6]  Robert D. Finn,et al.  Rfam 13.0: shifting to a genome-centric resource for non-coding RNA families , 2017, Nucleic Acids Res..

[7]  Sean R. Eddy,et al.  Rfam: an RNA family database , 2003, Nucleic Acids Res..

[8]  P. Gardner,et al.  Annotating RNA motifs in sequences and alignments , 2014, bioRxiv.

[9]  Alessandro Vullo,et al.  Ensembl 2017 , 2016, Nucleic Acids Res..

[10]  Robert D. Finn,et al.  Rfam: Wikipedia, clans and the “decimal” release , 2010, Nucleic Acids Res..

[11]  Sebastian Will,et al.  RNAalifold: improved consensus structure prediction for RNA alignments , 2008, BMC Bioinformatics.

[12]  J. Steitz,et al.  The Noncoding RNA Revolution—Trashing Old Rules to Forge New Ones , 2014, Cell.

[13]  Eric P. Nawrocki,et al.  Annotating functional RNAs in genomes using Infernal. , 2014, Methods in molecular biology.

[14]  F. Narberhaus,et al.  Bacterial RNA thermometers: molecular zippers and switches , 2012, Nature Reviews Microbiology.

[15]  Alex Bateman,et al.  RNAcentral: a comprehensive database of non-coding RNA sequences , 2016, Nucleic acids research.

[16]  Robert D. Finn,et al.  Rfam 12.0: updates to the RNA families database , 2014, Nucleic Acids Res..

[17]  Sean R. Eddy,et al.  Infernal 1.1: 100-fold faster RNA homology searches , 2013, Bioinform..

[18]  S. Eddy,et al.  A statistical test for conserved RNA structure shows lack of evidence for structure in lncRNAs , 2016, Nature Methods.

[19]  Daniel Lai,et al.  R-chie: a web server and R package for visualizing RNA secondary structures , 2012, Nucleic acids research.

[20]  The RNAcentral Consortium RNAcentral: a comprehensive database of non-coding RNA sequences , 2016, Nucleic Acids Res..