Rfam: Wikipedia, clans and the “decimal” release

The Rfam database aims to catalogue non-coding RNAs through the use of sequence alignments and statistical profile models known as covariance models. In this contribution, we discuss the pros and cons of using the online encyclopedia Wikipedia as a source of community-derived annotation. We also describe the grouping of related RNA families into clans and new developments to the website. Rfam is available on the Web at http://rfam.sanger.ac.uk.
