Structural profiles of human miRNA families from pairwise clustering

UNLABELLED MicroRNAs (miRNAs) are a group of small, approximately 21 nt long, riboregulators inhibiting gene expression at a post-transcriptional level. Their most distinctive structural feature is the foldback hairpin of their precursor pre-miRNAs. Even though each pre-miRNA deposited in miRBase has its secondary structure already predicted, little is known about the patterns of structural conservation among pre-miRNAs. We address this issue by clustering the human pre-miRNA sequences based on pairwise, sequence and secondary structure alignment using FOLDALIGN, followed by global multiple alignment of obtained clusters by WAR. As a result, the common secondary structure was successfully determined for four FOLDALIGN clusters: the RF00027 structural family of the Rfam database and three clusters with previously undescribed consensus structures. AVAILABILITY http://genome.ku.dk/resources/mirclust

[1]  Jan Gorodkin,et al.  MicroRNA sequence motifs reveal asymmetry between the stem arms , 2006, Comput. Biol. Chem..

[2]  David G. Stork,et al.  Pattern Classification , 1973 .

[3]  Masaomi Kato,et al.  microRNAs: small molecules with big roles –C. elegans to human cancer , 2008, Biology of the cell.

[4]  Y. Hayashizaki,et al.  Mouse‐centric comparative transcriptomics of protein coding and non‐coding RNAs , 2004, BioEssays : news and reviews in molecular, cellular and developmental biology.

[5]  Thomas D. Schmittgen,et al.  Regulation of microRNA processing in development, differentiation and cancer , 2008, Journal of cellular and molecular medicine.

[6]  Jaume Bertranpetit,et al.  Comparative analysis of cancer genes in the human and chimpanzee genomes , 2006, BMC Genomics.

[7]  W. Pearson Searching protein sequence libraries: comparison of the sensitivity and selectivity of the Smith-Waterman and FASTA algorithms. , 1991, Genomics.

[8]  Baohong Zhang,et al.  MicroRNA: A new player in stem cells , 2006, Journal of cellular physiology.

[9]  P. Schuster,et al.  From sequences to shapes and back: a case study in RNA secondary structures , 1994, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[10]  William Ritchie,et al.  RNA stem-loops: to be or not to be cleaved by RNAse III. , 2007, RNA.

[11]  T. Nilsen,et al.  MicroRNAs, mRNAs, and translation. , 2006, Cold Spring Harbor symposia on quantitative biology.

[12]  Christoph Flamm,et al.  The expansion of the metazoan microRNA repertoire , 2006, BMC Genomics.

[13]  I. King Jordan,et al.  A Family of Human MicroRNA Genes from Miniature Inverted-Repeat Transposable Elements , 2007, PloS one.

[14]  Phillip D. Zamore,et al.  Sorting of Drosophila Small Silencing RNAs , 2007, Cell.

[15]  D. Sankoff Simultaneous Solution of the RNA Folding, Alignment and Protosequence Problems , 1985 .

[16]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[17]  Ross Ihaka,et al.  Gentleman R: R: A language for data analysis and graphics , 1996 .

[18]  Jan Gorodkin,et al.  Principles and limitations of computational microRNA gene and target finding. , 2007, DNA and cell biology.

[19]  Phillip D. Zamore,et al.  Drosophila microRNAs Are Sorted into Functionally Distinct Argonaute Complexes after Production by Dicer-1 , 2007, Cell.

[20]  Stinus Lindgreen,et al.  WAR: Webserver for aligning structural RNAs , 2008, Nucleic Acids Res..

[21]  Hidetoshi Shimodaira,et al.  Pvclust: an R package for assessing the uncertainty in hierarchical clustering , 2006, Bioinform..

[22]  Jan Gorodkin,et al.  Fast Pairwise Structural RNA Alignments by Pruning of the Dynamical Programming Matrix , 2007, PLoS Comput. Biol..

[23]  Sean R. Eddy,et al.  Query-Dependent Banding (QDB) for Faster RNA Similarity Searches , 2007, PLoS Comput. Biol..

[24]  B. Matthews Comparison of the predicted and observed secondary structure of T4 phage lysozyme. , 1975, Biochimica et biophysica acta.

[25]  M. Matzke,et al.  Short RNAs Can Identify New Candidate Transposable Element Families in Arabidopsis , 2002, Plant Physiology.

[26]  Thomas Tuschl,et al.  The growing catalog of small RNAs and their association with distinct Argonaute/Piwi family members , 2008, Development.

[27]  Stijn van Dongen,et al.  miRBase: microRNA sequences, targets and gene nomenclature , 2005, Nucleic Acids Res..

[28]  Eran Segal,et al.  Computational prediction of RNA structural motifs involved in posttranscriptional regulatory processes , 2008, Proceedings of the National Academy of Sciences.

[29]  Zasha Weinberg,et al.  Sequence-based heuristics for faster annotation of non-coding RNA families , 2006, Bioinform..

[30]  Gunter Meister,et al.  Argonaute proteins: mediators of RNA silencing. , 2007, Molecular cell.

[31]  C. Burge,et al.  Conserved Seed Pairing, Often Flanked by Adenosines, Indicates that Thousands of Human Genes are MicroRNA Targets , 2005, Cell.

[32]  Sean R. Eddy,et al.  Rfam: an RNA family database , 2003, Nucleic Acids Res..