MicroRNA binding sites in C. elegans 3′ UTRs

MicroRNAs (miRNAs) are post-transcriptional regulators of gene expression. Since the discovery of lin-4, the founding member of the miRNA family, over 360 miRNAs have been identified for Caenorhabditis elegans (C. elegans). Prediction and validation of targets are essential for elucidation of regulatory functions of these miRNAs. For C. elegans, crosslinking immunoprecipitation (CLIP) has been successfully performed for the identification of target mRNA sequences bound by Argonaute protein ALG-1. In addition, reliable annotation of the 3′ untranslated regions (3′ UTRs) as well as developmental stage-specific expression profiles for both miRNAs and 3′ UTR isoforms are available. By utilizing these data, we developed statistical models and bioinformatics tools for both transcriptome-scale and developmental stage-specific predictions of miRNA binding sites in C. elegans 3′ UTRs. In performance evaluation via cross validation on the ALG-1 CLIP data, the models were found to offer major improvements over established algorithms for predicting both seed sites and seedless sites. In particular, our top-ranked predictions have a substantially higher true positive rate, suggesting a much higher likelihood of positive experimental validation. A gene ontology analysis of stage-specific predictions suggests that miRNAs are involved in dynamic regulation of biological functions during C. elegans development. In particular, miRNAs preferentially target genes related to development, cell cycle, trafficking, and cell signaling processes. A database for both transcriptome-scale and stage-specific predictions and software for implementing the prediction models are available through the Sfold web server at http://sfold.wadsworth.org.

[1]  Mark Gerstein,et al.  The temporal patterning microRNA let-7 regulates several transcription factors at the larval to adult transition in C. elegans. , 2005, Developmental cell.

[2]  Anjali J. Koppal,et al.  Supplementary data: Comprehensive modeling of microRNA targets predicts functional non-conserved and non-canonical sites , 2010 .

[3]  C. Lawrence,et al.  Statistical prediction of single-stranded regions in RNA secondary structure and application to predicting effective antisense target sites and beyond. , 2001, Nucleic acids research.

[4]  Nahum Sonenberg,et al.  The mechanics of miRNA-mediated gene silencing: a look under the hood of miRISC , 2012, Nature Structural &Molecular Biology.

[5]  Zachary Pincus,et al.  Dynamic expression of small non-coding RNAs, including novel microRNAs and piRNAs/21U-RNAs, during Caenorhabditis elegans development , 2009, Genome Biology.

[6]  N. Rajewsky,et al.  Widespread changes in protein synthesis induced by microRNAs , 2008, Nature.

[7]  Dang D. Long,et al.  Potent effect of target structure on microRNA function , 2007, Nature Structural &Molecular Biology.

[8]  Jun Lu,et al.  STarMir: a web server for prediction of microRNA binding sites , 2014, Nucleic Acids Res..

[9]  D. Bartel,et al.  The impact of microRNAs on protein output , 2008, Nature.

[10]  C. Burge,et al.  The microRNAs of Caenorhabditis elegans. , 2003, Genes & development.

[11]  Oliver Hofmann,et al.  miR-24 Inhibits cell proliferation by targeting E2F2, MYC, and other cell-cycle genes via binding to "seedless" 3'UTR microRNA recognition elements. , 2009, Molecular cell.

[12]  Yong Zhao,et al.  Serum response factor regulates a muscle-specific microRNA that targets Hand2 during cardiogenesis , 2005, Nature.

[13]  William A. Rennie,et al.  CLIP-based prediction of mammalian microRNA binding sites , 2013, Nucleic acids research.

[14]  C. Burge,et al.  Conserved Seed Pairing, Often Flanked by Adenosines, Indicates that Thousands of Human Genes are MicroRNA Targets , 2005, Cell.

[15]  D. Haussler,et al.  Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. , 2005, Genome research.

[16]  D. Bartel,et al.  Formation, Regulation and Evolution of Caenorhabditis elegans 3′UTRs , 2010, Nature.

[17]  F. Slack,et al.  RAS Is Regulated by the let-7 MicroRNA Family , 2005, Cell.

[18]  Michael Kertesz,et al.  The role of site accessibility in microRNA target recognition , 2007, Nature Genetics.

[19]  V. Ambros,et al.  Systematic analysis of dynamic miRNA-target interactions during C. elegans development , 2009, Development.

[20]  P. Green,et al.  Massively parallel sequencing of the polyadenylated transcriptome of C. elegans. , 2009, Genome research.

[21]  Eun-Young Choi,et al.  The C. elegans microRNA let-7 binds to imperfect let-7 complementary sites from the lin-41 3'UTR. , 2004, Genes & development.

[22]  Sebastian D. Mackowiak,et al.  The Landscape of C. elegans 3′UTRs , 2010, Science.

[23]  Ana Kozomara,et al.  miRBase: integrating microRNA annotation and deep-sequencing data , 2010, Nucleic Acids Res..

[24]  Gene W. Yeo,et al.  Comprehensive discovery of endogenous Argonaute binding sites in Caenorhabditis elegans , 2010, Nature Structural &Molecular Biology.

[25]  Y. Li,et al.  Incorporating structure to predict microRNA targets. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[26]  Scott B. Dewell,et al.  Transcriptome-wide Identification of RNA-Binding Protein and MicroRNA Target Sites by PAR-CLIP , 2010, Cell.

[27]  M. Zavolan,et al.  A biophysical miRNA-mRNA interaction model infers canonical and noncanonical targets , 2013, Nature Methods.

[28]  Oliver Hobert,et al.  Perfect seed pairing is not a generally reliable predictor for miRNA-target interactions , 2006, Nature Structural &Molecular Biology.

[29]  Yvonne Tay,et al.  MicroRNAs to Nanog, Oct4 and Sox2 coding regions modulate embryonic stem cell differentiation , 2009, Nature.

[30]  Ye Ding,et al.  Analysis of Microrna-Target Interactions by a Target Structure Based Hybridization Model , 2007, Pacific Symposium on Biocomputing.

[31]  Stefan L Ameres,et al.  Molecular Basis for Target RNA Recognition and Cleavage by Human RISC , 2007, Cell.

[32]  Jay Shendure,et al.  Transcriptome-wide miR-155 binding map reveals widespread noncanonical microRNA targeting. , 2012, Molecular cell.

[33]  R. Giegerich,et al.  Fast and effective prediction of microRNA/target duplexes. , 2004, RNA.

[34]  V. Ambros,et al.  The C. elegans heterochronic gene lin-4 encodes small RNAs with antisense complementarity to lin-14 , 1993, Cell.

[35]  D. Bartel MicroRNAs: Target Recognition and Regulatory Functions , 2009, Cell.

[36]  W. Youden,et al.  Index for rating diagnostic tests , 1950, Cancer.

[37]  A. Mele,et al.  Ago HITS-CLIP decodes miRNA-mRNA interaction maps , 2009, Nature.

[38]  Dang D. Long,et al.  mirWIP: microRNA target prediction based on microRNA-containing ribonucleoprotein–enriched transcripts , 2008, Nature Methods.

[39]  C. Lawrence,et al.  A statistical sampling algorithm for RNA secondary structure prediction. , 2003, Nucleic acids research.

[40]  G. Ruvkun,et al.  A bulged lin-4/lin-14 RNA duplex is sufficient for Caenorhabditis elegans lin-14 temporal gradient formation. , 1996, Genes & development.

[41]  Job Harms,et al.  THE LANDSCAPE OF , 2010 .

[42]  L. Lim,et al.  MicroRNA targeting specificity in mammals: determinants beyond seed pairing. , 2007, Molecular cell.