Genome-wide identification of clusters of predicted microRNA binding sites as microRNA sponge candidates

The number of discovered natural miRNA sponges in plants, viruses, and mammals is increasing steadily. Some sponges like ciRS-7 for miR-7 contain multiple nearby miRNA binding sites. We hypothesize that such clusters of miRNA binding sites on the genome can function together as a sponge. No systematic effort has been made in search for clusters of miRNA targets. Here, we, to our knowledge, make the first genome-wide target site predictions for clusters of mature human miRNAs. For each miRNA, we predict the target sites on a genome-wide scale, build a graph with edge weights based on the pairwise distances between sites, and apply Markov clustering to identify genomic regions with high binding site density. Significant clusters are then extracted based on cluster size difference between real and shuffled genomes preserving local properties such as the GC content. We then use conservation and binding energy to filter a final set of miRNA target site clusters or sponge candidates. Our pipeline predicts 3673 sponge candidates for 1250 miRNAs, including the experimentally verified miR-7 sponge ciRS-7. In addition, we point explicitly to 19 high-confidence candidates overlapping annotated genomic sequence. The full list of candidates is freely available at http://rth.dk/resources/mirnasponge, where detailed properties for individual candidates can be explored, such as alignment details, conservation, accessibility and target profiles, which facilitates selection of sponge candidates for further context specific analysis.

[1]  Stephan H. Bernhart,et al.  RNA Accessibility in cubic time , 2011, Algorithms for Molecular Biology.

[2]  D. Bartel,et al.  Expanded identification and characterization of mammalian circular RNAs , 2014, Genome Biology.

[3]  Phillip A. Sharp,et al.  Emerging Roles for Natural MicroRNA Sponges , 2010, Current Biology.

[4]  C. Burge,et al.  Conserved Seed Pairing, Often Flanked by Adenosines, Indicates that Thousands of Human Genes are MicroRNA Targets , 2005, Cell.

[5]  J. Castle,et al.  Microarray analysis shows that some microRNAs downregulate large numbers of target mRNAs , 2005, Nature.

[6]  Yue Wang,et al.  Endogenous miRNA sponge lincRNA-RoR regulates Oct4, Nanog, and Sox2 in human embryonic stem cell self-renewal. , 2013, Developmental cell.

[7]  Ivan Antonov,et al.  Analysis of discordant Affymetrix probesets casts serious doubt on idea of microarray data reutilization , 2014, BMC Genomics.

[8]  Andrew J. Saykin,et al.  Functional microRNAs in Alzheimer’s disease and cancer: differential regulation of common mechanisms and pathways , 2013, Front. Gene..

[9]  Ratna Chakrabarti,et al.  MicroRNA expressions associated with progression of prostate cancer cells to antiandrogen therapy resistance , 2014, Molecular Cancer.

[10]  Ferdinando Di Cunto,et al.  Coding-Independent Regulation of the Tumor Suppressor PTEN by Competing Endogenous mRNAs , 2011, Cell.

[11]  Jan Gorodkin,et al.  Structured RNAs and synteny regions in the pig genome , 2014, BMC Genomics.

[12]  D. Bartel MicroRNAs: Target Recognition and Regulatory Functions , 2009, Cell.

[13]  Sebastian D. Mackowiak,et al.  Circular RNAs are a large class of animal RNAs with regulatory potency , 2013, Nature.

[14]  D. Bartel MicroRNAs Genomics, Biogenesis, Mechanism, and Function , 2004, Cell.

[15]  Christian von Mering,et al.  RAIN: RNA–protein Association and Interaction Networks , 2017, Database J. Biol. Databases Curation.

[16]  Haifan Lin,et al.  MicroRNAs: key regulators of stem cells , 2009, Nature Reviews Molecular Cell Biology.

[17]  P. Pandolfi,et al.  A ceRNA Hypothesis: The Rosetta Stone of a Hidden RNA Language? , 2011, Cell.

[18]  R. Gregory,et al.  Many roads to maturity: microRNA biogenesis pathways and their regulation , 2009, Nature Cell Biology.

[19]  Margaret S. Ebert,et al.  MicroRNA sponges: competitive inhibitors of small RNAs in mammalian cells , 2007, Nature Methods.

[20]  M. Esteller Non-coding RNAs in human disease , 2011, Nature Reviews Genetics.

[21]  Vikram Agarwal,et al.  Assessing the ceRNA hypothesis with quantitative measurements of miRNA and target abundance. , 2014, Molecular cell.

[22]  M. Fabbri,et al.  MicroRNAs and other non-coding RNAs as targets for anticancer drug development , 2013, Nature Reviews Drug Discovery.

[23]  Minghui Jiang,et al.  uShuffle: A useful tool for shuffling biological sequences while preserving the k-let counts , 2008, BMC Bioinformatics.

[24]  Peter F. Stadler,et al.  RIsearch2: suffix array-based large-scale prediction of RNA–RNA interactions and siRNA off-targets , 2017, Nucleic acids research.

[25]  K. Pollard,et al.  Detection of nonneutral substitution rates on mammalian phylogenies. , 2010, Genome research.

[26]  angesichts der Corona-Pandemie,et al.  UPDATE , 1973, The Lancet.

[27]  Jerzy Jurka,et al.  Annotation, submission and screening of repetitive elements in Repbase: RepbaseSubmitter and Censor , 2006, BMC Bioinformatics.

[28]  Hui Zhou,et al.  starBase v2.0: decoding miRNA-ceRNA, miRNA-ncRNA and protein–RNA interaction networks from large-scale CLIP-Seq data , 2013, Nucleic Acids Res..

[29]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[30]  Shaoli Das,et al.  lnCeDB: Database of Human Long Noncoding RNA Acting as Competing Endogenous RNA , 2014, PloS one.

[31]  Ye Ding,et al.  Rapid Generation of MicroRNA Sponges for MicroRNA Inhibition , 2012, PloS one.

[32]  Ana Kozomara,et al.  miRBase: integrating microRNA annotation and deep-sequencing data , 2010, Nucleic Acids Res..

[33]  S. Dongen Graph clustering by flow simulation , 2000 .

[34]  Junpeng Zhang,et al.  Computational methods for identifying miRNA sponge interactions , 2016, Briefings Bioinform..

[35]  Rong Li,et al.  Whole-genome analysis of 5-hydroxymethylcytosine and 5-methylcytosine at base resolution in the human brain , 2013, Genome Biology.

[36]  D. V. Vactor,et al.  NIH-PA Author Manuscript NIH-PA Author Manuscript NIH-PA Author Manuscript NIH Public Access Author Manuscript Nat Methods. Author manuscript; available in PMC 2011 September 30. , 2009 .

[37]  Subbaya Subramanian,et al.  Competing endogenous RNA database , 2012, Bioinformation.

[38]  M. Dinger,et al.  Endogenous microRNA sponges: evidence and controversy , 2016, Nature Reviews Genetics.

[39]  C. Burge,et al.  Most mammalian mRNAs are conserved targets of microRNAs. , 2008, Genome research.

[40]  Raffaele Giancarlo,et al.  Speeding up the Consensus Clustering methodology for microarray data analysis , 2011, Algorithms for Molecular Biology.

[41]  Alessandro Vullo,et al.  Ensembl 2015 , 2014, Nucleic Acids Res..

[42]  J. Kjems,et al.  Circular RNA and miR-7 in cancer. , 2013, Cancer research.

[43]  Ming Sun,et al.  Lnc RNA HOTAIR functions as a competing endogenous RNA to regulate HER2 expression by sponging miR-331-3p in gastric cancer , 2014, Molecular Cancer.

[44]  Pål Sætrom,et al.  Circular RNAs are depleted of polymorphisms at microRNA binding sites , 2014, Bioinform..

[45]  Jun Zhang,et al.  Diverse alternative back-splicing and alternative splicing landscape of circular RNAs , 2016, Genome research.

[46]  Bronwen L. Aken,et al.  GENCODE: The reference human genome annotation for The ENCODE Project , 2012, Genome research.

[47]  J. Kjems,et al.  Natural RNA circles function as efficient microRNA sponges , 2013, Nature.

[48]  Jan Gorodkin,et al.  Protein-driven inference of miRNA–disease associations , 2013, Bioinform..

[49]  Jan Gorodkin,et al.  RIsearch: fast RNA–RNA interaction search using a simplified nearest-neighbor energy model , 2012, Bioinform..

[50]  D. Karolchik,et al.  The UCSC Genome Browser database: 2016 update , 2015, bioRxiv.

[51]  Yi Lu,et al.  MCM-test: a fuzzy-set-theory-based approach to differential analysis of gene pathways , 2008, BMC Bioinformatics.

[52]  Haifan Lin,et al.  Repressing the repressor: a lincRNA as a MicroRNA sponge in embryonic stem cell self-renewal. , 2013, Developmental cell.

[53]  Robert Giegerich,et al.  GUUGle: a utility for fast exact matching under RNA complementary rules including G-U base pairing , 2006, Bioinform..

[54]  Petar Glažar,et al.  circBase: a database for circular RNAs , 2014, RNA.

[55]  Chaochun Liu,et al.  The imprinted H19 lncRNA antagonizes let-7 microRNAs. , 2013, Molecular cell.

[56]  Rolf Backofen,et al.  Global or local? Predicting secondary structure and accessibility in mRNAs , 2012, Nucleic acids research.

[57]  Terrence S. Furey,et al.  The UCSC Genome Browser Database , 2003, Nucleic Acids Res..

[58]  Margaret S. Ebert,et al.  Pretty Boots Ankle Ugg Chestnut Short Classic Boots Womens 7qY1r7 , 2010 .