A systematic approach to RNA-associated motif discovery

BackgroundSequencing-based large screening of RNA-protein and RNA-RNA interactions has enabled the mechanistic study of post-transcriptional RNA processing and sorting, including exosome-mediated RNA secretion. The downstream analysis of RNA binding sites has encouraged the investigation of novel sequence motifs, which resulted in exceptional new challenges for identifying motifs from very short sequences (e.g., small non-coding RNAs or truncated messenger RNAs), where conventional methods tend to be ineffective. To address these challenges, we propose a novel motif-finding method and validate it on a wide range of RNA applications.ResultsWe first perform motif analysis on microRNAs and longer RNA fragments from various cellular and exosomal sources, and then validate our prediction through literature search and experimental test. For example, a 4 bp-long motif, GUUG, was detected to be responsible for microRNA loading in exosomes involved in human colon cancer (SW620). Additional performance comparisons in various case studies have shown that this new approach outperforms several existing state-of-the-art methods in detecting motifs with exceptional high coverage and explicitness.ConclusionsIn this work, we have demonstrated the promising performance of a new motif discovery approach that is particularly effective in current RNA applications. Important discoveries resulting from this work include the identification of possible RNA-loading motifs in a variety of exosomes, as well as novel insights in sequence features of RNA cargos, i.e., short non-coding RNAs and messenger RNAs may share similar loading mechanism into exosomes. This method has been implemented and deployed as a new webserver named MDS2 which is accessible at http://sbbi-panda.unl.edu/MDS2/, along with a standalone package available for download at https://github.com/sbbi/MDS2.

[1]  R. A. Leibler,et al.  On Information and Sufficiency , 1951 .

[2]  Charles Elkan,et al.  Fitting a Mixture Model By Expectation Maximization To Discover Motifs In Biopolymer , 1994, ISMB.

[3]  S. Pietrokovski Searching databases of conserved sequence regions by aligning protein multiple-alignments. , 1996, Nucleic acids research.

[4]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[5]  W. J. Kent,et al.  Environmentally Induced Foregut Remodeling by PHA-4/FoxA and DAF-12/NHR , 2004, Science.

[6]  Gábor Csárdi,et al.  The igraph software package for complex network research , 2006 .

[7]  Serafim Batzoglou,et al.  MotifCut: regulatory motifs finding with maximum density subgraphs , 2006, ISMB.

[8]  P. D’haeseleer How does DNA sequence motif discovery work? , 2006, Nature Biotechnology.

[9]  Sunduz Keles,et al.  Statistical Applications in Genetics and Molecular Biology Supervised Detection of Conserved Motifs in DNA Sequences with Cosmo , 2011 .

[10]  J. Lötvall,et al.  Exosome-mediated transfer of mRNAs and microRNAs is a novel mechanism of genetic exchange between cells , 2007, Nature Cell Biology.

[11]  K. Ekström Exosomal Shuttle RNA , 2008 .

[12]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[13]  Jean-Loup Guillaume,et al.  Fast unfolding of community hierarchies in large networks , 2008, ArXiv.

[14]  P. Park ChIP–seq: advantages and challenges of a maturing technology , 2009, Nature Reviews Genetics.

[15]  Graça Raposo,et al.  Exosomes--vesicular carriers for intercellular communication. , 2009, Current opinion in cell biology.

[16]  Mikael Bodén,et al.  MEME Suite: tools for motif discovery and searching , 2009, Nucleic Acids Res..

[17]  J. Lötvall,et al.  Exosomes Communicate Protective Messages during Oxidative Stress; Possible Role of Exosomal Shuttle RNA , 2010, PloS one.

[18]  Juan M. Vaquerizas,et al.  Multiplexed massively parallel SELEX for characterization of human transcription factor binding specificities. , 2010, Genome research.

[19]  Eric C. Lai,et al.  Natural Variation of the Amino-Terminal Glutamine-Rich Domain in Drosophila Argonaute2 Is Not Associated with Developmental Defects , 2010, PloS one.

[20]  Scott B. Dewell,et al.  Transcriptome-wide Identification of RNA-Binding Protein and MicroRNA Target Sites by PAR-CLIP , 2010, Cell.

[21]  Shan Li,et al.  MotifClick: prediction of cis-regulatory binding sites via merging cliques , 2011, BMC Bioinformatics.

[22]  Tohru Mochizuki,et al.  Let-7 MicroRNA Family Is Selectively Secreted into the Extracellular Environment via Exosomes in a Metastatic Gastric Cancer Cell Line , 2010, PloS one.

[23]  S. Mathivanan,et al.  Exosomes: extracellular organelles important in intercellular communication. , 2010, Journal of proteomics.

[24]  Celine Vens,et al.  Identifying discriminative classification-based motifs in biological sequences , 2011, Bioinform..

[25]  Timothy L. Bailey,et al.  Gene expression Advance Access publication May 4, 2011 DREME: motif discovery in transcription factor ChIP-seq data , 2011 .

[26]  William Stafford Noble,et al.  Improved similarity scores for comparing motifs , 2011, Bioinform..

[27]  G. Stormo,et al.  Quantitative analysis demonstrates most transcription factors require only simple models of specificity , 2011, Nature Biotechnology.

[28]  David Tollervey,et al.  Cross-linking, ligation, and sequencing of hybrids reveals RNA–RNA interactions in yeast , 2011, Proceedings of the National Academy of Sciences.

[29]  Cecilia Lässer Exosomal RNA as biomarkers and the therapeutic potential of exosome vectors , 2012, Expert opinion on biological therapy.

[30]  P. Altevogt,et al.  Vesiclepedia: A Compendium for Extracellular Vesicles with Continuous Community Annotation , 2012, PLoS biology.

[31]  F. Chisari,et al.  Short-range exosomal transfer of viral RNA from infected cells to plasmacytoid dendritic cells triggers innate immunity. , 2012, Cell host & microbe.

[32]  Doron Betel,et al.  Genome-wide identification of miRNA targets by PAR-CLIP. , 2012, Methods.

[33]  Ralf Zimmer,et al.  PARma: identification of microRNA target sites in AGO-PAR-CLIP data , 2013, Genome Biology.

[34]  A. Llorente,et al.  Exosomal miRNAs as Biomarkers for Prostate Cancer , 2013, Front. Genet..

[35]  Ron Shamir,et al.  RAP: Accurate and Fast Motif Finding Based on Protein-Binding Microarray Data , 2013, J. Comput. Biol..

[36]  F. Sánchez‐Madrid,et al.  Sumoylated hnRNPA2B1 controls the sorting of miRNAs into exosomes through binding to specific motifs , 2013, Nature Communications.

[37]  D. Tollervey,et al.  Mapping the Human miRNA Interactome by CLASH Reveals Frequent Noncanonical Binding , 2013, Cell.

[38]  J. Lötvall,et al.  Distinct RNA profiles in subpopulations of extracellular vesicles: apoptotic bodies, microvesicles and exosomes , 2013, Journal of extracellular vesicles.

[39]  S. Thibodeau,et al.  Characterization of human plasma-derived exosomal RNAs by deep sequencing , 2013, BMC Genomics.

[40]  Manolis Kellis,et al.  Systematic discovery and characterization of regulatory motifs in ENCODE TF binding experiments , 2013, Nucleic acids research.

[41]  Joseph A. Veech,et al.  The pairwise approach to analysing species co‐occurrence , 2014 .

[42]  I. MacRae,et al.  Structural basis for microRNA targeting , 2014, Science.

[43]  Ralf Zimmer,et al.  Widespread context dependency of microRNA-mediated regulation , 2014, Genome research.

[44]  Hailing Jin,et al.  ARGONAUTE PIWI domain and microRNA duplex structure regulate small RNA sorting in Arabidopsis , 2014, Nature Communications.

[45]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[46]  Naoto Tsuchiya,et al.  Circulating Exosomal microRNAs as Biomarkers of Colon Cancer , 2014, PloS one.

[47]  Jussi Taipale,et al.  Conservation of transcription factor binding specificities across 600 million years of bilateria evolution , 2015, eLife.

[48]  E. Furlong,et al.  Author response: Conservation of transcription factor binding specificities across 600 million years of bilateria evolution , 2015 .

[49]  Sha Li,et al.  Exosome and Exosomal MicroRNA: Trafficking, Sorting, and Function , 2015, Genom. Proteom. Bioinform..

[50]  Robert J Freishtat,et al.  Adipocyte-derived Exosomal miRNAs: A Novel Mechanism for Obesity-Related Disease , 2014, Pediatric Research.

[51]  Charles M. Rice,et al.  miRNA–target chimeras reveal miRNA 3′-end pairing as a major determinant of Argonaute target specificity , 2015, Nature Communications.

[52]  Alissa M. Weaver,et al.  KRAS-dependent sorting of miRNA to exosomes , 2015, eLife.

[53]  Antonella Bongiovanni,et al.  EVpedia: a community web portal for extracellular vesicles research , 2015, Bioinform..

[54]  Li Li,et al.  Exosomal RNA from Mycobacterium tuberculosis‐Infected Cells Is Functional in Recipient Macrophages , 2015, Traffic.

[55]  W. D. van Marken Lichtenbelt,et al.  Exosomal microRNA miR-92a concentration in serum reflects human brown fat activity , 2016, Nature Communications.

[56]  F. Aqil,et al.  Exosomal miRNAs as biomarkers of recurrent lung cancer , 2016, Tumor Biology.

[57]  Q. Han,et al.  Exosomes secreted by mesenchymal stem cells promote endothelial cell angiogenesis by transferring miR-125a , 2016, Journal of Cell Science.

[58]  Henrik J Johansson,et al.  Cells release subpopulations of exosomes with distinct molecular and biological properties , 2016, Scientific Reports.

[59]  Jina Ko,et al.  Detection and isolation of circulating exosomes and microvesicles for cancer monitoring and diagnostics using micro-/nano-based devices. , 2016, The Analyst.

[60]  Shivakumar Keerthikumar,et al.  ExoCarta: A Web-Based Compendium of Exosomal Cargo. , 2016, Journal of molecular biology.

[61]  A. Weisz,et al.  The RNA-Binding Protein SYNCRIP Is a Component of the Hepatocyte Exosomal Machinery Controlling MicroRNA Sorting. , 2016, Cell reports.

[62]  Clive Wilson,et al.  Exosomal miRNAs as cancer biomarkers and therapeutic targets , 2016, Journal of extracellular vesicles.

[63]  Je-Hyun Yoon,et al.  microRNA‐binding proteins: specificity and function , 2017, Wiley interdisciplinary reviews. RNA.

[64]  Xin Chen,et al.  DMINDA 2.0: integrated and systematic views of regulatory DNA motif identification and analyses , 2017, Bioinform..

[65]  C. Joo,et al.  Why Argonaute is needed to make microRNA target search fast and reliable. , 2017, Seminars in cell & developmental biology.

[66]  B. Kuster,et al.  Preferential microRNA targeting revealed by in vivo competitive binding and differential Argonaute immunoprecipitation , 2017, Nucleic acids research.

[67]  M. Ashraf,et al.  Mesenchymal stem cells release exosomes that transfer miRNAs to endothelial cells and promote angiogenesis , 2017, Oncotarget.

[68]  S. Jaffrey,et al.  Molecular basis for the specific and multivariant recognitions of RNA substrates by human hnRNP A2/B1 , 2017, Nature Communications.

[69]  David J. Arenillas,et al.  JASPAR 2018: update of the open-access database of transcription factor binding profiles and its web framework , 2017, Nucleic acids research.