Sensitive and highly resolved identification of RNA-protein interaction sites in PAR-CLIP data

Background

PAR-CLIP is a recently developed next-generation sequencing-based method enabling transcriptome-wide identification of interaction sites between RNA and RNA-binding proteins. The PAR-CLIP procedure induces specific base transitions that originate from sites of RNA-protein interactions and can therefore guide the identification of binding sites. However, additional sources of transitions, such as cell type-specific SNPs and sequencing errors, challenge the inference of binding sites, and suitable statistical approaches are crucial to control false discovery rates. In addition, a highly resolved delineation of binding sites followed by an extensive downstream analysis is necessary for a comprehensive characterization of the protein binding preferences and the subsequent design of validation experiments.

Results

We present a statistical and computational framework for PAR-CLIP data analysis. We developed a sensitive transition-centered algorithm specifically designed to resolve protein binding sites at high resolution in PAR-CLIP data. Our method employs a Bayesian network approach to associate posterior log-odds with the observed transitions, providing an overall quantification of the confidence in RNA-protein interaction. We use published PAR-CLIP data to demonstrate the advantages of our approach, which compares favorably with alternative algorithms. Lastly, by integrating RNA-Seq data, we compute conservative, experimentally based false discovery rates of our method and demonstrate the high precision of our strategy.

Conclusions

Our method is implemented in the R package wavClusteR 2.0. The package is distributed under the GPL-2 license and is available from BioConductor at http://www.bioconductor.org/packages/devel/bioc/html/wavClusteR.html.
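
To make the notion of posterior log-odds for an observed transition concrete, the following is a minimal illustrative sketch in Python. It is not the wavClusteR model: it assumes a simple two-component setting in which the number of reads carrying a transition at a position is binomially distributed, with one fixed rate for crosslink-induced transitions and another for background sources such as SNPs and sequencing errors. The function names, the rates `p_signal` and `p_noise`, and the prior `prior_signal` are all hypothetical values chosen for illustration.

```python
import math


def log_binom_pmf(k, n, p):
    """Log of the binomial probability of k successes in n trials."""
    log_coeff = math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
    return log_coeff + k * math.log(p) + (n - k) * math.log(1 - p)


def posterior_log_odds(k, n, p_signal=0.3, p_noise=0.01, prior_signal=0.1):
    """Posterior log-odds that a transition is crosslink-induced.

    k: reads showing the transition at this position
    n: total reads covering the position
    p_signal, p_noise, prior_signal: hypothetical fixed parameters
    (assumptions for this sketch, not values from the paper).
    """
    # Log-likelihood ratio of the two competing explanations.
    log_lik_ratio = log_binom_pmf(k, n, p_signal) - log_binom_pmf(k, n, p_noise)
    # Prior log-odds of a crosslink-induced transition.
    log_prior_ratio = math.log(prior_signal) - math.log(1 - prior_signal)
    return log_lik_ratio + log_prior_ratio


if __name__ == "__main__":
    # Many transition-carrying reads favor the crosslink explanation.
    print(posterior_log_odds(10, 30))
    # No transition-carrying reads favor the background explanation.
    print(posterior_log_odds(0, 30))
```

Positive values favor a crosslink-induced transition and negative values favor background noise. In a real analysis the component distributions and the prior would be estimated from the data rather than fixed by hand.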
