RCAS: an RNA centric annotation system for transcriptome-wide regions of interest

Abstract In RNA research, technologies for studying the transcriptome have created tremendous potential for deciphering the puzzles of RNA biology. Along with the excitement, the unprecedented volume of RNA-related omics data poses great challenges for bioinformatics analysis. Here, we present the RNA Centric Annotation System (RCAS), an R package designed to ease the creation of gene-centric annotations and analyses for genomic regions of interest obtained from various RNA-based omics technologies. The design of RCAS is modular, which enables flexible usage and convenient integration with other bioinformatics workflows. RCAS is an R/Bioconductor package, but we also created graphical user interfaces, including a Galaxy wrapper and a stand-alone web service. Applying RCAS to published datasets shows that it not only reproduces published findings but also helps generate novel knowledge and hypotheses. The meta-gene profiles, gene-centric annotation, motif analysis and gene-set analysis that RCAS offers provide the contextual knowledge necessary for understanding the functional aspects of different biological events that involve RNAs. In addition, the array of interfaces and deployment options makes RCAS convenient for users with different levels of expertise. RCAS is available at http://bioconductor.org/packages/release/bioc/html/RCAS.html and http://rcas.mdc-berlin.de.
