DSAP: deep-sequencing small RNA analysis pipeline

DSAP is an automated multiple-task web service designed to provide a complete solution for analyzing deep-sequencing small RNA datasets generated by next-generation sequencing technology. DSAP takes a tab-delimited file as input, which holds the unique sequence reads (tags) and their corresponding copy numbers as generated by the Solexa sequencing platform. The input data pass through four analysis steps in DSAP: (i) cleanup: removal of adaptors and poly-A/T/C/G/N nucleotides; (ii) clustering: grouping of cleaned sequence tags into unique sequence clusters; (iii) non-coding RNA (ncRNA) matching: sequence homology mapping against a transcribed sequence library from the ncRNA database Rfam (http://rfam.sanger.ac.uk/); and (iv) known miRNA matching: detection of known miRNAs in miRBase (http://www.mirbase.org/) based on sequence homology. The expression levels of matched ncRNAs and miRNAs are summarized in multi-color clickable bar charts linked to external databases. DSAP can also display miRNA expression levels from different jobs using a log2-scaled color matrix. In addition, a cross-species comparative function shows the distribution of identified miRNAs across the species deposited in miRBase. DSAP is available at http://dsap.cgu.edu.tw.
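The cleanup and clustering steps (i) and (ii) can be illustrated with a minimal Python sketch. It is not DSAP's actual implementation: the adaptor sequence, the minimum adaptor overlap, the homopolymer run threshold, the minimum tag length, and all function names are assumptions chosen for illustration. The homology-based matching of steps (iii) and (iv) is not covered here.

```python
import re
from collections import defaultdict

# Assumption: an illustrative 3' adaptor sequence; substitute the adaptor
# actually used in library preparation.
ADAPTER = "TCGTATGCCGTCTTCTGCTTG"


def trim_adapter(seq, adapter=ADAPTER, min_overlap=6):
    """Cut the read at a full adaptor match, or at a partial adaptor
    prefix (at least min_overlap bases) found at the 3' end."""
    pos = seq.find(adapter)
    if pos != -1:
        return seq[:pos]
    # Try the longest possible partial adaptor prefix first.
    for k in range(min(len(adapter) - 1, len(seq)), min_overlap - 1, -1):
        if seq.endswith(adapter[:k]):
            return seq[:-k]
    return seq


def trim_poly_tail(seq, min_run=8):
    """Remove a trailing homopolymer run (A/C/G/T/N) of min_run or more bases.
    The threshold is a hypothetical choice, not a DSAP parameter."""
    return re.sub(r"([ACGTN])\1{%d,}$" % (min_run - 1), "", seq)


def cluster_tags(lines, min_len=15):
    """Clean each 'sequence<TAB>count' record and sum the counts of tags
    that become identical after cleanup."""
    clusters = defaultdict(int)
    for line in lines:
        line = line.strip()
        if not line:
            continue
        seq, count = line.split("\t")
        cleaned = trim_poly_tail(trim_adapter(seq.upper()))
        # Discard tags that are too short to be informative after cleanup.
        if len(cleaned) >= min_len:
            clusters[cleaned] += int(count)
    return dict(clusters)


if __name__ == "__main__":
    tag = "TGAGGTAGTAGGTTGTATAGTT"
    records = [
        tag + ADAPTER[:10] + "\t5",   # tag followed by a partial adaptor
        tag + ADAPTER + "\t3",        # tag followed by the full adaptor
        "AAAAAAAAAAAAAAAAAAAA\t7",    # poly-A read, removed by cleanup
    ]
    for sequence, copies in cluster_tags(records).items():
        print(sequence, copies, sep="\t")
```

In this sketch the two adaptor-contaminated records collapse into a single cluster whose count is the sum of their copy numbers, while the poly-A read is discarded because nothing remains after trimming.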
