Ebbie: automated analysis and storage of small RNA cloning data using a dynamic web server

BackgroundDNA sequencing is used ubiquitously: from deciphering genomes[1] to determining the primary sequence of small RNAs (smRNAs) [2–5]. The cloning of smRNAs is currently the most conventional method to determine the actual sequence of these important regulators of gene expression. Typical smRNA cloning projects involve the sequencing of hundreds to thousands of smRNA clones that are delimited at their 5' and 3' ends by fixed sequence regions. These primers result from the biochemical protocol used to isolate and convert the smRNA into clonable PCR products. Recently we completed a smRNA cloning project involving tobacco plants, where analysis was required for ~700 smRNA sequences[6]. Finding no easily accessible research tool to enter and analyze smRNA sequences we developed Ebbie to assist us with our study.ResultsEbbie is a semi-automated smRNA cloning data processing algorithm, which initially searches for any substring within a DNA sequencing text file, which is flanked by two constant strings. The substring, also termed smRNA or insert, is stored in a MySQL and BlastN database. These inserts are then compared using BlastN to locally installed databases allowing the rapid comparison of the insert to both the growing smRNA database and to other static sequence databases. Our laboratory used Ebbie to analyze scores of DNA sequencing data originating from an smRNA cloning project[6]. Through its built-in instant analysis of all inserts using BlastN, we were able to quickly identify 33 groups of smRNAs from ~700 database entries. This clustering allowed the easy identification of novel and highly expressed clusters of smRNAs. Ebbie is available under GNU GPL and currently implemented on http://bioinformatics.org/ebbie/ConclusionEbbie was designed for medium sized smRNA cloning projects with about 1,000 database entries [6–8].Ebbie can be used for any type of sequence analysis where two constant primer regions flank a sequence of interest. The reliable storage of inserts, and their annotation in a MySQL database, BlastN[9] comparison of new inserts to dynamic and static databases make it a powerful new tool in any laboratory using DNA sequencing. Ebbie also prevents manual mistakes during the excision process and speeds up annotation and data-entry. Once the server is installed locally, its access can be restricted to protect sensitive new DNA sequencing data. Ebbie was primarily designed for smRNA cloning projects, but can be applied to a variety of RNA and DNA cloning projects[2, 3, 10, 11].

[1]  P. Unrau,et al.  Ribozyme motif structure mapped using random recombination and selection. , 2005, RNA.

[2]  Phillip D. Zamore,et al.  Ribo-gnome: The Big World of Small RNAs , 2005, Science.

[3]  K. Collins,et al.  Two classes of endogenous small RNAs in Tetrahymena thermophila. , 2006, Genes & development.

[4]  D. Curtis Jamison Perl Programming for Biologists: Jamison/Perl Programming , 2003 .

[5]  Henry Mirsky,et al.  RNA editing of a miRNA precursor. , 2004, RNA.

[6]  D. Bartel,et al.  Micromanagers of gene expression: the potentially widespread influence of metazoan microRNAs , 2004, Nature Reviews Genetics.

[7]  Thomas Girke,et al.  Cloning and Characterization of MicroRNAs from Ricew⃞ , 2005, The Plant Cell Online.

[8]  H. Ebhardt,et al.  Extensive 3' modification of plant small RNAs is modulated by helper component-proteinase expression. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Elizabeth Castro PERL and CGI for the World Wide Web , 1998 .

[10]  R. Breaker,et al.  Regulation of bacterial gene expression by riboswitches. , 2005, Annual review of microbiology.

[11]  Paul DuBois MySQL and Perl for the Web , 2001 .

[12]  Nóra Varga,et al.  Sensitive and specific detection of microRNAs by northern blot analysis using LNA-modified oligonucleotide probes. , 2004, Nucleic acids research.

[13]  A. Saïb,et al.  A Cellular MicroRNA Mediates Antiviral Defense in Human Cells , 2005, Science.

[14]  W. J. Kent,et al.  BLAT--the BLAST-like alignment tool. , 2002, Genome research.

[15]  P. Green,et al.  Base-calling of automated sequencer traces using phred. I. Accuracy assessment. , 1998, Genome research.

[16]  Shivakundan Singh Tej,et al.  Elucidation of the Small RNA Component of the Transcriptome , 2005, Science.

[17]  P Green,et al.  Base-calling of automated sequencer traces using phred. II. Error probabilities. , 1998, Genome research.

[18]  R. Russell,et al.  Principles of MicroRNA–Target Recognition , 2005, PLoS biology.

[19]  J. Castle,et al.  Microarray analysis shows that some microRNAs downregulate large numbers of target mRNAs , 2005, Nature.

[20]  Adam M. Gustafson,et al.  Genetic and Functional Diversification of Small RNA Pathways in Plants , 2004, PLoS biology.

[21]  Anton J. Enright,et al.  Human MicroRNA Targets , 2004, PLoS biology.

[22]  L. Lim,et al.  An Abundant Class of Tiny RNAs with Probable Regulatory Roles in Caenorhabditis elegans , 2001, Science.

[23]  V. Thorsson,et al.  Genome sequence of Halobacterium species NRC-1. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[24]  S. Eddy,et al.  Homologs of small nucleolar RNAs in Archaea. , 2000, Science.

[25]  C. Burge,et al.  The microRNAs of Caenorhabditis elegans. , 2003, Genes & development.

[26]  D. C. Jamison Perl Programming for Biologists , 2003 .

[27]  Scott A. Givan,et al.  ASRP: the Arabidopsis Small RNA Project Database , 2004, Nucleic Acids Res..

[28]  J. Carpten,et al.  Modulation of non-templated nucleotide addition by Taq DNA polymerase: primer modifications that facilitate genotyping. , 1996, BioTechniques.