miRExpress: Analyzing high-throughput sequencing data for profiling microRNA expression

BackgroundMicroRNAs (miRNAs), small non-coding RNAs of 19 to 25 nt, play important roles in gene regulation in both animals and plants. In the last few years, the oligonucleotide microarray is one high-throughput and robust method for detecting miRNA expression. However, the approach is restricted to detecting the expression of known miRNAs. Second-generation sequencing is an inexpensive and high-throughput sequencing method. This new method is a promising tool with high sensitivity and specificity and can be used to measure the abundance of small-RNA sequences in a sample. Hence, the expression profiling of miRNAs can involve use of sequencing rather than an oligonucleotide array. Additionally, this method can be adopted to discover novel miRNAs.ResultsThis work presents a systematic approach, miRExpress, for extracting miRNA expression profiles from sequencing reads obtained by second-generation sequencing technology. A stand-alone software package is implemented for generating miRNA expression profiles from high-throughput sequencing of RNA without the need for sequenced genomes. The software is also a database-supported, efficient and flexible tool for investigating miRNA regulation. Moreover, we demonstrate the utility of miRExpress in extracting miRNA expression profiles from two Illumina data sets constructed for the human and a plant species.ConclusionWe develop miRExpress, which is a database-supported, efficient and flexible tool for detecting miRNA expression profile. The analysis of two Illumina data sets constructed from human and plant demonstrate the effectiveness of miRExpress to obtain miRNA expression profiles and show the usability in finding novel miRNAs.

[1]  T. Tammela,et al.  MicroRNA expression profiling in prostate cancer. , 2007, Cancer research.

[2]  Manolis Kellis,et al.  Systematic discovery and characterization of fly microRNAs using 12 Drosophila genomes. , 2007, Genome research.

[3]  D. Bartel,et al.  A diverse and evolutionarily fluid set of microRNAs in Arabidopsis thaliana. , 2006, Genes & development.

[4]  C. Croce,et al.  Expression profiling of microRNA using oligo DNA arrays. , 2008, Methods.

[5]  R. Durbin,et al.  Mapping Quality Scores Mapping Short Dna Sequencing Reads and Calling Variants Using P

, 2022 .

[6]  Timothy S Davison,et al.  Analyzing micro-RNA expression using microarrays. , 2006, Methods in enzymology.

[7]  David Haussler,et al.  The UCSC Genome Browser database: update 2010 , 2009, Nucleic Acids Res..

[8]  Stijn van Dongen,et al.  miRBase: tools for microRNA genomics , 2007, Nucleic Acids Res..

[9]  Michael Farrar,et al.  Sequence analysis Striped Smith – Waterman speeds database searches six times over other SIMD implementations , 2007 .

[10]  G. Carmichael,et al.  ADAR editing wobbles the microRNA world. , 2007, ACS chemical biology.

[11]  Michael Q. Zhang,et al.  Using quality scores and longer reads improves accuracy of Solexa read mapping , 2008, BMC Bioinformatics.

[12]  Jorng-Tzong Horng,et al.  RNALogo: a new approach to display structural RNA alignment , 2008, Nucleic Acids Res..

[13]  Ting Wang,et al.  The UCSC Genome Browser Database: update 2009 , 2008, Nucleic Acids Res..

[14]  Martin M Matzuk,et al.  A bioinformatics tool for linking gene expression profiling results with public databases of microRNA target predictions. , 2008, RNA.

[15]  Yun Zheng,et al.  Identification of novel and candidate miRNAs in rice by high throughput sequencing , 2008, BMC Plant Biology.

[16]  Jason S. Cumbie,et al.  High-Throughput Sequencing of Arabidopsis microRNAs: Evidence for Frequent Birth and Death of MIRNA Genes , 2007, PloS one.

[17]  Yingyin Yao,et al.  Cloning and characterization of microRNAs from wheat (Triticum aestivum L.) , 2007, Genome Biology.

[18]  Xu Ma,et al.  Genome-wide microRNA profiling in human fetal nervous tissues by oligonucleotide microarray , 2006, Child's Nervous System.

[19]  Ruiqiang Li,et al.  SOAP: short oligonucleotide alignment program , 2008, Bioinform..

[20]  Ryan D. Morin,et al.  Application of massively parallel sequencing to microRNA profiling and discovery in human embryonic stem cells. , 2008, Genome research.

[21]  James R. Knight,et al.  Genome sequencing in microfabricated high-density picolitre reactors , 2005, Nature.

[22]  Robert J. Moore,et al.  A microRNA catalog of the developing chicken embryo identified by a deep sequencing approach. , 2008, Genome research.

[23]  Martin M Matzuk,et al.  Mouse let-7 miRNA populations exhibit RNA editing that is constrained in the 5'-seed/ cleavage/anchor regions and stabilize predicted mmu-let-7a:mRNA duplexes. , 2008, Genome research.

[24]  Molly Megraw,et al.  Frequency and fate of microRNA editing in human brain , 2008, Nucleic acids research.

[25]  W. J. Kent,et al.  BLAT--the BLAST-like alignment tool. , 2002, Genome research.

[26]  F. Slack,et al.  Oncomirs — microRNAs with a role in cancer , 2006, Nature Reviews Cancer.

[27]  Michael Zuker,et al.  Mfold web server for nucleic acid folding and hybridization prediction , 2003, Nucleic Acids Res..

[28]  Lin He,et al.  MicroRNAs: small RNAs with a big role in gene regulation , 2004, Nature Reviews Genetics.

[29]  Jeffrey G. Reid,et al.  Expression profiling of microRNAs by deep sequencing , 2009, Briefings Bioinform..

[30]  Bin Ma,et al.  ZOOM! Zillions of oligos mapped , 2008, Bioinform..

[31]  Wing Hung Wong,et al.  SeqMap: mapping massive amount of oligonucleotides to the genome , 2008, Bioinform..

[32]  N. Amariglio,et al.  A-to-I RNA editing: a new regulatory mechanism of global gene expression. , 2007, Blood cells, molecules & diseases.

[33]  Jeffrey W. Habig,et al.  miRNA editing--we should have inosine this coming. , 2007, Molecular cell.

[34]  C. W. Lee,et al.  The open reading frame of bamboo mosaic potexvirus satellite RNA is not essential for its replication and can be replaced with a bacterial gene. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[35]  Mary Goldman,et al.  The UCSC Genome Browser database: update 2011 , 2010, Nucleic Acids Res..