piRNA identification based on motif discovery.

Piwi-interacting RNAs (piRNAs) are a class of small non-coding RNAs, about 24 to 32 nucleotides long, that associate with PIWI proteins and are involved in germline development, transposon silencing, and epigenetic regulation. Identifying piRNA loci on the genome is valuable for further studies of piRNA biogenesis and function. To this end, we applied the computational biology tool Teiresias to identify variable-length motifs that appear frequently in mouse piRNA and non-piRNA sequences, respectively, and then proposed an algorithm for piRNA identification based on motif discovery, termed "Pibomd". Using these sequence motifs as features in a Support Vector Machine (SVM) classifier, Pibomd achieved a sensitivity of 91.48% and a specificity of 89.76% on a mouse test dataset, much better than the results reported for previously published algorithms. We also trained an unbalanced SVM classifier, named "Asym-Pibomd", which provided a higher specificity (96.2%) and a lower sensitivity (72.68%) than Pibomd. Although the accuracy (ACC) of Asym-Pibomd (84.44%) is lower than that of Pibomd, it is still about ten percent higher than that obtained using the k-mer method. Further analysis of the motif positions on the piRNA sequences showed that piRNA sequences may contain information at the 5'- and/or 3'-end that is recognized by the piRNA processing apparatus in actual piRNA precursors. The prediction method is available through a user-friendly web server at http://app.aporc.org/Pibomd/.
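
To make the feature construction and the reported metrics concrete, the following is a minimal Python sketch and not the authors' implementation. It assumes that each motif is encoded as a binary presence feature and that `.` in a motif stands for a single-base wildcard; the motifs and the sequence are made up for illustration. The confusion-matrix counts are also hypothetical: they assume a balanced test set of 10,000 piRNAs and 10,000 non-piRNAs, chosen only because they reproduce the sensitivity, specificity, and ACC quoted for Asym-Pibomd.

```python
import re


def motif_features(seq, motifs):
    """Return a binary vector: 1 if the motif occurs anywhere in seq, else 0.

    Assumption: '.' in a motif matches any single base, so each motif can be
    used directly as a regular expression.
    """
    return [1 if re.search(m, seq) else 0 for m in motifs]


def sn_sp_acc(tp, fn, tn, fp):
    """Sensitivity, specificity and accuracy from confusion-matrix counts."""
    sn = tp / (tp + fn)                    # fraction of true piRNAs recovered
    sp = tn / (tn + fp)                    # fraction of non-piRNAs rejected
    acc = (tp + tn) / (tp + fn + tn + fp)  # overall fraction correct
    return sn, sp, acc


# Hypothetical motifs and sequence, for illustration only.
motifs = ["TGA.A", "GG.C", "AAAA"]
seq = "TGACAGGTCCATG"
features = motif_features(seq, motifs)

# Hypothetical counts for a balanced test set of 10,000 + 10,000 sequences.
sn, sp, acc = sn_sp_acc(tp=7268, fn=2732, tn=9620, fp=380)
print(features)
print(f"Sn={sn:.2%} Sp={sp:.2%} ACC={acc:.2%}")
```

Under the balanced-set assumption, ACC is simply the mean of sensitivity and specificity, (72.68% + 96.2%) / 2 = 84.44%, which agrees with the value reported above. A feature vector such as `features` would then be passed, one row per sequence, to an SVM library for training.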
