Demonstration of two novel methods for predicting functional siRNA efficiency

Background: siRNAs are small RNAs that serve as sequence determinants during the gene silencing process called RNA interference (RNAi). It is well known that siRNA efficiency is crucial in the RNAi pathway and that the efficiency of siRNAs targeting different sites of a specific gene varies greatly. There is therefore high demand for reliable siRNA prediction tools and for design methods able to pick out siRNAs with high silencing potential.

Results: In this paper, two systems have been established for the prediction of functional siRNAs: (1) a statistical model based on sequence information and (2) a machine learning model based on three features of siRNA sequences, namely binary description, thermodynamic profile and nucleotide composition. Both methods show high performance on the two datasets we constructed for training the model.

Conclusion: Both methods studied in this paper emphasize the importance of sequence information for the prediction of functional siRNAs. Denoting a bio-sequence in a binary system, in mathematical terms, might be helpful in other analyses involving fixed-length bio-sequences.
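To make the idea of a binary description concrete, the following is a minimal sketch of how a fixed-length siRNA sequence could be turned into a numeric feature vector. The abstract does not specify the encoding, so the 4-bit one-hot code per nucleotide, the function names `binary_description` and `nucleotide_composition`, and the example sequence are all assumptions made for illustration. The thermodynamic profile is left out because it depends on nearest-neighbor free-energy parameters that are not given here.

```python
from collections import Counter

# Assumed encoding: one 4-bit one-hot code per nucleotide.
# The source does not state the exact binary scheme it uses.
ONE_HOT = {
    "A": (1, 0, 0, 0),
    "C": (0, 1, 0, 0),
    "G": (0, 0, 1, 0),
    "U": (0, 0, 0, 1),
}


def normalize(seq):
    """Upper-case the sequence and treat DNA-style T as U."""
    return seq.upper().replace("T", "U")


def binary_description(seq):
    """Concatenate the one-hot codes of every position (4 bits per base)."""
    bits = []
    for base in normalize(seq):
        bits.extend(ONE_HOT[base])  # raises KeyError on non-ACGU symbols
    return bits


def nucleotide_composition(seq):
    """Return the fractions of A, C, G and U in the sequence."""
    seq = normalize(seq)
    counts = Counter(seq)
    return [counts[base] / len(seq) for base in "ACGU"]


# Hypothetical 19-nt sequence, used only to show the vector shape.
example = "GUACGUACGUACGUACGUA"
features = binary_description(example) + nucleotide_composition(example)
```

For a 19-nt sequence this yields 76 binary values plus 4 composition values. Because every sequence of the same length maps to a vector of the same size, such vectors can be passed directly to a standard classifier, which is what makes the representation convenient for fixed-length bio-sequences.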
