Multiple Kernel Support Vector Regression for siRNA Efficacy Prediction

The cell defense mechanism of RNA interference has applicationsin gene function analysis and human disease therapy. To effectivelysilence a target gene, it is desirable to select the initiator siRNA moleculeshaving satisfactory silencing capabilities. Computational prediction forsilencing efficacy of siRNAs can assist this screening process before usingthem in biological experiments. String kernel functions, which operatedirectly on the string objects representing siRNAs and target mRNAs,have been applied to support vector regression for the prediction and improvedaccuracy over numerical kernels in multidimensional vector spacesconstructed from descriptors of siRNA design rules. To fully utilize informationprovided by string and numerical kernels, we propose to unifythe two in the kernel feature space by devising a multiple kernel regressionframework where a linear combination of the kernels are used. Weformulate the multiple kernel learning into a quadratically constrainedquadratic programming (QCQP) problem, which although yields globaloptimal solution, is computationally inefficient and requires a commercialsolver package.We further propose three heuristics based on the principleof kernel-target alignment and predictive accuracy. Empirical results onreal biological data demonstrate that multiple kernel regression can improveaccuracy and decrease model complexity by reducing the numberof support vectors. In addition, multiple kernel regression gives insightsinto the kernel combination, which, for siRNA efficacy prediction, evaluatesthe relative significance of the design rules.

[1]  Ola Snøve,et al.  A comparison of siRNA efficacy predictors. , 2004, Biochemical and biophysical research communications.

[2]  Bernd Jagla,et al.  Sequence characteristics of functional siRNAs. , 2005, RNA.

[3]  Peilin Jia,et al.  Demonstration of two novel methods for predicting functional siRNA efficiency , 2006, BMC Bioinformatics.

[4]  K. Ui-Tei,et al.  Guidelines for the selection of highly effective siRNA sequences for mammalian and chick RNA interference. , 2004, Nucleic acids research.

[5]  Ljubomir J. Buturovic,et al.  A Randomized String Kernel and Its Application to RNA Interference , 2007, AAAI.

[6]  N. Cristianini,et al.  On Kernel-Target Alignment , 2001, NIPS.

[7]  Leon G. Higley,et al.  Forensic Entomology: An Introduction , 2009 .

[8]  Reiji Teramoto,et al.  Prediction of siRNA functionality using generalized string kernel and support vector machine , 2005, FEBS letters.

[9]  A. Reynolds,et al.  Rational siRNA design for RNA interference , 2004, Nature Biotechnology.

[10]  Dieter Huesken,et al.  Design of a genome-wide siRNA library using an artificial neural network , 2005, Nature Biotechnology.

[11]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[12]  Nello Cristianini,et al.  Learning the Kernel Matrix with Semidefinite Programming , 2002, J. Mach. Learn. Res..

[13]  Bernhard Schölkopf,et al.  A Kernel Approach for Learning from Almost Orthogonal Patterns , 2002, PKDD.

[14]  Terran Lane,et al.  RNA string kernels for RNAi off-target evaluation , 2006, Int. J. Bioinform. Res. Appl..

[15]  Jean-Philippe Vert,et al.  An accurate and interpretable model for siRNA efficacy prediction , 2006, BMC Bioinformatics.

[16]  M. Amarzguioui,et al.  An algorithm for selection of functional siRNA sequences. , 2004, Biochemical and biophysical research communications.

[17]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[18]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[19]  R. Bernards,et al.  A System for Stable Expression of Short Interfering RNAs in Mammalian Cells , 2002, Science.

[20]  Terran Lane,et al.  A computational study of off-target effects of RNA interference , 2005, Nucleic acids research.

[21]  T. Tuschl,et al.  On the art of identifying effective and specific siRNAs , 2006, Nature Methods.

[22]  B. Luo,et al.  Prediction of siRNA knockdown efficiency using artificial neural network models. , 2005, Biochemical and biophysical research communications.

[23]  Jan Komorowski,et al.  Principles of Data Mining and Knowledge Discovery , 2001, Lecture Notes in Computer Science.

[24]  Terran Lane,et al.  The RNA String Kernel for siRNA Efficacy Prediction , 2007, 2007 IEEE 7th International Symposium on BioInformatics and BioEngineering.

[25]  G. Hannon RNA interference : RNA , 2002 .