Influence of mRNA Features on siRNA Interference Efficacy

Design of small interference RNA (siRNA) is one of the most important steps in effectively applying the RNA interference (RNAi) technology. The current siRNA design often produces inconsistent design results, which often fail to reliably select siRNA with clear silencing effects. We propose that when designing siRNA, one should consider mRNA global features and near siRNA-binding site local features. By a linear regression study, we discovered strong correlations between inhibitory efficacy and both mRNA global features and neighboring local features. This paper shows that, on average, less GC content, fewer stem secondary structures, and more loop secondary structures of mRNA at both global and local flanking regions of the siRNA binding sites lead to stronger inhibitory efficacy. Thus, the use of mRNA global features and near siRNA-binding site local features are essential to successful gene silencing and hence, a better siRNA design. We use a random forest model to predict siRNA efficacy using siRNA features, mRNA features, and near siRNA binding site features. Our prediction method achieved a correlation coefficient of 0.7 in 10-fold cross validation in contrast to 0.63 when using siRNA features only. Our study demonstrates that considering mRNA and near siRNA binding site features helps improve siRNA design accuracy. The findings may also be helpful in understanding binding efficacy between microRNA and mRNA.

[1]  Aleksey Y. Ogurtsov,et al.  Computational models with thermodynamic and composition features improve siRNA design , 2006, BMC Bioinformatics.

[2]  Christopher C. Holmes,et al.  Approximate Bayesian feature selection on a large meta-dataset offers novel insights on factors that effect siRNA potency , 2009, Bioinform..

[3]  Jean-Philippe Vert,et al.  An accurate and interpretable model for siRNA efficacy prediction , 2006, BMC Bioinformatics.

[4]  T. Katoh,et al.  Specific residues at every third position of siRNA shape its efficient RNAi activity , 2007, Nucleic acids research.

[5]  M. Amarzguioui,et al.  An algorithm for selection of functional siRNA sequences. , 2004, Biochemical and biophysical research communications.

[6]  C. Mello,et al.  Revealing the world of RNA interference , 2004, Nature.

[7]  Ye Ding,et al.  A structural interpretation of the effect of GC-content on efficiency of RNA interference , 2009, BMC Bioinformatics.

[8]  D. Chang,et al.  Using a hydrogen-bond index to predict the gene-silencing efficiency of siRNA based on the local structure of mRNA , 2017, 1710.07413.

[9]  P. Sætrom,et al.  Comparison of approaches for rational siRNA design leading to a new efficient and transparent method , 2007, Nucleic acids research.

[10]  Peilin Jia,et al.  Demonstration of two novel methods for predicting functional siRNA efficiency , 2006, BMC Bioinformatics.

[11]  K. Ui-Tei,et al.  Guidelines for the selection of highly effective siRNA sequences for mammalian and chick RNA interference. , 2004, Nucleic acids research.

[12]  Liangjiang Wang,et al.  Predicting siRNA potency with random forests and support vector machines , 2010, BMC Genomics.

[13]  Chi Yu Chan,et al.  Effect of target secondary structure on RNAi efficiency. , 2007, RNA.

[14]  Rich Caruana,et al.  An empirical evaluation of supervised learning in high dimensions , 2008, ICML '08.

[15]  A. Reynolds,et al.  Rational siRNA design for RNA interference , 2004, Nature Biotechnology.

[16]  J. Manola,et al.  A library of siRNA duplexes targeting the phosphoinositide 3-kinase pathway: determinants of gene silencing for use in cell-based screens. , 2004, Nucleic acids research.

[17]  Jonathan M. Garibaldi,et al.  MysiRNA: Improving siRNA efficacy prediction using a machine-learning model combining multi-tools and whole stacking energy (ΔG) , 2012, J. Biomed. Informatics.

[18]  Shigeru Takasaki,et al.  Efficient prediction methods for selecting effective siRNA sequences , 2010, Comput. Biol. Medicine.

[19]  Dieter Huesken,et al.  Design of a genome-wide siRNA library using an artificial neural network , 2005, Nature Biotechnology.