String Kernels of Imperfect Matches for Off-target Detection in RNA Interference

RNA interference (RNAi) is a posttranscriptional gene silencing mechanism frequently used to study gene functions and knock down viral genes. RNAi has been regarded as a highly effective means of gene repression. However, an “off-target effect” deteriorates its specificity and applicability. The complete off-target effects can only be characterized by examining all factors through systematic investigation of each gene in a genome. However, this complete investigation is too expensive to conduct experimentally which motivates a computational study. The sequence matching between an siRNA and its target mRNA allows for mismatches, G-U wobbles, and the secondary structure bulges, in addition to exact matches. To simulate these matching features, we propose string kernels measuring the similarity between two oligonucleotides and develop novel efficient implementations for RNAi off-target detection. We apply the algorithms for off-target errors in C. elegans and human.

[1]  Amihood Amir,et al.  A rapid method for detection of putative RNAi target genes in genomic data , 2003, ECCB.

[2]  C. Burge,et al.  Prediction of Mammalian MicroRNA Targets , 2003, Cell.

[3]  T. Tuschl,et al.  Functional anatomy of siRNAs for mediating efficient RNAi in Drosophila melanogaster embryo lysate , 2001, The EMBO journal.

[4]  Anindya Dutta,et al.  Small RNAs with Imperfect Match to Endogenous mRNA Repress Translation , 2003, Journal of Biological Chemistry.

[5]  T. Tuschl,et al.  Analysis of gene function in somatic mammalian cells using small interfering RNAs. , 2002, Methods.

[6]  Erik L L Sonnhammer,et al.  Improved and automated prediction of effective siRNA. , 2004, Biochemical and biophysical research communications.

[7]  A. Fire,et al.  Potent and specific genetic interference by double-stranded RNA in Caenorhabditis elegans , 1998, Nature.

[8]  Jennifer Widom,et al.  Database Systems: The Complete Book , 2001 .

[9]  C. Burge,et al.  Vertebrate MicroRNA Genes , 2003, Science.

[10]  A. Sugimoto,et al.  [Systematic functional analysis of the C. elegans genome]. , 2001, Tanpakushitsu kakusan koso. Protein, nucleic acid, enzyme.

[11]  Y. Dong,et al.  Systematic functional analysis of the Caenorhabditis elegans genome using RNAi , 2003, Nature.

[12]  A. Dillin The specifics of small interfering RNA specificity , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[13]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[14]  R. Lehmann,et al.  Targeted mRNA degradation by double-stranded RNA in vitro. , 1999, Genes & development.

[15]  Gad M. Landau,et al.  Text Indexing and Dictionary Matching with One Error , 2000, J. Algorithms.

[16]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[17]  Jason Weston,et al.  Mismatch string kernels for discriminative protein classification , 2004, Bioinform..

[18]  B. Li,et al.  Expression profiling reveals off-target gene regulation by RNAi , 2003, Nature Biotechnology.