Computational prediction of RNA editing sites

MOTIVATION Some organisms edit their messenger RNA resulting in differences between the genomic sequence for a gene and the corresponding messenger RNA sequence. This difference complicates experimental and computational attempts to find and study genes in organisms with RNA editing even if the full genomic sequence is known. Nevertheless, knowledge of these editing sites is crucial for understanding the editing machinery of these organisms. RESULTS We present a computational technique that predicts the position of editing sites in the genomic sequence. It uses a statistical approach drawing on the protein sequences of related genes and general features of editing sites of the organism. We apply the method to the mitochondrion of the slime mold Physarum polycephalum. It correctly predicts over 90% of the amino acids and over 70% of the editing sites.

[1]  Toshimichi Ikemura,et al.  Codon usage tabulated from the international DNA sequence databases , 1997, Nucleic Acids Res..

[2]  S. B. Needleman,et al.  A general method applicable to the search for similarities in the amino acid sequence of two proteins. , 1970, Journal of molecular biology.

[3]  David J. States,et al.  Identification of protein coding regions by database similarity search , 1993, Nature Genetics.

[4]  R. Mahendran,et al.  RNA editing by cytidine insertion in mitochondria of Physarum polycephalum , 1991, Nature.

[5]  J. Gott,et al.  Expanding genome capacity via RNA editing. , 2003, Comptes rendus biologies.

[6]  Toshimichi Ikemura,et al.  Codon usage tabulated from international DNA sequence databases: status for the year 2000 , 2000, Nucleic Acids Res..

[7]  Anders Krogh,et al.  Hidden Markov models for sequence analysis: extension and analysis of the basic method , 1996, Comput. Appl. Biosci..

[8]  A. B. Robinson,et al.  Distribution of glutamine and asparagine residues and their near neighbors in peptides and proteins. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Y. Miyazawa,et al.  The complete DNA sequence of the mitochondrial genome of Physarum polycephalum , 2001, Molecular and General Genetics MGG.

[10]  R. Mahendran,et al.  Editing of Cytochrome b mRNA inPhysarum Mitochondria* , 1999, The Journal of Biological Chemistry.

[11]  S. Karlin,et al.  Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[12]  J. Davies,et al.  Molecular Biology of the Cell , 1983, Bristol Medico-Chirurgical Journal.

[13]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[14]  A. Brennicke,et al.  RNA editing. , 1999, FEMS microbiology reviews.

[15]  L. Landweber,et al.  Evolution of four types of RNA editing in myxomycetes. , 2000, RNA.

[16]  Ian Holmes,et al.  Dynamic Programming Alignment Accuracy , 1998, J. Comput. Biol..

[17]  Michael S. Waterman,et al.  Introduction to computational biology , 1995 .

[18]  M. Hanson,et al.  A guide to RNA editing. , 1997, RNA.

[19]  M. W. Gray,et al.  Diversity and Evolution of Mitochondrial RNA Editing Systems , 2003, IUBMB life.

[20]  Temple F. Smith,et al.  Comparison of biosequences , 1981 .

[21]  R. Mahendran,et al.  Insertional editing in mitochondria of Physarum. , 1993, Seminars in cell biology.

[22]  J. Felsenstein,et al.  An evolutionary model for maximum likelihood alignment of DNA sequences , 1991, Journal of Molecular Evolution.

[23]  Terence Hwa,et al.  Statistical Significance of Probabilistic Sequence Alignment and Related Local Hidden Markov Models , 2001, J. Comput. Biol..

[24]  J. Gott,et al.  Substitutional and insertional RNA editing of the cytochrome c oxidase subunit 1 mRNA of Physarum polycephalum. , 1993, The Journal of biological chemistry.