Prediction of C-to-U RNA editing sites in plant mitochondria using both biochemical and evolutionary information.

Although cytidine-to-uridine conversions in plant mitochondria were discovered 18 years ago, it was still an enigmatic process. Since the sequencing projects of plant mitochondrial genomes are providing more and more available sequences, the requirements of computationally identifying C-to-U RNA editing sites are also increasing. By incorporating both evolutionary and biochemical information, we developed a novel algorithm for predicting C-to-U RNA editing sites in plant mitochondria. The algorithm has been implemented as an online service called CURE (Cytidine-to-Uridine Recognizing Editor). CURE performs better than other methods that are based on only biochemical or only evolutionary information. CURE also provides the ability of predicting C-to-U RNA editing sites in non-coding regions and the synonymous C-to-U RNA editing sites in coding regions that are impossible for other methods. Furthermore, CURE can carry out prediction directly on the entire mitochondria genome sequence. The prediction results of CURE suggest the functional importance of synonymous RNA editing sites, which was neglected before. The CURE service can be accessed at http://bioinfo.au.tsinghua.edu.cn/cure.

[1]  D. Haussler,et al.  Aligning multiple genomic sequences with the threaded blockset aligner. , 2004, Genome research.

[2]  Z. Huang,et al.  Using cellular automata images and pseudo amino acid composition to predict protein subcellular location , 2005, Amino Acids.

[3]  H.-B. Shen,et al.  Euk-PLoc: an ensemble classifier for large-scale eukaryotic protein subcellular location prediction , 2007, Amino Acids.

[4]  T. Shikanai,et al.  A pentatricopeptide repeat protein is essential for RNA editing in chloroplasts , 2005, Nature.

[5]  Wei Yu,et al.  Evidence for a Site-specific Cytidine Deamination Reaction Involved in C to U RNA Editing of Plant Mitochondria (*) , 1995, The Journal of Biological Chemistry.

[6]  Daniel S. Myers,et al.  Simple statistical models predict C-to-U edited sites in plant mitochondrial RNA , 2004, BMC Bioinformatics.

[7]  T. Shiina,et al.  Structural dynamics of cereal mitochondrial genomes as revealed by complete nucleotide sequencing of the wheat mitochondrial genome , 2005, Nucleic acids research.

[8]  R. Bock,et al.  Transfer of plastid RNA-editing activity to novel sites suggests a critical role for spacing in editing-site recognition. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Z. Wen,et al.  Using pseudo amino acid composition to predict transmembrane regions in protein: cellular automata and Lempel-Ziv complexity , 2007, Amino Acids.

[10]  J. Grienenberger,et al.  Plant mitochondrial RNA editing: The Strasbourg chapter , 2009, IUBMB life.

[11]  Ernesto Picardi,et al.  REDIdb: the RNA editing database , 2006, Nucleic Acids Res..

[12]  K. Chou,et al.  Digital coding of amino acids based on hydrophobic index. , 2007, Protein and peptide letters.

[13]  Maureen R. Hanson,et al.  Sequence elements critical for efficient RNA editing of a tobacco chloroplast transcript in vivo and in vitro , 2006, Nucleic acids research.

[14]  Kazuo Shinozaki,et al.  Conserved domain structure of pentatricopeptide repeat proteins involved in chloroplast RNA editing , 2007, Proceedings of the National Academy of Sciences.

[15]  K. Chou,et al.  Recent progress in protein subcellular location prediction. , 2007, Analytical biochemistry.

[16]  Zhanchao Li,et al.  Using Chou's amphiphilic pseudo-amino acid composition and support vector machine for prediction of enzyme subfamily classes. , 2007, Journal of theoretical biology.

[17]  Kuo-Chen Chou,et al.  Signal-CF: a subsite-coupled and window-fusing approach for predicting signal peptides. , 2007, Biochemical and biophysical research communications.

[18]  Michael W. Gray,et al.  RNA editing in plant mitochondria , 1989, Nature.

[19]  Loris Nanni,et al.  Genetic programming for creating Chou’s pseudo amino acid based features for submitochondria localization , 2008, Amino Acids.

[20]  T. Shikanai,et al.  RNA editing in plant organelles: machinery, physiological function and evolution , 2006, Cellular and Molecular Life Sciences CMLS.

[21]  F.-M. Li,et al.  Using pseudo amino acid composition to predict protein subnuclear location with improved hybrid approach , 2007, Amino Acids.

[22]  H. Handa,et al.  The complete nucleotide sequence and RNA editing content of the mitochondrial genome of rapeseed (Brassica napus L.): comparative analysis of the mitochondrial genomes of rapeseed and Arabidopsis thaliana. , 2003, Nucleic acids research.

[23]  V. K. Rajasekhar,et al.  RNA Editing in Plant Mitochondria: [alpha]-Phosphate Is Retained during C-to-U Conversion in mRNAs. , 1993, The Plant cell.

[24]  Yanzhi Guo,et al.  Predicting DNA-binding proteins: approached from Chou’s pseudo amino acid composition and other specific sequence features , 2007, Amino Acids.

[25]  Shuba Gopal,et al.  Correction: genetic algorithm learning as a robust approach to RNA editing site site prediction , 2006, BMC Bioinformatics.

[26]  Tao He,et al.  dbRES: a web-oriented database for annotated RNA editing sites , 2006, Nucleic Acids Res..

[27]  Jeffrey P. Mower PREP-Mt: predictive RNA editor for plant mitochondrial genes , 2005, BMC Bioinformatics.

[28]  R. Bock,et al.  Identification of critical nucleotide positions for plastid RNA editing site recognition. , 1997, RNA.

[29]  K. Chou,et al.  Cell-PLoc: a package of Web servers for predicting subcellular localization of proteins in various organisms , 2008, Nature Protocols.

[30]  M A Williams,et al.  RNA editing site recognition in higher plant mitochondria. , 1999, The Journal of heredity.

[31]  Ralf Bundschuh,et al.  Model for codon position bias in RNA editing. , 2005, Physical review letters.

[32]  Toshiyuki Shimizu,et al.  A Pentatricopeptide Repeat Protein Is a Site Recognition Factor in Chloroplast RNA Editing*♦ , 2006, Journal of Biological Chemistry.

[33]  José M. Gualberto,et al.  RNA editing in wheat mitochondria results in the conservation of protein sequences , 1989, Nature.

[34]  J. Grienenberger,et al.  A prokaryotic-type cytidine deaminase from Arabidopsis thaliana gene expression and functional characterization. , 1999, European journal of biochemistry.

[35]  Y. Notsu,et al.  The complete sequence of the rice (Oryza sativa L.) mitochondrial genome: frequent DNA sequence acquisition and loss during the evolution of flowering plants , 2002, Molecular Genetics and Genomics.

[36]  R. Mulligan,et al.  Computational analysis of RNA editing sites in plant mitochondrial genomes reveals similar information content and a sporadic distribution of editing sites. , 2007, Molecular biology and evolution.

[37]  M. Sugiura,et al.  The complete nucleotide sequence and multipartite organization of the tobacco mitochondrial genome: comparative analysis of mitochondrial genomes in higher plants , 2005, Molecular Genetics and Genomics.

[38]  K. Chou,et al.  EzyPred: a top-down approach for predicting enzyme functional classes and subclasses. , 2007, Biochemical and biophysical research communications.

[39]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[40]  A. Brennicke,et al.  RNA editing in bryophytes and a molecular phylogeny of land plants. , 1996, The EMBO journal.

[41]  Tongliang Zhang,et al.  Using pseudo amino acid composition and binary-tree support vector machines to predict protein structural classes , 2007, Amino Acids.

[42]  K. Chou,et al.  Prediction of linear B-cell epitopes using amino acid pair antigenicity scale , 2007, Amino Acids.

[43]  Ian Small,et al.  A hypothesis on the identification of the editing enzyme in plant organelles , 2007, FEBS letters.

[44]  H.-B. Shen,et al.  Predicting secretory protein signal sequence cleavage sites by fusing the marks of global alignments , 2006, Amino Acids.

[45]  A. Brennicke,et al.  Evidence for RNA editing in mitochondria of all major groups of land plants except the Bryophyta. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[46]  H.-B. Shen,et al.  Using ensemble classifier to identify membrane protein types , 2006, Amino Acids.

[47]  Z. Huang,et al.  Using pseudo amino acid composition to predict protein subcellular location: Approached with Lyapunov index, Bessel function, and Chebyshev filter , 2005, Amino Acids.

[48]  Jeffrey P. Mower,et al.  Patterns of partial RNA editing in mitochondrial genes of Beta vulgaris , 2006, Molecular Genetics and Genomics.

[49]  K. Chou,et al.  Signal-3L: A 3-layer approach for predicting signal peptides. , 2007, Biochemical and biophysical research communications.

[50]  Kuo-Chen Chou,et al.  MemType-2L: a web server for predicting membrane proteins and their types by incorporating evolution information through Pse-PSSM. , 2007, Biochemical and biophysical research communications.

[51]  K. Chou,et al.  Euk-mPLoc: a fusion classifier for large-scale eukaryotic protein subcellular location prediction by incorporating multiple sites. , 2007, Journal of proteome research.

[52]  Axel Brennicke,et al.  Complex cis-elements determine an RNA editing site in pea mitochondria. , 2004, Nucleic acids research.

[53]  Kuo-Chen Chou,et al.  Predicting protein subnuclear location with optimized evidence-theoretic K-nearest classifier and pseudo amino acid composition. , 2005, Biochemical and biophysical research communications.

[54]  Kuo-Chen Chou,et al.  Nuc-PLoc: a new web-server for predicting protein subnuclear localization by fusing PseAA composition and PsePSSM. , 2007, Protein engineering, design & selection : PEDS.

[55]  A. Brennicke,et al.  RNA editing in Arabidopsis mitochondria effects 441 C to U changes in ORFs. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[56]  J. Farré,et al.  Different patterns in the recognition of editing sites in plant mitochondria. , 2004, Nucleic acids research.

[57]  Maureen R. Hanson,et al.  Cross-Competition in Transgenic Chloroplasts Expressing Single Editing Sites Reveals Shared cis Elements , 2002, Molecular and Cellular Biology.

[58]  Z. Huang,et al.  Using complexity measure factor to predict protein subcellular location , 2005, Amino Acids.