aPPRove: An HMM-Based Method for Accurate Prediction of RNA-Pentatricopeptide Repeat Protein Binding Events

Pentatricopeptide repeat containing proteins (PPRs) bind to RNA transcripts originating from mitochondria and plastids. There are two classes of PPR proteins. The P class contains tandem P-type motif sequences, and the PLS class contains alternating P, L and S type sequences. In this paper, we describe a novel tool that predicts PPR-RNA interaction; specifically, our method, which we call aPPRove, determines where and how a PLS-class PPR protein will bind to RNA when given a PPR and one or more RNA transcripts by using a combinatorial binding code for site specificity proposed by Barkan et al. Our results demonstrate that aPPRove successfully locates how and where a PPR protein belonging to the PLS class can bind to RNA. For each binding event it outputs the binding site, the amino-acid-nucleotide interaction, and its statistical significance. Furthermore, we show that our method can be used to predict binding events for PLS-class proteins using a known edit site and the statistical significance of aligning the PPR protein to that site. In particular, we use our method to make a conjecture regarding an interaction between CLB19 and the second intronic region of ycf3. The aPPRove web server can be found at www.cs.colostate.edu/~approve.

[1]  Sandra K. Tanz,et al.  AEF1/MPR25 is implicated in RNA editing of plastid atpF and mitochondrial nad5, and also promotes atpF splicing in Arabidopsis and rice. , 2015, The Plant journal : for cell and molecular biology.

[2]  Ian Small,et al.  Quantitative analysis of motifs contributing to the interaction between PLS-subfamily members and their target RNA sequences in plastid RNA editing. , 2014, The Plant journal : for cell and molecular biology.

[3]  Anita Arenas-M,et al.  The pentatricopeptide repeat protein MEF26 participates in RNA editing in mitochondrial cox3 and nad4 transcripts. , 2014, Mitochondrion.

[4]  Abdullah Ozer,et al.  Comprehensive Analysis of RNA-Protein Interactions by High Throughput Sequencing-RNA Affinity Profiling , 2014, Nature Methods.

[5]  María Martín,et al.  Activities at the Universal Protein Resource (UniProt) , 2013, Nucleic Acids Res..

[6]  Knut Graichen,et al.  Improved Computational Target Site Prediction for Pentatricopeptide Repeat RNA Editing Factors , 2013, PloS one.

[7]  C. Schmitz-Linneweber,et al.  Arabidopsis chloroplast quantitative editotype , 2013, FEBS letters.

[8]  Shimpei Hayashi,et al.  Elucidation of the RNA Recognition Code for Pentatricopeptide Repeat Proteins Involved in Organelle RNA Editing in Plants , 2013, PloS one.

[9]  Kentaro Uesugi,et al.  Measuring Airway Surface Liquid Depth in Ex Vivo Mouse Airways by X-Ray Imaging for the Assessment of Cystic Fibrosis Airway Therapies , 2013, PloS one.

[10]  Xiang-Sun Zhang,et al.  De novo prediction of RNA-protein interactions from sequence information. , 2013, Molecular bioSystems.

[11]  Satoru Miyano,et al.  PRD: A protein–RNA interaction database , 2012, Bioinformation.

[12]  Charles S. Bond,et al.  A Combinatorial Amino Acid Code for RNA Recognition by Pentatricopeptide Repeat Proteins , 2012, PLoS genetics.

[13]  Vasant Honavar,et al.  Predicting RNA-Protein Interactions Using Only Sequence Information , 2011, BMC Bioinformatics.

[14]  Federico Agostini,et al.  Predicting protein associations with long noncoding RNAs , 2011, Nature Methods.

[15]  Robert D. Finn,et al.  HMMER web server: interactive sequence similarity searching , 2011, Nucleic Acids Res..

[16]  J. Bähler,et al.  In silico characterization and prediction of global protein–mRNA interactions in yeast , 2011, Nucleic acids research.

[17]  C. Bond,et al.  Selection patterns on restorer-like genes reveal a conflict between nuclear and mitochondrial genomes throughout angiosperm evolution , 2011, Proceedings of the National Academy of Sciences.

[18]  Vasant Honavar,et al.  PRIDB: a protein–RNA interface database , 2010, Nucleic Acids Res..

[19]  A. Barkan,et al.  Mechanism of RNA stabilization and translational activation by a pentatricopeptide repeat protein , 2010, Proceedings of the National Academy of Sciences.

[20]  Daniel Herschlag,et al.  Diverse RNA-Binding Proteins Interact with Functionally Related Sets of RNAs, Suggesting an Extensive Regulatory System , 2008, PLoS biology.

[21]  T. Glisovic,et al.  RNA‐binding proteins and post‐transcriptional gene regulation , 2008, FEBS letters.

[22]  Johannes Söding,et al.  TPRpred: a tool for prediction of TPR-, PPR- and SEL1-like repeats from protein sequences , 2007, BMC Bioinformatics.

[23]  Amos Bairoch,et al.  ScanProsite: detection of PROSITE signature matches and ProRule-associated functional and structural residues in proteins , 2006, Nucleic Acids Res..

[24]  D. Luo,et al.  Cytoplasmic Male Sterility of Rice with Boro II Cytoplasm Is Caused by a Cytotoxic Peptide and Is Restored by Two Related PPR Motif Genes via Distinct Modes of mRNA Silencing[W] , 2006, The Plant Cell Online.

[25]  Lan Chen,et al.  NPInter: the noncoding RNAs and protein related biomacromolecules interaction database , 2005, Nucleic Acids Res..

[26]  T. Shikanai,et al.  A pentatricopeptide repeat protein is essential for RNA editing in chloroplasts , 2005, Nature.

[27]  Frédérique Bitton,et al.  Genome-Wide Analysis of Arabidopsis Pentatricopeptide Repeat Proteins Reveals Their Essential Role in Organelle Biogenesis , 2004, The Plant Cell Online.

[28]  Amos Bairoch,et al.  ScanProsite: a reference implementation of a PROSITE scanning tool. , 2002, Applied bioinformatics.

[29]  I. Small,et al.  The PPR motif - a TPR-related motif prevalent in plant organellar proteins. , 2000, Trends in biochemical sciences.

[30]  Durbin,et al.  Biological Sequence Analysis , 1998 .

[31]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[32]  A. Bairoch PROSITE: a dictionary of sites and patterns in proteins. , 1991, Nucleic acids research.