Template-Based Modeling of Protein-RNA Interactions

Protein-RNA complexes formed by specific recognition between RNA and RNA-binding proteins play an important role in biological processes. More than a thousand of such proteins in human are curated and many novel RNA-binding proteins are to be discovered. Due to limitations of experimental approaches, computational techniques are needed for characterization of protein-RNA interactions. Although much progress has been made, adequate methodologies reliably providing atomic resolution structural details are still lacking. Although protein-RNA free docking approaches proved to be useful, in general, the template-based approaches provide higher quality of predictions. Templates are key to building a high quality model. Sequence/structure relationships were studied based on a representative set of binary protein-RNA complexes from PDB. Several approaches were tested for pairwise target/template alignment. The analysis revealed a transition point between random and correct binding modes. The results showed that structural alignment is better than sequence alignment in identifying good templates, suitable for generating protein-RNA complexes close to the native structure, and outperforms free docking, successfully predicting complexes where the free docking fails, including cases of significant conformational change upon binding. A template-based protein-RNA interaction modeling protocol PRIME was developed and benchmarked on a representative set of complexes.

[1]  J. Iwakiri,et al.  Dissecting the protein–RNA interface: the role of protein surface shapes and RNA secondary structures in protein–RNA recognition , 2011, Nucleic acids research.

[2]  Andrey Tovchigrechko,et al.  The size of the intermolecular energy funnel in protein–protein interactions , 2008, Proteins.

[3]  Federico Agostini,et al.  Predicting protein associations with long noncoding RNAs , 2011, Nature Methods.

[4]  Janusz M. Bujnicki,et al.  DARS-RNP and QUASI-RNP: New statistical potentials for protein-RNA docking , 2011, BMC Bioinformatics.

[5]  Tyson A. Clark,et al.  HITS-CLIP yields genome-wide insights into brain alternative RNA processing , 2008, Nature.

[6]  Anuj Srivastava,et al.  RNA global alignment in the joint sequence–structure space using elastic shape analysis , 2013, Nucleic acids research.

[7]  J. Janin,et al.  Dissecting protein–RNA recognition sites , 2008, Nucleic acids research.

[8]  Ilya A Vakser,et al.  Protein-protein docking: from interaction to interactome. , 2014, Biophysical journal.

[9]  Y. Shamoo,et al.  Structure-based analysis of protein-RNA interactions using the program ENTANGLE. , 2001, Journal of molecular biology.

[10]  Zhengwei Zhu,et al.  Templates are available to model nearly all complexes of structurally characterized proteins , 2012, Proceedings of the National Academy of Sciences.

[11]  J. Perona,et al.  Influence of transfer RNA tertiary structure on aminoacylation efficiency by glutaminyl and cysteinyl-tRNA synthetases. , 2000, Journal of molecular biology.

[12]  Jianyang Zeng,et al.  A deep learning framework for modeling structural features of RNA-binding protein targets , 2015, Nucleic acids research.

[13]  Ivan Anishchenko,et al.  Protein models docking benchmark 2 , 2015, Proteins.

[14]  Laura Pérez-Cano,et al.  A protein‐RNA docking benchmark (II): Extended set from experimental and homology modeling data , 2012, Proteins.

[15]  Yaoqi Zhou,et al.  Structure-based prediction of RNA-binding domains and RNA-binding sites and application to structural genomics targets , 2010, Nucleic acids research.

[16]  Petras J. Kundrotas,et al.  Protein Docking by the Interface Structure Similarity: How Much Structure Is Needed? , 2012, PloS one.

[17]  Peter M. Kasson,et al.  GROMACS 4.5: a high-throughput and highly parallel open source molecular simulation toolkit , 2013, Bioinform..

[18]  J. Skolnick,et al.  TM-align: a protein structure alignment algorithm based on the TM-score , 2005, Nucleic acids research.

[19]  Ilya A Vakser,et al.  Docking of protein models , 2002, Protein science : a publication of the Protein Society.

[20]  Xuegong Zhang,et al.  Computational prediction of associations between long non-coding RNAs and proteins , 2013, BMC Genomics.

[21]  Shigeyuki Yokoyama,et al.  ATP binding by glutamyl‐tRNA synthetase is switched to the productive mode by tRNA binding , 2003, The EMBO journal.

[22]  Marc F Lensink,et al.  Docking, scoring, and affinity prediction in CAPRI , 2013, Proteins.

[23]  Zhengwei Zhu,et al.  CD-HIT: accelerated for clustering the next-generation sequencing data , 2012, Bioinform..

[24]  Richard Bonneau,et al.  The mRNA-bound proteome and its global occupancy profile on protein-coding transcripts. , 2012, Molecular cell.

[25]  Ilya A Vakser,et al.  Global and local structural similarity in protein–protein complexes: Implications for template‐based docking , 2013, Proteins.

[26]  Martin Zacharias,et al.  A coarse-grained force field for Protein–RNA docking , 2011, Nucleic acids research.

[27]  Daisuke Kihara,et al.  Prediction of homoprotein and heteroprotein complexes by protein docking and template‐based modeling: A CASP‐CAPRI experiment , 2016, Proteins.

[28]  Susan Jones,et al.  Evaluating conformational changes in protein structures binding RNA , 2007, Proteins.

[29]  Xiang-Sun Zhang,et al.  De novo prediction of RNA-protein interactions from sequence information. , 2013, Molecular bioSystems.

[30]  I. Longden,et al.  EMBOSS: the European Molecular Biology Open Software Suite. , 2000, Trends in genetics : TIG.

[31]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[32]  Shi-jie Chen RNA folding: conformational statistics, folding kinetics, and ion electrostatics. , 2008, Annual review of biophysics.

[33]  Ana Kozomara,et al.  miRBase: annotating high confidence microRNAs using deep sequencing data , 2013, Nucleic Acids Res..

[34]  Julie L. Yang,et al.  Affinity regression predicts the recognition code of nucleic acid binding proteins , 2015, Nature Biotechnology.

[35]  S. Gerstberger,et al.  A census of human RNA-binding proteins , 2014, Nature Reviews Genetics.

[36]  Yangyu Huang,et al.  A novel protocol for three-dimensional structure prediction of RNA-protein complexes , 2013, Scientific Reports.

[37]  Ilya A Vakser,et al.  Protein–protein alternative binding modes do not overlap , 2013, Protein science : a publication of the Protein Society.

[38]  Scott B. Dewell,et al.  Transcriptome-wide Identification of RNA-Binding Protein and MicroRNA Target Sites by PAR-CLIP , 2010, Cell.

[39]  Amita Barik,et al.  A protein–RNA docking benchmark (I): Nonredundant cases , 2012, Proteins.

[40]  Jun Yu,et al.  LncRNAWiki: harnessing community knowledge in collaborative curation of human long non-coding RNAs , 2014, Nucleic Acids Res..

[41]  V. Suresh,et al.  RPI-Pred: predicting ncRNA-protein interaction using sequence and structural information , 2015, Nucleic acids research.

[42]  Howard Y. Chang,et al.  Long Noncoding RNA as Modular Scaffold of Histone Modification Complexes , 2010, Science.

[43]  Chang-Shung Tung,et al.  Rise of the RNA machines: exploring the structure of long non-coding RNAs. , 2013, Journal of molecular biology.

[44]  R. Russell,et al.  The relationship between sequence and interaction divergence in proteins. , 2003, Journal of molecular biology.

[45]  Rohita Sinha,et al.  Docking by structural similarity at protein‐protein interfaces , 2010, Proteins.

[46]  Marc A. Martí-Renom,et al.  RNA structure alignment by a unit-vector approach , 2008, ECCB.

[47]  Jordan M. Komisarow,et al.  RIP-Chip: the isolation and identification of mRNAs, microRNAs and protein components of ribonucleoprotein complexes from cell extracts , 2006, Nature Protocols.

[48]  B. Frey,et al.  Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning , 2015, Nature Biotechnology.

[49]  Ilya A Vakser,et al.  Structural templates for modeling homodimers , 2013, Protein science : a publication of the Protein Society.

[50]  Vasant Honavar,et al.  Predicting RNA-Protein Interactions Using Only Sequence Information , 2011, BMC Bioinformatics.

[51]  Xiaoqin Zou,et al.  A knowledge-based scoring function for protein-RNA interactions derived from a statistical mechanics-based iterative method , 2014, Nucleic acids research.

[52]  Ilya A Vakser,et al.  Low-resolution structural modeling of protein interactome. , 2013, Current opinion in structural biology.

[53]  D. Lipman,et al.  Improved tools for biological sequence comparison. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[54]  Janusz M Bujnicki,et al.  Computational modeling of protein-RNA complex structures. , 2014, Methods.

[55]  J. Su,et al.  A new residue‐nucleotide propensity potential with structural information considered for discriminating protein‐RNA docking decoys , 2012, Proteins.

[56]  Norman E. Davey,et al.  Insights into RNA Biology from an Atlas of Mammalian mRNA-Binding Proteins , 2012, Cell.

[57]  Zhiping Weng,et al.  Evaluating template-based and template-free protein-protein complex structure prediction , 2014, Briefings Bioinform..

[58]  J. Bernauer,et al.  Protein-RNA Complexes and Efficient Automatic Docking: Expanding RosettaDock Possibilities , 2014, PloS one.