Bioinformatics Tools and Benchmarks for Computational Docking and 3D Structure Prediction of RNA-Protein Complexes

RNA-protein (RNP) interactions play essential roles in many biological processes, such as regulation of co-transcriptional and post-transcriptional gene expression, RNA splicing, transport, storage and stabilization, as well as protein synthesis. An increasing number of RNP structures would aid in a better understanding of these processes. However, due to the technical difficulties associated with experimental determination of macromolecular structures by high-resolution methods, studies on RNP recognition and complex formation present significant challenges. As an alternative, RNP interactions can be predicted computationally. Structural models obtained by theoretical predictive methods are, in general, less reliable than models based on experimental measurements, but they can be sufficiently accurate to serve as a basis for formulating functional hypotheses. In this article, we present an overview of computational methods for 3D structure prediction of RNP complexes. We discuss currently available methods for macromolecular docking and, in particular, for scoring 3D structural models of RNP complexes. We also review benchmarks that have been developed to assess the accuracy of these methods.
