A combinatorial scoring function for protein–RNA docking

Protein–RNA docking remains an open problem. One of the main challenges is developing an effective scoring function that can discriminate near-native structures from incorrect ones. To address this, we have constructed a knowledge-based residue-nucleotide pairwise potential that takes secondary structure information into account for nonribosomal protein–RNA docking. Here we develop a weighted combined scoring function, RpveScore, that consists of this pairwise potential and six physics-based energy terms. The weights were optimized by multiple linear regression, fitting the scoring function to L_rmsd for the bound docking decoys from Benchmark II. The scoring functions were tested on 35 unbound docking cases. The results show that RpveScore with all terms included performs best. RpveScore was also compared with ITScore-PR, a potential derived by a statistical mechanics-based method, and with the united atom-based statistical potentials QUASI-RNP and DARS-RNP. The success rate of RpveScore is 71.6% for the top 1000 structures, and a near-native structure is ranked in the top 30 in 25 of the 35 cases. For 32 systems (91.4%), RpveScore finds within the top 5 a binding mode that contains at least 50% of the native interface residues on the protein and nucleotides on the RNA. Additionally, the long-range electrostatic attractive energy was found to play an important role in distinguishing near-native structures from incorrect ones. This work may aid the development of protein–RNA docking methods and the understanding of protein–RNA interactions. The RpveScore program is publicly available at http://life.bjut.edu.cn/kxyj/kycg/2017116/14845362285362368_1.html

Proteins 2017; 85:741–752. © 2016 Wiley Periodicals, Inc.
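The weight-fitting step described above can be illustrated with a minimal sketch. It assumes each decoy is represented by seven energy terms (one pairwise potential plus six physics-based terms) and that the weights come from an ordinary least-squares fit of the weighted sum to L_rmsd. The function names, the intercept term, and the synthetic data are hypothetical and are not taken from the RpveScore program; the actual terms, weights, and fitting details may differ.

```python
import numpy as np


def fit_weights(terms, l_rmsd):
    """Least-squares fit of l_rmsd ~ terms @ w + b; returns (w, b).

    terms:  (n_decoys, n_terms) array of energy terms per decoy.
    l_rmsd: (n_decoys,) array of ligand RMSD values for the same decoys.
    The intercept b is an assumption of this sketch.
    """
    design = np.column_stack([terms, np.ones(len(terms))])
    coef, *_ = np.linalg.lstsq(design, l_rmsd, rcond=None)
    return coef[:-1], coef[-1]


def combined_score(terms, w, b=0.0):
    """Weighted sum of energy terms; lower means predicted closer to native."""
    return terms @ w + b


def rank_decoys(terms, w, b=0.0):
    """Indices of decoys sorted from best (lowest score) to worst."""
    return np.argsort(combined_score(terms, w, b))


# Synthetic, noise-free data for illustration only (not real docking decoys).
rng = np.random.default_rng(0)
n_decoys, n_terms = 200, 7  # 1 pairwise potential + 6 physics-based terms
terms = rng.normal(size=(n_decoys, n_terms))
true_w = np.array([1.5, 0.8, -0.3, 0.5, 1.0, 0.2, 0.7])  # hypothetical weights
l_rmsd = terms @ true_w + 10.0

w, b = fit_weights(terms, l_rmsd)
order = rank_decoys(terms, w, b)
```

Because the score is fitted to L_rmsd, sorting decoys by ascending score puts the predicted near-native structures first. On real decoys the fit would be noisy, and the quality of the ranking would depend on how well the terms correlate with L_rmsd.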
