Incorporating specificity into optimization: evaluation of SPA using CSAR 2014 and CASF 2013 benchmarks

Scoring functions of protein–ligand interactions are widely used in computationally docking software and structure-based drug discovery. Accurate prediction of the binding energy between the protein and the ligand is the main task of the scoring function. The accuracy of a scoring function is normally evaluated by testing it on the benchmarks of protein–ligand complexes. In this work, we report the evaluation analysis of an improved version of scoring function SPecificity and Affinity (SPA). By testing on two independent benchmarks Community Structure-Activity Resource (CSAR) 2014 and Comparative Assessment of Scoring Functions (CASF) 2013, the assessment shows that SPA is relatively more accurate than other compared scoring functions in predicting the interactions between the protein and the ligand. We conclude that the inclusion of the specificity in the optimization can effectively suppress the competitive state on the funnel-like binding energy landscape, and make SPA more accurate in identifying the “native” conformation and scoring the binding decoys. The evaluation of SPA highlights the importance of binding specificity in improving the accuracy of the scoring functions.

[1]  J. Janin,et al.  Quantifying biological specificity: the statistical mechanics of molecular recognition. , 1996, Proteins.

[2]  W A Koppensteiner,et al.  Knowledge-based potentials--back to the roots. , 1998, Biochemistry. Biokhimiia.

[3]  J. Correa-Basurto,et al.  Automated docking for novel drug discovery , 2013, Expert opinion on drug discovery.

[4]  Eric T. Kim,et al.  How does a drug molecule find its target binding site? , 2011, Journal of the American Chemical Society.

[5]  R. Nussinov,et al.  Folding funnels, binding funnels, and protein function , 1999, Protein science : a publication of the Protein Society.

[6]  Fenglou Mao,et al.  Potential of mean force for protein–protein interaction studies , 2002, Proteins.

[7]  Jesús S. Dehesa,et al.  Insight into the informational-structure behavior of the Diels-Alder reaction of cyclopentadiene and maleic anhydride , 2014, Journal of Molecular Modeling.

[8]  Xiaoqin Zou,et al.  A knowledge-based scoring function for protein-RNA interactions derived from a statistical mechanics-based iterative method , 2014, Nucleic acids research.

[9]  J. Onuchic,et al.  Funnels, pathways, and the energy landscape of protein folding: A synthesis , 1994, Proteins.

[10]  Liang Hu,et al.  A comparison of various optimization algorithms of protein–ligand docking programs by fitness accuracy , 2014, Journal of Molecular Modeling.

[11]  Eugene I Shakhnovich,et al.  Native atom types for knowledge-based potentials: application to binding energy prediction. , 2004, Journal of medicinal chemistry.

[12]  Julia M. Shifman,et al.  Exploring the origins of binding specificity through the computational redesign of calmodulin , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Richard D. Smith,et al.  CSAR Benchmark Exercise of 2010: Combined Evaluation Across All Submitted Scoring Functions , 2011, J. Chem. Inf. Model..

[14]  Jie Liu,et al.  Classification of Current Scoring Functions , 2015, J. Chem. Inf. Model..

[15]  Jie Li,et al.  PDB-wide collection of binding data: current status of the PDBbind database , 2015, Bioinform..

[16]  Gevorg Grigoryan,et al.  Design of protein-interaction specificity affords selective bZIP-binding peptides , 2009, Nature.

[17]  Peter G Wolynes,et al.  Protein topology determines binding mechanism. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[18]  J. Irwin,et al.  Benchmarking sets for molecular docking. , 2006, Journal of medicinal chemistry.

[19]  Brian K. Shoichet,et al.  Virtual screening of chemical libraries , 2004, Nature.

[20]  Zhiqiang Yan,et al.  Specificity quantification of biomolecular recognition and its implication for drug discovery , 2012, Scientific Reports.

[21]  Yu Liu,et al.  FIPSDock: A new molecular docking technique driven by fully informed swarm optimization algorithm , 2013, J. Comput. Chem..

[22]  J. Bajorath,et al.  Docking and scoring in virtual screening for drug discovery: methods and applications , 2004, Nature Reviews Drug Discovery.

[23]  M. Dickson,et al.  Key factors in the rising cost of new drug discovery and development , 2004, Nature Reviews Drug Discovery.

[24]  P. Harbury,et al.  Automated design of specificity in molecular recognition , 2003, Nature Structural Biology.

[25]  Xiliang Zheng,et al.  Quantifying intrinsic specificity: a potential complement to affinity in drug screening. , 2007, Physical review letters.

[26]  J. Janin,et al.  Principles of protein-protein recognition from structure to thermodynamics. , 1995, Biochimie.

[27]  Jun-tao Guo,et al.  Quantitative evaluation of protein–DNA interactions using an optimized knowledge-based potential , 2005, Nucleic acids research.

[28]  Xiliang Zheng,et al.  Thermodynamic and kinetic specificities of ligand binding , 2013 .

[29]  Jing Li,et al.  Knowledge-Based Scoring Functions in Drug Design: 2. Can the Knowledge Base Be Enriched? , 2011, J. Chem. Inf. Model..

[30]  Eugene I Shakhnovich,et al.  Structural mining: self-consistent design on flexible protein-peptide docking and transferable binding affinity potential. , 2004, Journal of the American Chemical Society.

[31]  T. Baker,et al.  Specificity versus stability in computational protein design. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[32]  Renxiao Wang,et al.  The PDBbind database: collection of binding affinities for protein-ligand complexes with known three-dimensional structures. , 2004, Journal of medicinal chemistry.

[33]  Stefano Forli,et al.  Virtual screening with AutoDock: theory and practice , 2010, Expert opinion on drug discovery.

[34]  Jung-Hsin Lin,et al.  Scoring functions for prediction of protein-ligand interactions. , 2013, Current pharmaceutical design.

[35]  Zhiqiang Yan,et al.  Optimizing the affinity and specificity of ligand binding with the inclusion of solvation effect , 2015, Proteins.

[36]  Jin Wang,et al.  Specificity and affinity quantification of protein-protein interactions , 2013, Bioinform..

[37]  Zhihai Liu,et al.  Comparative Assessment of Scoring Functions on an Updated Benchmark: 2. Evaluation Methods and General Results , 2014, J. Chem. Inf. Model..

[38]  Xiaoqin Zou,et al.  Scoring functions and their evaluation methods for protein-ligand docking: recent advances and future directions. , 2010, Physical chemistry chemical physics : PCCP.

[39]  K A Dill,et al.  Ligand binding to proteins: The binding landscape model , 1997, Protein science : a publication of the Protein Society.

[40]  D. Baker,et al.  Computational redesign of endonuclease DNA binding and cleavage specificity , 2006, Nature.

[41]  Chris Morley,et al.  Open Babel: An open chemical toolbox , 2011, J. Cheminformatics.

[42]  Jie Li,et al.  Comparative Assessment of Scoring Functions on an Updated Benchmark: 1. Compilation of the Test Set , 2014, J. Chem. Inf. Model..

[43]  Gennady M Verkhivker,et al.  Energy landscape theory, funnels, specificity, and optimal criterion of biomolecular binding. , 2003, Physical review letters.

[44]  Asad U Khan,et al.  Structure based virtual screening to discover putative drug candidates: necessary considerations and successful case studies. , 2015, Methods.

[45]  Jian Zhang,et al.  Design and designability of protein-based assemblies. , 2014, Current opinion in structural biology.

[46]  Zhirong Sun,et al.  Quantitative prediction of protein–protein binding affinity with a potential of mean force considering volume correction , 2009, Protein science : a publication of the Protein Society.

[47]  Gennady M Verkhivker,et al.  Unraveling principles of lead discovery: from unfrustrated energy landscapes to novel molecular anchors. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[48]  D. Baker,et al.  Computational redesign of protein-protein interaction specificity , 2004, Nature Structural &Molecular Biology.

[49]  Richard D. Smith,et al.  CSAR Benchmark Exercise 2011–2012: Evaluation of Results from Docking and Relative Ranking of Blinded Congeneric Series , 2013, J. Chem. Inf. Model..

[50]  Renxiao Wang,et al.  The PDBbind database: methodologies and updates. , 2005, Journal of medicinal chemistry.

[51]  Maria João Ramos,et al.  Virtual screening in drug design and development. , 2010, Combinatorial chemistry & high throughput screening.

[52]  Song Liu,et al.  A knowledge-based energy function for protein-ligand, protein-protein, and protein-DNA complexes. , 2005, Journal of medicinal chemistry.

[53]  Zhiqiang Yan,et al.  Optimizing Scoring Function of Protein-Nucleic Acid Interactions with Both Affinity and Specificity , 2013, PloS one.