Evaluation of the novel algorithm of flexible ligand docking with moveable target-protein atoms

We present the novel docking algorithm based on the Tensor Train decomposition and the TT-Cross global optimization. The algorithm is applied to the docking problem with flexible ligand and moveable protein atoms. The energy of the protein-ligand complex is calculated in the frame of the MMFF94 force field in vacuum. The grid of precalculated energy potentials of probe ligand atoms in the field of the target protein atoms is not used. The energy of the protein-ligand complex for any given configuration is computed directly with the MMFF94 force field without any fitting parameters. The conformation space of the system coordinates is formed by translations and rotations of the ligand as a whole, by the ligand torsions and also by Cartesian coordinates of the selected target protein atoms. Mobility of protein and ligand atoms is taken into account in the docking process simultaneously and equally. The algorithm is realized in the novel parallel docking SOL-P program and results of its performance for a set of 30 protein-ligand complexes are presented. Dependence of the docking positioning accuracy is investigated as a function of parameters of the docking algorithm and the number of protein moveable atoms. It is shown that mobility of the protein atoms improves docking positioning accuracy. The SOL-P program is able to perform docking of a flexible ligand into the active site of the target protein with several dozens of protein moveable atoms: the native crystallized ligand pose is correctly found as the global energy minimum in the search space with 157 dimensions using 4700 CPU ∗ h at the Lomonosov supercomputer.

[1]  Jeffrey S. Vetter,et al.  Contemporary High Performance Computing - From Petascale toward Exascale , 2019, Chapman and Hall / CRC computational science series.

[2]  John A. Nelder,et al.  A Simplex Method for Function Minimization , 1965, Comput. J..

[3]  V. B. Sulimov,et al.  Accuracy comparison of several common implicit solvent models and their implementations in the context of protein-ligand binding. , 2017, Journal of molecular graphics & modelling.

[4]  Christopher R. Corbeil,et al.  Docking Ligands into Flexible and Solvated Macromolecules, 1. Development and Validation of FITTED 1.0 , 2007, J. Chem. Inf. Model..

[5]  Tamara G. Kolda,et al.  Tensor Decompositions and Applications , 2009, SIAM Rev..

[6]  Jorge Nocedal,et al.  A Limited Memory Algorithm for Bound Constrained Optimization , 1995, SIAM J. Sci. Comput..

[7]  S. Goreinov,et al.  A Theory of Pseudoskeleton Approximations , 1997 .

[8]  Eugene E. Tyrtyshnikov,et al.  Breaking the Curse of Dimensionality, Or How to Use SVD in Many Dimensions , 2009, SIAM J. Sci. Comput..

[9]  David S. Goodsell,et al.  AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility , 2009, J. Comput. Chem..

[10]  Feng Ding,et al.  Rapid Flexible Docking Using a Stochastic Rotamer Library of Ligands , 2010, J. Chem. Inf. Model..

[11]  Eugene E. Tyrtyshnikov,et al.  Evaluation of the docking algorithm based on tensor train global pptimization , 2015 .

[12]  Alexander D. MacKerell,et al.  CHARMM general force field: A force field for drug‐like molecules compatible with the CHARMM all‐atom additive biological force fields , 2009, J. Comput. Chem..

[13]  Eugene E. Tyrtyshnikov,et al.  Incomplete Cross Approximation in the Mosaic-Skeleton Method , 2000, Computing.

[14]  Thomas A. Halgren Merck molecular force field. I. Basis, form, scope, parameterization, and performance of MMFF94 , 1996, J. Comput. Chem..

[15]  L. Kuhn,et al.  Virtual screening with solvation and ligand-induced complementarity , 2000 .

[16]  Donald Ervin Knuth,et al.  The Art of Computer Programming , 1968 .

[17]  S. Goreinov,et al.  How to find a good submatrix , 2010 .

[18]  T. Rowan Functional stability analysis of numerical algorithms , 1990 .

[19]  F. Bushman,et al.  Developing a dynamic pharmacophore model for HIV-1 integrase. , 2000, Journal of medicinal chemistry.

[20]  David S. Goodsell,et al.  Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function , 1998, J. Comput. Chem..

[21]  Harold W. Kuhn,et al.  The Hungarian method for the assignment problem , 1955, 50 Years of Integer Programming.

[22]  Ruben Abagyan,et al.  ICM—A new method for protein modeling and design: Applications to docking and structure prediction from the distorted native conformation , 1994, J. Comput. Chem..

[23]  Wei Chen,et al.  Modeling Protein-Ligand Binding by Mining Minima. , 2010, Journal of chemical theory and computation.

[24]  Alexander V. Tikhonravov,et al.  "Lomonosov": Supercomputing at Moscow State University , 2013, HiPC 2013.

[25]  Vladimir B. Sulimov,et al.  New Synthetic Thrombin Inhibitors: Molecular Design and Experimental Verification , 2011, PloS one.

[26]  R. Friesner,et al.  Novel procedure for modeling ligand/receptor induced fit effects. , 2006, Journal of medicinal chemistry.

[27]  William J. Allen,et al.  DOCK 6: Impact of new features and current docking performance , 2015, J. Comput. Chem..

[28]  W. L. Jorgensen,et al.  Development and Testing of the OPLS All-Atom Force Field on Conformational Energetics and Properties of Organic Liquids , 1996 .

[29]  E. Tyrtyshnikov,et al.  TT-cross approximation for multidimensional arrays , 2010 .

[30]  Jorge Nocedal,et al.  Algorithm 778: L-BFGS-B: Fortran subroutines for large-scale bound-constrained optimization , 1997, TOMS.

[31]  Junmei Wang,et al.  Development and testing of a general amber force field , 2004, J. Comput. Chem..

[32]  William H. Press,et al.  Numerical Recipes: The Art of Scientific Computing , 1987 .

[33]  Victor Guallar,et al.  Exploring hierarchical refinement techniques for induced fit docking with protein and ligand flexibility , 2009, J. Comput. Chem..

[34]  A. Leach,et al.  Ligand docking to proteins with discrete side-chain flexibility. , 1994, Journal of molecular biology.

[35]  S. Kim,et al.  "Soft docking": matching of molecular surface cubes. , 1991, Journal of molecular biology.

[36]  F. Grigoriev,et al.  Surface Generalized Born Method: A Simple, Fast, and Precise Implicit Solvent Model beyond the Coulomb Approximation , 2004 .

[37]  A. V. Sulimov,et al.  Application of Molecular Modeling to Urokinase Inhibitors Development , 2014, BioMed research international.

[38]  Adam Pecina,et al.  The SQM/COSMO filter: reliable native pose identification based on the quantum-mechanical description of protein-ligand interactions and implicit COSMO solvation. , 2016, Chemical communications.

[39]  L. Kavraki,et al.  Understanding the challenges of protein flexibility in drug design , 2015, Expert opinion on drug discovery.

[40]  Matteo Masetti,et al.  Protein Flexibility in Drug Discovery: From Theory to Computation , 2015, ChemMedChem.

[41]  Jonathan Kadmon,et al.  Molecular dynamics simulations of palmitate entry into the hydrophobic pocket of the fatty acid binding protein , 2007, FEBS letters.

[42]  Somesh D. Sharma,et al.  Managing protein flexibility in docking and its applications. , 2009, Drug discovery today.

[43]  David L. Mobley,et al.  Guidelines for the analysis of free energy calculations , 2015, Journal of Computer-Aided Molecular Design.

[44]  Danil C. Kutov,et al.  Application of the Docking Program SOL for CSAR Benchmark , 2013, J. Chem. Inf. Model..

[45]  Ruben Abagyan,et al.  Docking and scoring with ICM: the benchmarking results and strategies for improvement , 2012, Journal of Computer-Aided Molecular Design.

[46]  Edward W. Lowe,et al.  Computational Methods in Drug Discovery , 2014, Pharmacological Reviews.

[47]  P. Sokkar,et al.  Computational modeling on the recognition of the HRE motif by HIF-1: molecular docking and molecular dynamics studies , 2012, Journal of Molecular Modeling.

[48]  Thomas Lengauer,et al.  FlexE: efficient molecular docking considering protein structure variations. , 2001, Journal of molecular biology.

[49]  P. Kollman,et al.  A Second Generation Force Field for the Simulation of Proteins, Nucleic Acids, and Organic Molecules , 1995 .

[50]  David S. Goodsell,et al.  A semiempirical free energy force field with charge‐based desolvation , 2007, J. Comput. Chem..

[51]  D. Goodsell,et al.  Automated docking to multiple target structures: Incorporation of protein mobility and structural water heterogeneity in AutoDock , 2002, Proteins.

[52]  S. Goreinov,et al.  The maximum-volume concept in approximation by low-rank matrices , 2001 .

[53]  Martin Smiesko DOLINA - Docking Based on a Local Induced-Fit Algorithm: Application toward Small-Molecule Binding to Nuclear Receptors , 2013, J. Chem. Inf. Model..

[54]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[55]  J. Tainer,et al.  Screening a peptidyl database for potential ligands to proteins with side‐chain flexibility , 1998, Proteins.

[56]  Amedeo Caflisch,et al.  Docking small ligands in flexible binding sites , 1998, J. Comput. Chem..

[57]  M. Rosales-Hernández,et al.  o-Alkylselenenylated benzoic acid accesses several sites in serum albumin according to fluorescence studies, Raman spectroscopy and theoretical simulations. , 2013, Protein and peptide letters.

[58]  Danil C. Kutov,et al.  Combined Docking with Classical Force Field and Quantum Chemical Semiempirical Method PM7 , 2017, Adv. Bioinformatics.

[59]  Gennady M Verkhivker,et al.  Predicting structural effects in HIV‐1 protease mutant complexes with flexible ligand docking and protein side‐chain optimization , 1998, Proteins.

[60]  Ivan Oseledets,et al.  Tensor-Train Decomposition , 2011, SIAM J. Sci. Comput..

[61]  F. Ataullakhanov,et al.  Application of Molecular Modeling to Development of New Factor Xa Inhibitors , 2015, BioMed research international.

[62]  Brian K. Shoichet,et al.  The incorporation of protein flexibility and conformational energy penalties in docking screens to improve ligand discovery , 2014, Nature chemistry.

[63]  Vladimir V. Voevodin,et al.  Evaluation of Docking Target Functions by the Comprehensive Investigation of Protein-Ligand Energy Minima , 2015, Adv. Bioinformatics.

[64]  António J. M. Ribeiro,et al.  Protein-ligand docking in the new millennium--a retrospective of 10 years in the field. , 2013, Current medicinal chemistry.

[65]  K. Dill,et al.  Binding of small-molecule ligands to proteins: "what you see" is not always "what you get". , 2009, Structure.

[66]  R. Glen,et al.  Molecular recognition of receptor sites using a genetic algorithm with a description of desolvation. , 1995, Journal of molecular biology.