Mining flexible-receptor docking experiments to select promising protein receptor snapshots

BackgroundMolecular docking simulation is the Rational Drug Design (RDD) step that investigates the affinity between protein receptors and ligands. Typically, molecular docking algorithms consider receptors as rigid bodies. Receptors are, however, intrinsically flexible in the cellular environment. The use of a time series of receptor conformations is an approach to explore its flexibility in molecular docking computer simulations, but it is extensively time-consuming. Hence, selection of the most promising conformations can accelerate docking experiments and, consequently, the RDD efforts.ResultsWe previously docked four ligands (NADH, TCL, PIF and ETH) to 3,100 conformations of the InhA receptor from M. tuberculosis. Based on the receptor residues-ligand distances we preprocessed all docking results to generate appropriate input to mine data. Data preprocessing was done by calculating the shortest interatomic distances between the ligand and the receptor’s residues for each docking result. They were the predictive attributes. The target attribute was the estimated free-energy of binding (FEB) value calculated by the AutodDock3.0.5 software. The mining inputs were submitted to the M5P model tree algorithm. It resulted in short and understandable trees. On the basis of the correlation values, for NADH, TCL and PIF we obtained more than 95% correlation while for ETH, only about 60%. Post processing the generated model trees for each of its linear models (LMs), we calculated the average FEB for their associated instances. From these values we considered a LM as representative if its average FEB was smaller than or equal the average FEB of the test set. The instances in the selected LMs were considered the most promising snapshots. It totalized 1,521, 1,780, 2,085 and 902 snapshots, for NADH, TCL, PIF and ETH respectively.ConclusionsBy post processing the generated model trees we were able to propose a criterion of selection of linear models which, in turn, is capable of selecting a set of promising receptor conformations. As future work we intend to go further and use these results to elaborate a strategy to preprocess the receptors 3-D spatial conformation in order to predict FEB values. Besides, we intend to select other compounds, among the million catalogued, that may be promising as new drug candidates for our particular protein receptor target.

[1]  S. Kim,et al.  "Soft docking": matching of molecular surface cubes. , 1991, Journal of molecular biology.

[2]  I. Kuntz Structure-Based Strategies for Drug Design and Discovery , 1992, Science.

[3]  J. R. Quinlan Learning With Continuous Classes , 1992 .

[4]  A. Leach,et al.  Ligand docking to proteins with discrete side-chain flexibility. , 1994, Journal of molecular biology.

[5]  Yuan-Ping Pang,et al.  Prediction of the binding sites of huperzine A in acetylcholinesterase by docking studies , 1994, J. Comput. Aided Mol. Des..

[6]  T. Lybrand Ligand-protein docking and rational drug design. , 1995, Current Opinion in Structural Biology.

[7]  J. Sacchettini,et al.  Crystal structure and function of the isoniazid target of Mycobacterium tuberculosis , 1995, Science.

[8]  G. A. Jeffrey,et al.  An Introduction to Hydrogen Bonding , 1997 .

[9]  P Willett,et al.  Development and validation of a genetic algorithm for flexible docking. , 1997, Journal of molecular biology.

[10]  I. Kuntz,et al.  Molecular docking to ensembles of protein structures. , 1997, Journal of molecular biology.

[11]  David S. Goodsell,et al.  Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function , 1998 .

[12]  Amedeo Caflisch,et al.  Docking small ligands in flexible binding sites , 1998, J. Comput. Chem..

[13]  David S. Goodsell,et al.  Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function , 1998, J. Comput. Chem..

[14]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[15]  Ernst Althaus,et al.  A combinatorial approach to protein docking with flexible side-chains , 2000, RECOMB '00.

[16]  Paul D Lyne,et al.  Structure-based virtual screening: an overview. , 2002, Drug discovery today.

[17]  Ernst Althaus,et al.  A Combinatorial Approach to Protein Docking with Flexible Side Chains , 2002, J. Comput. Biol..

[18]  Djamal Bouzida,et al.  Complexity and simplicity of ligand-macromolecule interactions: the energy landscape perspective. , 2002, Current opinion in structural biology.

[19]  D. Goodsell,et al.  Automated docking to multiple target structures: Incorporation of protein mobility and structural water heterogeneity in AutoDock , 2002, Proteins.

[20]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[21]  J. Mccammon,et al.  Computational drug design accommodating receptor flexibility: the relaxed complex scheme. , 2002, Journal of the American Chemical Society.

[22]  M L Teodoro,et al.  Conformational flexibility models for the receptor in structure based drug design. , 2003, Current pharmaceutical design.

[23]  David Alland,et al.  Targeting Tuberculosis and Malaria through Inhibition of Enoyl Reductase , 2003, Journal of Biological Chemistry.

[24]  A. Bourinbaiar,et al.  Low-cost anti-HIV compounds: potential application for AIDS therapy in developing countries. , 2003, Current pharmaceutical design.

[25]  Jung-Hsin Lin,et al.  The relaxed complex method: Accommodating receptor flexibility for drug design with an improved scoring scheme. , 2003, Biopolymers.

[26]  L. A. Basso,et al.  An inorganic iron complex that inhibits wild-type and an isoniazid-resistant mutant 2-trans-enoyl-ACP (CoA) reductase from Mycobacterium tuberculosis. , 2004, Chemical communications.

[27]  Brian K. Shoichet,et al.  ZINC - A Free Database of Commercially Available Compounds for Virtual Screening , 2005, J. Chem. Inf. Model..

[28]  Holger Gohlke,et al.  The Amber biomolecular simulation programs , 2005, J. Comput. Chem..

[29]  O. N. de Souza,et al.  Molecular dynamics simulation studies of the wild-type, I21V, and I16T mutants of isoniazid-resistant Mycobacterium tuberculosis enoyl reductase (InhA) in complex with NADH: toward the understanding of NADH-InhA different affinities. , 2005, Biophysical journal.

[30]  J. Gready,et al.  Combining docking and molecular dynamic simulations in drug design , 2006, Medicinal research reviews.

[31]  O. N. de Souza,et al.  Slow-onset inhibition of 2-trans-enoyl-ACP (CoA) reductase from Mycobacterium tuberculosis by an inorganic complex. , 2006, Current pharmaceutical design.

[32]  Osmar Norberto de Souza,et al.  Automating Molecular Docking with Explicit Receptor Flexibility Using Scientific Workflows , 2007, BSB.

[33]  X. Zou,et al.  Ensemble docking of multiple protein structures: Considering protein structural variations in molecular docking , 2006, Proteins.

[34]  P. Tonge,et al.  Probing mechanisms of resistance to the tuberculosis drug isoniazid: Conformational changes caused by inhibition of InhA, the enoyl reductase from Mycobacterium tuberculosis , 2007, Protein science : a publication of the Protein Society.

[35]  James C. Sacchettini,et al.  Mechanism of thioamide drug action against tuberculosis and leprosy , 2007, The Journal of experimental medicine.

[36]  C Thomas Caskey,et al.  The drug development crisis: efficiency and safety. , 2007, Annual review of medicine.

[37]  Holger Gohlke,et al.  Target flexibility: an emerging consideration in drug discovery and design. , 2008, Journal of medicinal chemistry.

[38]  R. Abagyan,et al.  Flexible ligand docking to multiple receptor conformations: a practical alternative. , 2008, Current opinion in structural biology.

[39]  Rommie E. Amaro,et al.  An improved relaxed complex scheme for receptor flexibility in computer-aided drug design , 2008, J. Comput. Aided Mol. Des..

[40]  Chung F Wong,et al.  Flexible ligand-flexible protein docking in protein kinase systems. , 2008, Biochimica et biophysica acta.

[41]  Wagner Meira,et al.  Protein cutoff scanning: A comparative analysis of cutoff dependent and cutoff free methods for prospecting contacts in proteins , 2009, Proteins.

[42]  Osmar Norberto de Souza,et al.  FReDD: Supporting Mining Strategies through a Flexible-Receptor Docking Database , 2009, BSB.

[43]  Somesh D. Sharma,et al.  Managing protein flexibility in docking and its applications. , 2009, Drug discovery today.

[44]  Alex Alves Freitas,et al.  On the Importance of Comprehensible Classification Models for Protein Function Prediction , 2010, IEEE/ACM Transactions on Computational Biology and Bioinformatics.