Solvent structure improves docking prediction in lectin-carbohydrate complexes.

Recognition and complex formation between proteins and carbohydrates is a key issue in many important biological processes. Determination of the three-dimensional structure of such complexes is thus most relevant, but particularly challenging because of their usually low binding affinity. In silico docking methods have a long-standing tradition in predicting protein-ligand complexes, and allow a potentially fast exploration of a number of possible protein-carbohydrate complex structures. However, determining which of these predicted complexes represents the correct structure is not always straightforward. In this work, we present a modification of the scoring function provided by AutoDock4, a widely used docking software, on the basis of analysis of the solvent structure adjacent to the protein surface, as derived from molecular dynamics simulations, that allows the definition and characterization of regions with higher water occupancy than the bulk solvent, called water sites. They mimic the interaction held between the carbohydrate -OH groups and the protein. We used this information for an improved docking method in relation to its capacity to correctly predict the protein-carbohydrate complexes for a number of tested proteins, whose ligands range in size from mono- to tetrasaccharide. Our results show that the presented method significantly improves the docking predictions. The resulting solvent-structure-biased docking protocol, therefore, appears as a powerful tool for the design and optimization of development of glycomimetic drugs, while providing new insights into protein-carbohydrate interactions. Moreover, the achieved improvement also underscores the relevance of the solvent structure to the protein carbohydrate recognition process.

[1]  T. Lazaridis Inhomogeneous Fluid Approach to Solvation Thermodynamics. 1. Theory , 1998 .

[2]  M. Akke,et al.  The Carbohydrate-Binding Site in Galectin-3 Is Preorganized To Recognize a Sugarlike Framework of Oxygens: Ultra-High-Resolution Structures and Water Dynamics , 2011, Biochemistry.

[3]  B. Berne,et al.  Role of the active-site solvent in the thermodynamics of factor Xa ligand binding. , 2008, Journal of the American Chemical Society.

[4]  Marcelo A Martí,et al.  Characterization of the galectin-1 carbohydrate recognition domain in terms of solvent occupancy. , 2007, The journal of physical chemistry. B.

[5]  D. Logan,et al.  Structural basis for carbohydrate-binding specificity--a comparative assessment of two engineered carbohydrate-binding modules. , 2012, Glycobiology.

[6]  A. Boraston,et al.  Carbohydrate recognition by a large sialidase toxin from Clostridium perfringens. , 2007, Biochemistry.

[7]  David S. Goodsell,et al.  Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function , 1998 .

[8]  Oliver Kohlbacher,et al.  SLICK — Scoring and Energy Functions for Protein—Carbohydrate Interactions. , 2006 .

[9]  Elizabeth Yuriev,et al.  Molecular Docking of Carbohydrate Ligands to Antibodies: Structural Validation against Crystal Structures , 2009, J. Chem. Inf. Model..

[10]  Themis Lazaridis,et al.  Inhomogeneous Fluid Approach to Solvation Thermodynamics. 2. Applications to Simple Fluids , 1998 .

[11]  W. Weis,et al.  Structural Basis for Selective Recognition of Oligosaccharides by DC-SIGN and DC-SIGNR , 2001, Science.

[12]  David S. Goodsell,et al.  Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function , 1998, J. Comput. Chem..

[13]  Robert J Woods,et al.  Computational glycoscience: characterizing the spatial and temporal properties of glycans and glycan-protein complexes. , 2010, Current opinion in structural biology.

[14]  Themis Lazaridis,et al.  The effect of water displacement on binding thermodynamics: concanavalin A. , 2005, The journal of physical chemistry. B.

[15]  Brian K Shoichet,et al.  Prediction of protein-ligand interactions. Docking and scoring: successes and gaps. , 2006, Journal of medicinal chemistry.

[16]  David S. Goodsell,et al.  AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility , 2009, J. Comput. Chem..

[17]  Themis Lazaridis,et al.  Thermodynamic contributions of the ordered water molecule in HIV-1 protease. , 2003, Journal of the American Chemical Society.

[18]  Holger Gohlke,et al.  The Amber biomolecular simulation programs , 2005, J. Comput. Chem..

[19]  I. Campbell,et al.  Structures of the Cd44–hyaluronan complex provide insight into a fundamental carbohydrate-protein interaction , 2007, Nature Structural &Molecular Biology.

[20]  Arthur J. Olson,et al.  AutoDock Vina: Improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading , 2009, J. Comput. Chem..

[21]  Dario A. Estrin,et al.  An Integrated Computational Analysis of the Structure, Dynamics, and Ligand Binding Interactions of the Human Galectin Network , 2011, J. Chem. Inf. Model..

[22]  G. Rabinovich,et al.  When galectins recognize glycans: from biochemistry to physiology and back again. , 2011, Biochemistry.

[23]  Jon Hill,et al.  Cheminformatic/bioinformatic analysis of large corporate databases: Application to drug repurposing , 2011 .

[24]  G. Rabinovich Galectin-1 as a potential cancer target , 2005, British Journal of Cancer.

[25]  Ajit Varki,et al.  Siglecs and their roles in the immune system , 2007, Nature Reviews Immunology.

[26]  Jonathan W. Essex,et al.  A review of protein-small molecule docking methods , 2002, J. Comput. Aided Mol. Des..

[27]  J. Balzarini Targeting the glycans of glycoproteins: a novel paradigm for antiviral therapy , 2007, Nature Reviews Microbiology.

[28]  Robert J Woods,et al.  Molecular simulations of carbohydrates and protein-carbohydrate interactions: motivation, issues and prospects. , 2010, Drug discovery today.

[29]  V. Hornak,et al.  Comparison of multiple Amber force fields and development of improved protein backbone parameters , 2006, Proteins.

[30]  Maureen E. Taylor,et al.  Structural insights into what glycan arrays tell us about how glycan-binding proteins interact with their ligands , 2009, Glycobiology.

[31]  J. Drews Drug discovery: a historical perspective. , 2000, Science.

[32]  M. Martí,et al.  Structural basis for ligand recognition in a mushroom lectin: solvent structure as specificity predictor. , 2011, Carbohydrate research.

[33]  J. Hirabayashi Lectin-based structural glycomics: Glycoproteomics and glycan profiling , 2004, Glycoconjugate Journal.

[34]  Baldomero Oliva,et al.  How different from random are docking predictions when ranked by scoring functions? , 2010, Proteins.

[35]  W. L. Jorgensen,et al.  Energetics of displacing water molecules from protein binding sites: consequences for ligand optimization. , 2009, Journal of the American Chemical Society.

[36]  David S. Goodsell,et al.  Distributed automated docking of flexible ligands to proteins: Parallel applications of AutoDock 2.4 , 1996, J. Comput. Aided Mol. Des..

[37]  C. F. Brewer,et al.  Lectins as pattern recognition molecules: the effects of epitope density in innate immunity. , 2010, Glycobiology.

[38]  Elizabeth Yuriev,et al.  Identification of preferred carbohydrate binding modes in xenoreactive antibodies by combining conformational filters and binding site maps. , 2010, Glycobiology.

[39]  Carolyn R. Bertozzi,et al.  Essentials of Glycobiology , 1999 .

[40]  F. J. Luque,et al.  Binding site detection and druggability index from first principles. , 2009, Journal of medicinal chemistry.

[41]  Nicolas Moitessier,et al.  Docking Ligands into Flexible and Solvated Macromolecules. 5. Force-Field-Based Prediction of Binding Affinities of Ligands to Proteins , 2009, J. Chem. Inf. Model..

[42]  L. Amzel,et al.  Disaccharide binding to galectin-1: free energy calculations and molecular recognition mechanism. , 2011, Biophysical journal.

[43]  Dacheng Wang,et al.  Structural insights into the recognition mechanism between an antitumor galectin AAL and the Thomsen‐Friedenreich antigen , 2010, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[44]  Martin Frank,et al.  Bioinformatics and molecular modeling in glycobiology , 2010, Cellular and Molecular Life Sciences.

[45]  R. Woods,et al.  Involvement of water in carbohydrate-protein binding: concanavalin A revisited. , 2008, Journal of the American Chemical Society.

[46]  L. Wyns,et al.  A Structure of the Complex between Concanavalin A and Methyl-3,6-di-O-(α-D-mannopyranosyl)-α-D-mannopyranoside Reveals Two Binding Modes* , 1996, The Journal of Biological Chemistry.

[47]  Nicolas Moitessier,et al.  Docking Ligands into Flexible and Solvated Macromolecules. 4. Are Popular Scoring Functions Accurate for this Class of Proteins? , 2009, J. Chem. Inf. Model..

[48]  Robert Abel,et al.  Motifs for molecular recognition exploiting hydrophobic enclosure in protein–ligand binding , 2007, Proceedings of the National Academy of Sciences.

[49]  D S Goodsell,et al.  Automated docking of flexible ligands: Applications of autodock , 1996, Journal of molecular recognition : JMR.

[50]  Beat Ernst,et al.  From carbohydrate leads to glycomimetic drugs , 2009, Nature Reviews Drug Discovery.

[51]  F. Javier Luque,et al.  Molecular simulation methods in drug discovery: a prospective outlook , 2011, Journal of Computer-Aided Molecular Design.

[52]  H. Berendsen,et al.  Molecular dynamics with coupling to an external bath , 1984 .

[53]  Matthew P. Repasky,et al.  Glide: a new approach for rapid, accurate docking and scoring. 1. Method and assessment of docking accuracy. , 2004, Journal of medicinal chemistry.

[54]  Sushil Kumar Mishra,et al.  In Silico Mutagenesis and Docking Study of Ralstonia solanacearum RSL Lectin: Performance of Docking Software To Predict Saccharide Binding , 2012, J. Chem. Inf. Model..

[55]  Natasja Brooijmans,et al.  Molecular recognition and docking algorithms. , 2003, Annual review of biophysics and biomolecular structure.

[56]  Julien Michel,et al.  Prediction of the water content in protein binding sites. , 2009, The journal of physical chemistry. B.

[57]  B. Seaton,et al.  Contributions of Phenylalanine 335 to Ligand Recognition by Human Surfactant Protein D , 2006, Journal of Biological Chemistry.

[58]  Martin Frank,et al.  EUROCarbDB: An open-access platform for glycoinformatics , 2010, Glycobiology.

[59]  Julien Michel,et al.  Effects of Water Placement on Predictions of Binding Affinities for p38α MAP Kinase Inhibitors. , 2010, Journal of chemical theory and computation.

[60]  L M Amzel,et al.  Structure-based drug design. , 1998, Current opinion in biotechnology.

[61]  N. Vermeulen,et al.  The role of water molecules in computational drug design. , 2010, Current topics in medicinal chemistry.

[62]  Maureen E. Taylor,et al.  Engineered carbohydrate-recognition domains for glycoproteomic analysis of cell surface glycosylation and ligands for glycan-binding receptors. , 2010, Methods in enzymology.

[63]  Irwin D. Kuntz,et al.  Development and validation of a modular, extensible docking program: DOCK 5 , 2006, J. Comput. Aided Mol. Des..

[64]  J. Irwin,et al.  Lead discovery using molecular docking. , 2002, Current opinion in chemical biology.

[65]  H. Gabius,et al.  Thermodynamic binding studies of bivalent oligosaccharides to galectin-1, galectin-3, and the carbohydrate recognition domain of galectin-3. , 2004, Glycobiology.

[66]  S. Barondes,et al.  X-ray crystal structure of the human galectin-3 carbohydrate recognition domain at 2.1-A resolution. , 1998, The Journal of biological chemistry.

[67]  Marcelo A Martí,et al.  Carbohydrate-binding proteins: Dissecting ligand structures through solvent environment occupancy. , 2009, The journal of physical chemistry. B.

[68]  Dirk Neumann,et al.  BALLDock/SLICK: A New Method for Protein-Carbohydrate Docking , 2008, J. Chem. Inf. Model..

[69]  A. Leach,et al.  Prediction of Protein—Ligand Interactions. Docking and Scoring: Successes and Gaps , 2006 .

[70]  David S. Goodsell,et al.  A semiempirical free energy force field with charge‐based desolvation , 2007, J. Comput. Chem..

[71]  Elizabeth Yuriev,et al.  A Computational Approach for Exploring Carbohydrate Recognition by Lectins in Innate Immunity , 2011, Front. Immun..

[72]  W. Weis,et al.  Structural Basis for Langerin Recognition of Diverse Pathogen and Mammalian Glycans through a Single Binding Site , 2011, Journal of molecular biology.

[73]  Li Li,et al.  RDOCK: Refinement of rigid‐body protein docking predictions , 2003, Proteins.