AutoSite: an automated approach for pseudo-ligands prediction - from ligand-binding sites identification to predicting key ligand atoms

MOTIVATION The identification of ligand-binding sites from a protein structure facilitates computational drug design and optimization, and protein function assignment. We introduce AutoSite: an efficient software tool for identifying ligand-binding sites and predicting pseudo ligand corresponding to each binding site identified. Binding sites are reported as clusters of 3D points called fills in which every point is labelled as hydrophobic or as hydrogen bond donor or acceptor. From these fills AutoSite derives feature points: a set of putative positions of hydrophobic-, and hydrogen-bond forming ligand atoms. RESULTS We show that AutoSite identifies ligand-binding sites with higher accuracy than other leading methods, and produces fills that better matches the ligand shape and properties, than the fills obtained with a software program with similar capabilities, AutoLigand In addition, we demonstrate that for the Astex Diverse Set, the feature points identify 79% of hydrophobic ligand atoms, and 81% and 62% of the hydrogen acceptor and donor hydrogen ligand atoms interacting with the receptor, and predict 81.2% of water molecules mediating interactions between ligand and receptor. Finally, we illustrate potential uses of the predicted feature points in the context of lead optimization in drug discovery projects. AVAILABILITY AND IMPLEMENTATION http://adfr.scripps.edu/AutoDockFR/autosite.html CONTACT: sanner@scripps.eduSupplementary information: Supplementary data are available at Bioinformatics online.

[1]  R. Laskowski SURFNET: a program for visualizing molecular surfaces, cavities, and intermolecular interactions. , 1995, Journal of molecular graphics.

[2]  P. Jaccard Distribution de la flore alpine dans le bassin des Dranses et dans quelques régions voisines , 1901 .

[3]  David S. Goodsell,et al.  AutoDockFR: Advances in Protein-Ligand Docking with Explicitly Specified Binding Site Flexibility , 2015, PLoS Comput. Biol..

[4]  F. Sommen,et al.  Fragment-based discovery of type I inhibitors of maternal embryonic leucine zipper kinase. , 2015, ACS medicinal chemistry letters.

[5]  Roberto Sanchez,et al.  Beyond structural genomics: computational approaches for the identification of ligand binding sites in protein structures , 2011, Journal of Structural and Functional Genomics.

[6]  M Hendlich,et al.  LIGSITE: automatic and efficient detection of potential small molecule-binding sites in proteins. , 1997, Journal of molecular graphics & modelling.

[7]  Petra Schneider,et al.  Inhibitors of Helicobacter pylori Protease HtrA Found by ‘Virtual Ligand’ Screening Combat Bacterial Invasion of Epithelia , 2011, PloS one.

[8]  J. Deisenhofer,et al.  Structural Mechanism for Statin Inhibition of HMG-CoA Reductase , 2001, Science.

[9]  S. Ōmura,et al.  High-resolution structures of a chitinase complexed with natural product cyclopentapeptide inhibitors: Mimicry of carbohydrate substrate , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[10]  M. Sanner,et al.  Reduced surface: an efficient way to compute molecular surfaces. , 1996, Biopolymers.

[11]  Gabriele Cruciani,et al.  A Common Reference Framework for Analyzing/Comparing Proteins and Ligands. Fingerprints for Ligands And Proteins (FLAP): Theory and Application , 2007, J. Chem. Inf. Model..

[12]  R. Abagyan,et al.  Comprehensive identification of "druggable" protein ligand binding sites. , 2004, Genome informatics. International Conference on Genome Informatics.

[13]  Paul N. Mortenson,et al.  Diverse, high-quality test set for the validation of protein-ligand docking performance. , 2007, Journal of medicinal chemistry.

[14]  David S. Goodsell,et al.  AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility , 2009, J. Comput. Chem..

[15]  Penny J. Beuning,et al.  Biochemical functional predictions for protein structures of unknown or uncertain function , 2015, Computational and structural biotechnology journal.

[16]  Thomas A. Halgren,et al.  Identifying and Characterizing Binding Sites and Assessing Druggability , 2009, J. Chem. Inf. Model..

[17]  Jie Liang,et al.  CASTp: computed atlas of surface topography of proteins with structural and topographical mapping of functionally annotated residues , 2006, Nucleic Acids Res..

[18]  Dario Ghersi,et al.  SITEHOUND-web: a server for ligand binding site identification in protein structures , 2009, Nucleic Acids Res..

[19]  William J. Allen,et al.  DOCK 6: Impact of new features and current docking performance , 2015, J. Comput. Chem..

[20]  J. Skolnick,et al.  A threading-based method (FINDSITE) for ligand-binding site prediction and functional annotation , 2008, Proceedings of the National Academy of Sciences.

[21]  David S. Goodsell,et al.  A semiempirical free energy force field with charge‐based desolvation , 2007, J. Comput. Chem..

[22]  G Vriend,et al.  WHAT IF: a molecular modeling and drug design program. , 1990, Journal of molecular graphics.

[23]  Stéphanie Pérot,et al.  Druggable pockets and binding site centric chemical space: a paradigm shift in drug discovery. , 2010, Drug discovery today.

[24]  Robert B Russell,et al.  Finding functional sites in structural genomics proteins. , 2004, Structure.

[25]  Mona Singh,et al.  Predicting Protein Ligand Binding Sites by Combining Evolutionary Sequence Conservation and 3D Structure , 2009, PLoS Comput. Biol..

[26]  Richard M. Jackson,et al.  Q-SiteFinder: an energy-based method for the prediction of protein-ligand binding sites , 2005, Bioinform..

[27]  R. Sanchez,et al.  Improving accuracy and efficiency of blind protein‐ligand docking by focusing on predicted binding sites , 2009, Proteins.

[28]  P. Goodford A computational procedure for determining energetically favorable binding sites on biologically important macromolecules. , 1985, Journal of medicinal chemistry.

[29]  Ajay N. Jain Surflex: fully automatic flexible molecular docking using a molecular similarity-based search engine. , 2003, Journal of medicinal chemistry.

[30]  J. Tainer,et al.  Screening a peptidyl database for potential ligands to proteins with side‐chain flexibility , 1998, Proteins.

[31]  B. Synstad,et al.  Structure-based exploration of cyclic dipeptide chitinase inhibitors. , 2004, Journal of medicinal chemistry.

[32]  D. Goodsell,et al.  Automated prediction of ligand‐binding sites in proteins , 2007, Proteins.

[33]  R. Wade,et al.  Computational approaches to identifying and characterizing protein binding sites for ligand design , 2009, Journal of molecular recognition : JMR.

[34]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[35]  E. Proschak,et al.  Structure‐Based Pharmacophores for Virtual Screening , 2011, Molecular informatics.