Ranking Enzyme Structures in the PDB by Bound Ligand Similarity to Biological Substrates

Summary

Many applications use structures of protein-ligand complexes from the Protein Data Bank (PDB), including 3D pharmacophore identification, virtual screening, and fragment-based drug design. The structures underlying these applications are potentially much more informative if the bound ligands are biologically relevant, that is, highly similar to the cognate ligands. We present a study of ligand-enzyme complexes that compares bound and cognate ligands, enabling the best matches to be identified. We calculate molecular similarity scores using a method called PARITY (proportion of atoms residing in identical topology); the per-molecule scores can conveniently be combined to give a single similarity score for all cognate reactants or products in the reaction. From these scores we generate a rank-ordered list of related PDB structures, ordered by the biological similarity of the ligands bound in them.
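The ranking step described above can be illustrated with a minimal sketch. It assumes the pairwise similarity scores between bound and cognate molecules have already been computed and lie in [0, 1]. The summary does not state how per-molecule scores are combined, so the rule used here (take the best-matching bound ligand for each cognate molecule, then average over cognates) is an assumption, not the published procedure. All function names, entry identifiers, ligand codes, and score values are hypothetical.

```python
from statistics import mean


def entry_score(pairwise_scores, cognates):
    """Combine pairwise scores into one score for a single structure.

    pairwise_scores maps (bound_ligand, cognate) -> similarity in [0, 1].
    ASSUMPTION: for each cognate molecule we keep the best-matching bound
    ligand (0.0 if nothing matches) and average over all cognates.
    """
    best_per_cognate = [
        max(
            (score for (_, cog), score in pairwise_scores.items() if cog == cognate),
            default=0.0,
        )
        for cognate in cognates
    ]
    return mean(best_per_cognate)


def rank_entries(entries, cognates):
    """Return (entry_id, score) pairs sorted from most to least similar."""
    scored = [(entry_id, entry_score(scores, cognates)) for entry_id, scores in entries.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical reaction with two cognate reactants and three related structures.
cognates = ["substrate_1", "substrate_2"]
entries = {
    "entry_A": {("LIG1", "substrate_1"): 1.0, ("LIG2", "substrate_2"): 0.5},
    "entry_B": {("LIG3", "substrate_1"): 1.0},
    "entry_C": {
        ("LIG4", "substrate_1"): 0.25,
        ("LIG5", "substrate_1"): 0.5,
        ("LIG4", "substrate_2"): 0.25,
    },
}

ranking = rank_entries(entries, cognates)
for entry_id, score in ranking:
    print(entry_id, score)
```

Averaging over cognates penalises a structure that binds a perfect analogue of one reactant but nothing resembling the other; a different combination rule would change the ordering.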
