Virtual screening of GPCRs: An in silico chemogenomics approach

BackgroundThe G-protein coupled receptor (GPCR) superfamily is currently the largest class of therapeutic targets. In silico prediction of interactions between GPCRs and small molecules in the transmembrane ligand-binding site is therefore a crucial step in the drug discovery process, which remains a daunting task due to the difficulty to characterize the 3D structure of most GPCRs, and to the limited amount of known ligands for some members of the superfamily. Chemogenomics, which attempts to characterize interactions between all members of a target class and all small molecules simultaneously, has recently been proposed as an interesting alternative to traditional docking or ligand-based virtual screening strategies.ResultsWe show that interaction prediction in the chemogenomics framework outperforms state-of-the-art individual ligand-based methods in accuracy both for receptor with known ligands and without known ligands. This is done with no knowledge of the receptor 3D structure. In particular we are able to predict ligands of orphan GPCRs with an estimated accuracy of 78.1%.ConclusionWe propose new methods for in silico chemogenomics and validate them on the virtual screening of GPCRs. The methods represent an extension of a recently proposed machine learning strategy, based on support vector machines (SVM), which provides a flexible framework to incorporate various information sources on the biological space of targets and on the chemical space of small molecules. We investigate the use of 2D and 3D descriptors for small molecules, and test a variety of descriptors for GPCRs. We show that incorporating information about the known hierarchical classification of the target family and about key residues in their inferred binding pockets significantly improves the prediction accuracy of our model.

[1]  Thomas Gärtner,et al.  On Graph Kernels: Hardness Results and Efficient Alternatives , 2003, COLT.

[2]  Jean-Philippe Vert,et al.  Local Alignment Kernels for Biological Sequences , 2004 .

[3]  Claudio N. Cavasotto,et al.  Discovery of novel chemotypes to a G-protein-coupled receptor through ligand-steered homology modeling and structure-based virtual screening. , 2008, Journal of medicinal chemistry.

[4]  Ke Wang,et al.  Profile-based string kernels for remote homology detection and motif extraction , 2004, Proceedings. 2004 IEEE Computational Systems Bioinformatics Conference, 2004. CSB 2004..

[5]  Pierre Acklin,et al.  Similarity Metrics for Ligands Reflecting the Similarity of the Target Proteins , 2003, J. Chem. Inf. Comput. Sci..

[6]  Arun K Shukla,et al.  A crystal clear view of the β2-adrenergic receptor , 2008, Nature Biotechnology.

[7]  Stephen R. Johnson,et al.  Molecular properties that influence the oral bioavailability of drug candidates. , 2002, Journal of medicinal chemistry.

[8]  J. Bockaert,et al.  Molecular tinkering of G protein‐coupled receptors: an evolutionary success , 1999, The EMBO journal.

[9]  Jean-Philippe Vert,et al.  The context-tree kernel for strings , 2005, Neural Networks.

[10]  Pierre Baldi,et al.  Graph kernels for chemical informatics , 2005, Neural Networks.

[11]  Marcus Elstner,et al.  The retinal conformation and its environment in rhodopsin in light of a new 2.2 A crystal structure. , 2004, Journal of molecular biology.

[12]  R. Stevens,et al.  High-resolution crystal structure of an engineered human beta2-adrenergic G protein-coupled receptor. , 2007, Science.

[13]  Arthur Christopoulos,et al.  Critical Role for the Second Extracellular Loop in the Binding of Both Orthosteric and Allosteric G Protein-coupled Receptor Ligands* , 2007, Journal of Biological Chemistry.

[14]  J J Baldwin,et al.  Prediction of drug absorption using multivariate statistics. , 2000, Journal of medicinal chemistry.

[15]  Junmei Wang,et al.  GPCR Structure-Based Virtual Screening Approach for CB2 Antagonist Search , 2007, J. Chem. Inf. Model..

[16]  Øyvind Edvardsen,et al.  A database of mutants and effects of site‐directed mutagenesis experiments on G protein‐coupled receptors , 1996, Proteins.

[17]  H. Manji,et al.  G protein-coupled receptors in major psychiatric disorders. , 2007, Biochimica et biophysica acta.

[18]  Jean-Philippe Vert A tree kernel to analyze phylog enetic profi les , 2002 .

[19]  O. Civelli,et al.  Orphan G protein‐coupled receptors: targets for new therapeutic interventions , 2004, Annals of medicine.

[20]  Jean-Philippe Vert,et al.  Efficient peptide-MHC-I binding prediction for alleles with few known binders , 2008, Bioinform..

[21]  Jean-Philippe Vert,et al.  The Pharmacophore Kernel for Virtual Screening with Support Vector Machines , 2006, J. Chem. Inf. Model..

[22]  J. Gasteiger,et al.  Chemoinformatics: A Textbook , 2003 .

[23]  Yoshua Bengio,et al.  Collaborative Filtering on a Family of Biological Targets , 2006, J. Chem. Inf. Model..

[24]  H. Kashima,et al.  Kernels for graphs , 2004 .

[25]  Didier Rognan,et al.  Protein‐based virtual screening of chemical databases. II. Are homology models of g‐protein coupled receptors suitable targets? , 2002, Proteins.

[26]  Claudio N. Cavasotto,et al.  Structure‐based identification of binding sites, native ligands and potential inhibitors for G‐protein coupled receptors , 2003, Proteins.

[27]  Jonas Boström,et al.  Reproducing the conformations of protein-bound ligands: A critical evaluation of several popular conformational searching tools , 2001, J. Comput. Aided Mol. Des..

[28]  Hisashi Kashima,et al.  Marginalized Kernels Between Labeled Graphs , 2003, ICML.

[29]  Francis R. Bach,et al.  A New Approach to Collaborative Filtering: Operator Estimation with Spectral Regularization , 2008, J. Mach. Learn. Res..

[30]  S. Hill,et al.  G‐protein‐coupled receptors: past, present and future , 2006, British journal of pharmacology.

[31]  Øyvind Edvardsen,et al.  GPCRDB: information system for G protein-coupled receptors , 2010, Nucleic Acids Res..

[32]  Tatsuya Akutsu,et al.  Protein homology detection using string alignment kernels , 2004, Bioinform..

[33]  Gert Vriend,et al.  GPCRDB information system for G protein-coupled receptors , 2003, Nucleic Acids Res..

[34]  David A. Gough,et al.  Predicting protein-protein interactions from primary structure , 2001, Bioinform..

[35]  P. Dobson,et al.  Predicting enzyme class from protein structure without alignments. , 2005, Journal of molecular biology.

[36]  Edwin V. Bonilla,et al.  Multi-task Gaussian Process Prediction , 2007, NIPS.

[37]  D. Rognan Chemogenomic approaches to rational drug design , 2007, British journal of pharmacology.

[38]  J. Caldwell,et al.  An Introduction to Drug Disposition: The Basic Principles of Absorption, Distribution, Metabolism, and Excretion , 1995, Toxicologic pathology.

[39]  P. Charifson,et al.  Conformational analysis of drug-like molecules bound to proteins: an extensive study of ligand reorganization upon binding. , 2004, Journal of medicinal chemistry.

[40]  Bernhard Schölkopf,et al.  Kernel Methods in Computational Biology , 2005 .

[41]  Jean-Philippe Vert,et al.  A tree kernel to analyse phylogenetic profiles , 2002, ISMB.

[42]  Martin Ebeling,et al.  An Automated System for the Analysis of G Protein-Coupled Receptor Transmembrane Binding Pockets: Alignment, Receptor-Based Pharmacophores, and Their Application , 2005, J. Chem. Inf. Model..

[43]  David A. Gough,et al.  Virtual Screen for Ligands of Orphan G Protein-Coupled Receptors , 2005, J. Chem. Inf. Model..

[44]  Hugo Kubinyi,et al.  Chemogenomics in Drug Discovery: A Medicinal Chemistry Perspective , 2004 .

[45]  B B Fredholm,et al.  G‐protein‐coupled receptors: an update , 2007, Acta physiologica.

[46]  Evi Kostenis,et al.  A physicogenetic method to assign ligand-binding relationships between 7TM receptors. , 2005, Bioorganic & medicinal chemistry letters.

[47]  Peteris Prusis,et al.  Unbiased descriptor and parameter selection confirms the potential of proteochemometric modelling , 2005, BMC Bioinformatics.

[48]  William Stafford Noble,et al.  A structural alignment kernel for protein structures , 2007, Bioinform..

[49]  Massimiliano Pontil,et al.  Multi-Task Feature Learning , 2006, NIPS.

[50]  A. Hopkins,et al.  The druggable genome , 2002, Nature Reviews Drug Discovery.

[51]  Peteris Prusis,et al.  Improved approach for proteochemometrics modeling: application to organic compound - amine G protein-coupled receptor interactions , 2005, Bioinform..

[52]  Shay Bar-Haim,et al.  G protein-coupled receptors: in silico drug discovery in 3D. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[53]  Y. Martin,et al.  A bioavailability score. , 2005, Journal of medicinal chemistry.

[54]  Francis R. Bach,et al.  Low-rank matrix factorization with attributes , 2006, ArXiv.

[55]  Yasushi Okuno,et al.  GLIDA: GPCR-ligand database for chemical genomic drug discovery , 2005, Nucleic Acids Res..

[56]  K. Wanner,et al.  Methods and Principles in Medicinal Chemistry , 2007 .

[57]  Charles A. Micchelli,et al.  Learning Multiple Tasks with Kernel Methods , 2005, J. Mach. Learn. Res..

[58]  Rodrigo Lopez,et al.  Multiple sequence alignment with the Clustal series of programs , 2003, Nucleic Acids Res..

[59]  D. Horvath,et al.  G-protein-coupled receptor affinity prediction based on the use of a profiling dataset: QSAR design, synthesis, and experimental validation. , 2005, Journal of medicinal chemistry.

[60]  S. Henikoff,et al.  Amino acid substitution matrices from protein blocks. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[61]  Ruedi Stoop,et al.  An Ontology for Pharmaceutical Ligands and Its Application for in Silico Screening and Library Design , 2002, J. Chem. Inf. Comput. Sci..

[62]  Thomas Klabunde Chemogenomics Approaches to Ligand Design , 2006 .

[63]  T. Klabunde,et al.  Structure-based drug discovery using GPCR homology modeling: successful virtual screening for antagonists of the alpha1A adrenergic receptor. , 2005, Journal of medicinal chemistry.

[64]  K Schulten,et al.  VMD: visual molecular dynamics. , 1996, Journal of molecular graphics.

[65]  Jean-Philippe Vert,et al.  Protein-ligand interaction prediction: an improved chemogenomics approach , 2008, Bioinform..

[66]  T. Klabunde Chemogenomic approaches to drug discovery: similar receptors bind similar ligands , 2007, British journal of pharmacology.

[67]  Pierre Baldi,et al.  One- to Four-Dimensional Kernels for Virtual Screening and the Prediction of Physical, Chemical, and Biological Properties , 2007, J. Chem. Inf. Model..

[68]  Eleazar Eskin,et al.  The Spectrum Kernel: A String Kernel for SVM Protein Classification , 2001, Pacific Symposium on Biocomputing.

[69]  F. Lombardo,et al.  Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. , 2001, Advanced drug delivery reviews.

[70]  David Haussler,et al.  A Discriminative Framework for Detecting Remote Protein Homologies , 2000, J. Comput. Biol..

[71]  Krzysztof Palczewski,et al.  Sequence analyses of G-protein-coupled receptors: similarities to rhodopsin. , 2003, Biochemistry.

[72]  B. Kobilka G protein coupled receptor structure and activation. , 2007, Biochimica et biophysica acta.

[73]  Tatsuya Akutsu,et al.  Graph Kernels for Molecular Structure-Activity Relationship Analysis with Support Vector Machines , 2005, J. Chem. Inf. Model..

[74]  Leonardo Pardo,et al.  Structural models of class a G protein-coupled receptors as a tool for drug design: insights on transmembrane bundle plasticity. , 2007, Current topics in medicinal chemistry.

[75]  Konstantin V. Balakin,et al.  Property-Based Design of GPCR-Targeted Library , 2002, J. Chem. Inf. Comput. Sci..

[76]  Didier Rognan,et al.  A chemogenomic analysis of the transmembrane binding cavity of human G‐protein‐coupled receptors , 2005, Proteins.

[77]  Maya Topf,et al.  PREDICT modeling and in‐silico screening for G‐protein coupled receptors , 2004, Proteins.

[78]  Jonas Boström,et al.  Assessing the performance of OMEGA with respect to retrieving bioactive conformations. , 2003, Journal of molecular graphics & modelling.

[79]  Hans-Peter Kriegel,et al.  Protein function prediction via graph kernels , 2005, ISMB.

[80]  Jason Weston,et al.  Mismatch string kernels for discriminative protein classification , 2004, Bioinform..

[81]  D. Deshpande,et al.  Targeting G protein-coupled receptor signaling in asthma. , 2006, Cellular signalling.

[82]  Kiyoshi Asai,et al.  Marginalized kernels for biological sequences , 2002, ISMB.

[83]  Xavier Deupi,et al.  Coupling ligand structure to specific conformational switches in the β2-adrenoceptor , 2006, Nature chemical biology.

[84]  Roberto Todeschini,et al.  Handbook of Molecular Descriptors , 2002 .

[85]  G. Barton,et al.  Multiple protein sequence alignment from tertiary structure comparison: Assignment of global and residue confidence levels , 1992, Proteins.