GLASS: a comprehensive database for experimentally validated GPCR-ligand associations

MOTIVATION G protein-coupled receptors (GPCRs) are probably the most attractive drug target membrane proteins, which constitute nearly half of drug targets in the contemporary drug discovery industry. While the majority of drug discovery studies employ existing GPCR and ligand interactions to identify new compounds, there remains a shortage of specific databases with precisely annotated GPCR-ligand associations. RESULTS We have developed a new database, GLASS, which aims to provide a comprehensive, manually curated resource for experimentally validated GPCR-ligand associations. A new text-mining algorithm was proposed to collect GPCR-ligand interactions from the biomedical literature, which is then crosschecked with five primary pharmacological datasets, to enhance the coverage and accuracy of GPCR-ligand association data identifications. A special architecture has been designed to allow users for making homologous ligand search with flexible bioactivity parameters. The current database contains ∼500 000 unique entries, of which the vast majority stems from ligand associations with rhodopsin- and secretin-like receptors. The GLASS database should find its most useful application in various in silico GPCR screening and functional annotation studies. AVAILABILITY AND IMPLEMENTATION The website of GLASS database is freely available at http://zhanglab.ccmb.med.umich.edu/GLASS/. CONTACT zhng@umich.edu SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

[1]  Dragomir R. Radev,et al.  Semi-Supervised Classification for Extracting Protein Interaction Sentences using Dependency Parsing , 2007, EMNLP.

[2]  Satoshi Niijima,et al.  GLIDA: GPCR—ligand database for chemical genomics drug discovery—database and tools update , 2007, Nucleic Acids Res..

[3]  John P. Overington,et al.  ChEMBL: a large-scale bioactivity database for drug discovery , 2011, Nucleic Acids Res..

[4]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[5]  Nathanael Weill,et al.  Development and Validation of a Novel Protein-Ligand Fingerprint To Mine Chemogenomic Space: Application to G Protein-Coupled Receptors and Their Ligands , 2009, J. Chem. Inf. Model..

[6]  T. Tatusova,et al.  Entrez Gene: gene-centered information at NCBI , 2010, Nucleic Acids Res..

[7]  Bas Vroling,et al.  GPCRdb: an information system for G protein-coupled receptors , 2015, Nucleic Acids Res..

[8]  Dragomir R. Radev,et al.  Supervised Classification for Extracting Biomedical Events , 2009, BioNLP@HLT-NAACL.

[9]  A. IJzerman,et al.  TinyGRAP database: a bioinformatics tool to mine G-protein-coupled receptor mutant data. , 1999, Trends in pharmacological sciences.

[10]  J. Gutkind,et al.  G-protein-coupled receptors and cancer , 2007, Nature Reviews Cancer.

[11]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[12]  Elena Marchiori,et al.  Gaussian interaction profile kernels for predicting drug-target interaction , 2011, Bioinform..

[13]  Michele Magrane,et al.  UniProt Knowledgebase: a hub of integrated protein data , 2011, Database J. Biol. Databases Curation.

[14]  Jeffrey Skolnick,et al.  FINDSITE(X): a structure-based, small molecule virtual screening approach with application to all identified human GPCRs. , 2012, Molecular pharmaceutics.

[15]  Bas Vroling,et al.  GPCRdb: an information system for G protein-coupled receptors , 2016, Nucleic acids research.

[16]  R. Stevens,et al.  High-resolution crystal structure of an engineered human beta2-adrenergic G protein-coupled receptor. , 2007, Science.

[17]  Chris Morley,et al.  Open Babel: An open chemical toolbox , 2011, J. Cheminformatics.

[18]  John P. Overington,et al.  How many drug targets are there? , 2006, Nature Reviews Drug Discovery.

[19]  Adam D. Schuyler,et al.  SciMiner: web-based literature mining tool for target identification and functional enrichment analysis , 2009, Bioinform..

[20]  Bartek Wilczynski,et al.  Biopython: freely available Python tools for computational molecular biology and bioinformatics , 2009, Bioinform..

[21]  Yang Zhang,et al.  GPCRRD: G protein-coupled receptor spatial restraint database for 3D structure modeling and function annotation , 2010, Bioinform..

[22]  Gerhard Hessler,et al.  Drug Design Strategies for Targeting G‐Protein‐Coupled Receptors , 2002, Chembiochem : a European journal of chemical biology.

[23]  David S. Wishart,et al.  DrugBank 3.0: a comprehensive resource for ‘Omics’ research on drugs , 2010, Nucleic Acids Res..

[24]  Xin Wen,et al.  BindingDB: a web-accessible database of experimentally determined protein–ligand binding affinities , 2006, Nucleic Acids Res..

[25]  Claudio N. Cavasotto,et al.  Ligand and Decoy Sets for Docking to G Protein-Coupled Receptors , 2012, J. Chem. Inf. Model..

[26]  George Khelashvili,et al.  GPCR-OKB: the G Protein Coupled Receptor Oligomer Knowledge Base , 2010, Bioinform..

[27]  Peter Ertl,et al.  JSME: a free molecule editor in JavaScript , 2013, Journal of Cheminformatics.

[28]  Feng Xu,et al.  Therapeutic target database update 2014: a resource for targeted therapeutics , 2013, Nucleic Acids Res..

[29]  F. Lombardo,et al.  Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. , 2001, Advanced drug delivery reviews.

[30]  Elspeth A. Bruford,et al.  Genenames.org: the HGNC resources in 2015 , 2014, Nucleic Acids Res..

[31]  Ulf Leser,et al.  ChemSpot: a hybrid system for chemical named entity recognition , 2012, Bioinform..

[32]  Joanna L. Sharman,et al.  IUPHAR-DB: new receptors and tools for easy searching and visualization of pharmacological data , 2010, Nucleic Acids Res..