CSgator: an integrated web platform for compound set analysis

Drug discovery typically involves investigation of a set of compounds (e.g. drug screening hits) in terms of target, disease, and bioactivity. CSgator is a comprehensive analytic tool for set-wise interpretation of compounds. It has two unique analytic features of Compound Set Enrichment Analysis (CSEA) and Compound Cluster Analysis (CCA), which allows batch analysis of compound set in terms of (i) target, (ii) bioactivity, (iii) disease, and (iv) structure. CSEA and CCA present enriched profiles of targets and bioactivities in a compound set, which leads to novel insights on underlying drug mode-of-action, and potential targets. Notably, we propose a novel concept of ‘Hit Enriched Assays”, i.e. bioassays of which hits are enriched among a given set of compounds. As an example, we show its utility in revealing drug mode-of-action or identifying hidden targets for anti-lymphangiogenesis screening hits. CSgator is available at http://csgator.ewha.ac.kr, and most analytic results are downloadable.

[1]  T. Tatusova,et al.  Entrez Gene: gene-centered information at NCBI , 2006, Nucleic Acids Res..

[2]  Xin Chen,et al.  DCDB 2.0: a major update of the drug combination database , 2014, Database J. Biol. Databases Curation.

[3]  Yang Zhang,et al.  GLASS: a comprehensive database for experimentally validated GPCR-ligand associations , 2015, Bioinform..

[4]  Margaret Miller MEDLINE: A Guide to Effective Searching in PubMed and Other Interfaces, 2nd Edition , 2007 .

[5]  Minoru Kanehisa,et al.  KEGG: new perspectives on genomes, pathways, diseases and drugs , 2016, Nucleic Acids Res..

[6]  Jae U. Jung,et al.  Lymphatic reprogramming by Kaposi sarcoma herpes virus promotes the oncogenic activity of the virus-encoded G-protein-coupled receptor. , 2012, Cancer research.

[7]  David S. Wishart,et al.  DrugBank 4.0: shedding new light on drug metabolism , 2013, Nucleic Acids Res..

[8]  Gang Feng,et al.  Disease Ontology: a backbone for disease semantic integration , 2011, Nucleic Acids Res..

[9]  X. Chen,et al.  TTD: Therapeutic Target Database , 2002, Nucleic Acids Res..

[10]  A. Ruddell,et al.  The Lymphatic Endothelial mCLCA1 Antibody Induces Proliferation and Growth of Lymph Node Lymphatic Sinuses , 2016, PloS one.

[11]  Michael K. Gilson,et al.  BindingDB in 2015: A public database for medicinal chemistry, computational chemistry and systems pharmacology , 2015, Nucleic Acids Res..

[12]  D. Bojanic,et al.  Impact of high-throughput screening in biomedical research , 2011, Nature Reviews Drug Discovery.

[13]  B. Roth,et al.  The Multiplicity of Serotonin Receptors: Uselessly Diverse Molecules or an Embarrassment of Riches? , 2000 .

[14]  D. Carr,et al.  VEGF-A expression by HSV-1–infected cells drives corneal lymphangiogenesis , 2009, The Journal of experimental medicine.

[15]  Tara N Moore-Medlin,et al.  Anti-lymphangiogenic properties of mTOR inhibitors in head and neck squamous cell carcinoma experimental models , 2013, BMC Cancer.

[16]  Gisbert Schneider,et al.  Phenotype-based high-content chemical library screening identifies statins as inhibitors of in vivo lymphangiogenesis , 2012, Proceedings of the National Academy of Sciences.

[17]  F. She,et al.  TNF-alpha promotes lymphangiogenesis and lymphatic metastasis of gallbladder cancer through the ERK1/2/AP-1/VEGF-D pathway , 2016, BMC Cancer.

[18]  Chris Morley,et al.  Open Babel: An open chemical toolbox , 2011, J. Cheminformatics.

[19]  Juancarlos Chan,et al.  Gene Ontology Consortium: going forward , 2014, Nucleic Acids Res..

[20]  Yanli Wang,et al.  PubChem BioAssay: 2014 update , 2013, Nucleic Acids Res..

[21]  Y. Martin,et al.  Do structurally similar molecules have similar biological activity? , 2002, Journal of medicinal chemistry.

[22]  Richard D. Smith,et al.  Binding MOAD, a high-quality protein–ligand database , 2007, Nucleic Acids Res..

[23]  Michele Magrane,et al.  UniProt Knowledgebase: a hub of integrated protein data , 2011, Database J. Biol. Databases Curation.

[24]  J. Tille,et al.  Lymphangiogenesis and tumor metastasis , 2003, Thrombosis and Haemostasis.

[25]  Jordi Mestres,et al.  Polypharmacology in Precision Oncology: Current Applications and Future 
Prospects , 2016, Current pharmaceutical design.

[26]  Peter Ertl,et al.  Compound Set Enrichment: A Novel Approach to Analysis of Primary HTS Data , 2010, J. Chem. Inf. Model..

[27]  Thomas Girke,et al.  bioassayR: Cross-Target Analysis of Small Molecule Bioactivity , 2016, J. Chem. Inf. Model..

[28]  Deng Pan,et al.  DGIdb 2.0: mining clinically relevant drug–gene interactions , 2015, Nucleic Acids Res..

[29]  Helge G. Roider,et al.  Drug2Gene: an exhaustive resource to explore effectively the drug-target relation network , 2013, BMC Bioinformatics.

[30]  G. V. Paolini,et al.  Quantifying the chemical beauty of drugs. , 2012, Nature chemistry.

[31]  Anne Mai Wassermann,et al.  Public Domain HTS Fingerprints: Design and Evaluation of Compound Bioactivity Profiles from PubChem's Bioassay Repository , 2016, J. Chem. Inf. Model..

[32]  Ryan G. Coleman,et al.  ZINC: A Free Tool to Discover Chemistry for Biology , 2012, J. Chem. Inf. Model..

[33]  S. Stacker,et al.  From anti-angiogenesis to anti-lymphangiogenesis: emerging trends in cancer therapy. , 2008, Lymphatic research and biology.

[34]  Damian Szklarczyk,et al.  STITCH 5: augmenting protein–chemical interaction networks with tissue and affinity data , 2015, Nucleic Acids Res..

[35]  George Papadatos,et al.  The ChEMBL bioactivity database: an update , 2013, Nucleic Acids Res..

[36]  S. Schoppmann,et al.  Thrombocytes Correlate with Lymphangiogenesis in Human Esophageal Cancer and Mediate Growth of Lymphatic Endothelial Cells In Vitro , 2013, PloS one.

[37]  Thomas C. Wiegers,et al.  The Comparative Toxicogenomics Database: update 2017 , 2016, Nucleic Acids Res..

[38]  M. Kahn,et al.  Platelets: covert regulators of lymphatic development. , 2010, Arteriosclerosis, thrombosis, and vascular biology.

[39]  Michael Darsow,et al.  ChEBI: a database and ontology for chemical entities of biological interest , 2007, Nucleic Acids Res..

[40]  Robert B. Russell,et al.  SuperTarget and Matador: resources for exploring drug-target relationships , 2007, Nucleic Acids Res..

[41]  Xiaodong Zhou,et al.  Lymphangiogenesis in Gastric Cancer regulated through Akt/mTOR-VEGF-C/VEGF-D axis , 2015, BMC Cancer.

[42]  Russ B Altman,et al.  PharmGKB: the Pharmacogenomics Knowledge Base. , 2013, Methods in molecular biology.

[43]  Yanli Wang,et al.  Identifying Compound-Target Associations by Combining Bioactivity Profile Similarity Search and Public Databases Mining , 2011, J. Chem. Inf. Model..

[44]  Gang Fu,et al.  PubChem Substance and Compound databases , 2015, Nucleic Acids Res..

[45]  Pablo Tamayo,et al.  Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[46]  M. Skobe,et al.  Lymphangiogenesis and tumor metastasis , 2003, Cell and Tissue Research.

[47]  Joanna L. Sharman,et al.  IUPHAR-DB: updated database content and new features , 2012, Nucleic Acids Res..

[48]  Roberto Sessa,et al.  Lymphangiogenesis: a new player in herpes simplex virus 1‐triggered T‐cell response , 2017, Immunology and cell biology.

[49]  W. Schiemann,et al.  Identification and characterization of regulator of G protein signaling 4 (RGS4) as a novel inhibitor of tubulogenesis: RGS4 inhibits mitogen-activated protein kinases and vascular endothelial growth factor signaling. , 2004, Molecular biology of the cell.

[50]  Xiujin Li,et al.  Vascular endothelial growth factor-D promotes growth, lymphangiogenesis and lymphatic metastasis in gallbladder cancer. , 2012, Cancer letters.