Chemotext: A Publicly Available Web Server for Mining Drug-Target-Disease Relationships in PubMed

Elucidation of the mechanistic relationships between drugs, their targets, and diseases is at the core of modern drug discovery research. Thousands of studies relevant to the drug-target-disease (DTD) triangle have been published and annotated in the Medline/PubMed database. Mining this database affords rapid identification of all published studies that confirm connections between vertices of this triangle or enable new inferences of such connections. To this end, we describe the development of Chemotext, a publicly available Web server that mines the entire compendium of published literature in PubMed annotated by Medline Subject Heading (MeSH) terms. The goal of Chemotext is to identify all known DTD relationships and infer missing links between vertices of the DTD triangle. As a proof-of-concept, we show that Chemotext could be instrumental in generating new drug repurposing hypotheses or annotating clinical outcomes pathways for known drugs. The Chemotext Web server is freely available at http://chemotext.mml.unc.edu .

[1]  Zhiyong Lu,et al.  PubTator: a web-based text mining tool for assisting biocuration , 2013, Nucleic Acids Res..

[2]  J. Fletcher,et al.  KIT oncogenic signaling mechanisms in imatinib-resistant gastrointestinal stromal tumor: PI3-kinase/AKT is a crucial survival pathway , 2007, Oncogene.

[3]  A. Marchevsky,et al.  Presence of c-KIT-positive mast cells in obliterative bronchiolitis from diverse causes. , 2009, Archives of pathology & laboratory medicine.

[4]  Nicola Nosengo Can you teach old drugs new tricks? , 2016, Nature.

[5]  Gang Fu,et al.  PubChem Substance and Compound databases , 2015, Nucleic Acids Res..

[6]  Bradley M. Hemminger,et al.  Mining connections between chemicals, proteins, and diseases extracted from Medline annotations , 2010, J. Biomed. Informatics.

[7]  R. J. Roberts PubMed Central: The GenBank of the published literature. , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[8]  Yanli Wang,et al.  PubChem BioAssay: 2014 update , 2013, Nucleic Acids Res..

[9]  D. Tang,et al.  Role of Abl in airway hyperresponsiveness and airway remodeling , 2013, Respiratory Research.

[10]  D. Swanson Fish Oil, Raynaud's Syndrome, and Undiscovered Public Knowledge , 2015, Perspectives in biology and medicine.

[11]  John P. Overington,et al.  ChEMBL: a large-scale bioactivity database for drug discovery , 2011, Nucleic Acids Res..

[12]  L. Reber,et al.  Stem cell factor and its receptor c-Kit as targets for inflammatory diseases. , 2006, European journal of pharmacology.

[13]  Alexander Tropsha,et al.  Expanding the scope of drug repurposing in pediatrics: the Children's Pharmacy Collaborative. , 2014, Drug discovery today.

[14]  A. D. Van den Abbeele,et al.  Kinase mutations and imatinib response in patients with metastatic gastrointestinal stromal tumor. , 2003, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[15]  Alexander Tropsha,et al.  QSAR models of human data can enrich or replace LLNA testing for human skin sensitization. , 2016, Green chemistry : an international journal and green chemistry resource : GC.

[16]  Nancy C Baker,et al.  Drug Side Effect Profiles as Molecular Descriptors for Predictive Modeling of Target Bioactivity , 2015, Molecular informatics.

[17]  Alexander Tropsha,et al.  Chembench: A Publicly Accessible, Integrated Cheminformatics Portal , 2017, J. Chem. Inf. Model..

[18]  Stephen Frye,et al.  US academic drug discovery , 2011, Nature Reviews Drug Discovery.

[19]  Jyoti Rani,et al.  pubmed.mineR: An R package with text-mining algorithms to analyse PubMed abstracts , 2015, Journal of Biosciences.

[20]  Jing Zhou,et al.  MeSHSim: An R/Bioconductor package for measuring semantic similarity over MeSH headings and MEDLINE documents , 2015, 2015 34th Chinese Control Conference (CCC).

[21]  D. Swanson Migraine and Magnesium: Eleven Neglected Connections , 2015, Perspectives in biology and medicine.