Prioritizing Adverse Drug Reaction and Drug Repositioning Candidates Generated by Literature-Based Discovery

Literature based discovery (LBD) is a well-known paradigm to discover hidden knowledge in scientific literature. By identifying and utilizing reported findings in literature, LBD hypothesizes novel discoveries. Most often, LBD systems generate a long list of potential discoveries and it would be time consuming and expensive to validate all of those discoveries. Preliminary validation or prioritization of the discoveries can improve the significance of LBD systems. In this study, we proposed a method utilizing information surrounding causal findings to prioritize discoveries generated by LBD systems. As a case study, we focused on discovering drug-disease relations, which have potential to identify drug repositioning candidates or adverse drug reactions. Our LBD system used drug-gene and gene-disease semantic predication in SemMedDB as causal findings and Swanson's ABC model to generate potential drug-disease relations. Using sentences, which causal findings extracted from, our ranking method trained a binary classifier to classify generated drug-disease relations into desired classes. We trained and tested our classifier for three different purposes: a) drug repositioning b) adverse drug events c) drug-disease relation detection. The classifier obtained 0.78, 0.86, and 0.83 f-measure respectively for these tasks. The number of causal findings of each hypothesis, which were classified as positive by the classifier, is the main metric for ranking the hypotheses in the proposed method. To evaluate the ranking method, we counted and compared the number of true relations in the top 100 pairs, which were ranked by our method and one of previous methods. Out of 181 true relations in the test dataset, the proposed method ranked 20 of them in top 100 relations while this number was 13 for the other method.

[1]  William M. Pottenger,et al.  Recent Advances in Literature Based Discovery , 2005 .

[2]  Vassilis Virvilis,et al.  Literature mining, ontologies and information visualization for drug repurposing , 2011, Briefings Bioinform..

[3]  James L. Gilbert,et al.  Rebuilding Big Pharma ’ s Business Model , 2022 .

[4]  D. Swanson Fish Oil, Raynaud's Syndrome, and Undiscovered Public Knowledge , 2015, Perspectives in biology and medicine.

[5]  Halil Kilicoglu,et al.  Using the Literature-Based Discovery Paradigm to Investigate Drug Mechanisms , 2007, AMIA.

[6]  Xin Wu,et al.  Drug repositioning by applying 'expression profiles' generated by integrating chemical structure similarity and gene semantic similarity. , 2014, Molecular bioSystems.

[7]  Thomas C. Wiegers,et al.  The Comparative Toxicogenomics Database's 10th year anniversary: update 2015 , 2014, Nucleic Acids Res..

[8]  Murat Dogru,et al.  Ocular Surface and MUC5AC Alterations in Atopic Patients with Corneal Shield Ulcers , 2005, Current eye research.

[9]  Saso Dzeroski,et al.  Supporting Discovery in Medicine by Association Rule Mining in Medline and UMLS , 2001, MedInfo.

[10]  Simon M Lin,et al.  Opportunities for drug repositioning from phenome-wide association studies , 2015, Nature Biotechnology.

[11]  N R Smalheiser,et al.  Using ARROWSMITH: a computer-assisted approach to formulating and assessing scientific hypotheses. , 1998, Computer methods and programs in biomedicine.

[12]  Hua Xu,et al.  Identifying Plausible Adverse Drug Reactions Using Knowledge Extracted from the Literature , 2014, AMIA.

[13]  C. Ferreira,et al.  Making PubMed searching simple: learning to retrieve medical literature through interactive problem solving. , 2006, The oncologist.

[14]  D. Swanson Migraine and Magnesium: Eleven Neglected Connections , 2015, Perspectives in biology and medicine.

[15]  Marcelo Fiszman,et al.  The interaction of domain knowledge and linguistic structure in natural language processing: interpreting hypernymic propositions in biomedical text , 2003, J. Biomed. Informatics.

[16]  Hongfang Liu,et al.  A new method for prioritizing drug repositioning candidates extracted by literature-based discovery , 2015, 2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[17]  Joyce A. Mitchell,et al.  Using literature-based discovery to identify disease candidate genes , 2005, Int. J. Medical Informatics.

[18]  Priya Nambisan,et al.  Using Social Media Data to Identify Potential Candidates for Drug Repurposing: A Feasibility Study , 2016, JMIR research protocols.

[19]  Joel Dudley,et al.  Exploiting drug-disease relationships for computational drug repositioning , 2011, Briefings Bioinform..

[20]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[21]  B. Cestnik,et al.  Predicting future discoveries from current scientific literature. , 2014, Methods in molecular biology.

[22]  Halil Kilicoglu,et al.  SemMedDB: a PubMed-scale repository of biomedical semantic predications , 2012, Bioinform..

[23]  Elizabeth Gibney How to tame the flood of literature , 2014, Nature.

[24]  Wanda Pratt,et al.  A new evaluation methodology for literature-based discovery systems , 2009, J. Biomed. Informatics.

[25]  Carol Friedman,et al.  Exploiting Semantic Relations for Literature-Based Discovery , 2006, AMIA.

[26]  Neil R. Smalheiser,et al.  Ranking indirect connections in literature-based discovery: The role of medical subject headings: Research Articles , 2006 .

[27]  Funda Meric-Bernstam,et al.  Challenges and perspective of drug repurposing strategies in early phase clinical trials , 2015, Oncoscience.

[28]  Neil R. Smalheiser,et al.  Ranking indirect connections in literature-based discovery: The role of medical subject headings , 2006, J. Assoc. Inf. Sci. Technol..

[29]  Charles C. Persinger,et al.  How to improve R&D productivity: the pharmaceutical industry's grand challenge , 2010, Nature Reviews Drug Discovery.

[30]  Philip R. O. Payne,et al.  ‘RE:fine drugs’: an interactive dashboard to access drug repurposing opportunities , 2016, Database J. Biol. Databases Curation.

[31]  Vera Lúcia Strube de Lima,et al.  A systematic review of named entity recognition in biomedical texts , 2011, Journal of the Brazilian Computer Society.

[32]  Peter Davies,et al.  Discovering discovery patterns with predication-based Semantic Indexing , 2012, J. Biomed. Informatics.

[33]  Zhiyong Lu,et al.  Pathway-based drug repositioning using causal inference , 2013, BMC Bioinformatics.

[34]  Erik M. van Mulligen,et al.  Finding potentially new multimorbidity patterns of psychiatric and somatic diseases: exploring the use of literature-based discovery in primary care research , 2014, J. Am. Medical Informatics Assoc..

[35]  Ying Li,et al.  Validating drug repurposing signals using electronic health records: a case study of metformin associated with reduced cancer mortality , 2014, J. Am. Medical Informatics Assoc..

[36]  Michael D. Gordon,et al.  Literature-Based Discovery by Lexical Statistics , 1999, J. Am. Soc. Inf. Sci..

[37]  Pankaj Agarwal,et al.  Systematic Drug Repositioning Based on Clinical Side-Effects , 2011, PloS one.

[38]  T. Rindflesch,et al.  A closed literature-based discovery technique finds a mechanistic link between hypogonadism and diminished sleep quality in aging men. , 2012, Sleep.

[39]  D. Swanson,et al.  Indomethacin and Alzheimer's disease , 1996, Neurology.

[40]  Hongfang Liu,et al.  Operationalizing Semantic Medline for meeting the information needs at point of care , 2015, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[41]  T. J. Nickola,et al.  Dexamethasone-mediated repression of MUC5AC gene expression in human lung epithelial cells. , 2006, American journal of respiratory cell and molecular biology.

[42]  Van V. Brantner,et al.  Estimating the cost of new drug development: is it really 802 million dollars? , 2006, Health affairs.

[43]  Hongfang Liu,et al.  Assessing the Need of Discourse-Level Analysis in Identifying Evidence of Drug-Disease Relations in Scientific Literature , 2015, MedInfo.

[44]  D. Swanson Somatomedin C and Arginine: Implicit Connections between Mutually Isolated Literatures , 2015, Perspectives in biology and medicine.

[45]  Wanda Pratt,et al.  Using statistical and knowledge-based approaches for literature-based discovery , 2006, J. Biomed. Informatics.

[46]  Yi Hu,et al.  Simulation of Swanson's Literature-Based Discovery: Anandamide Treatment Inhibits Growth of Gastric Cancer Cells In Vitro and In Silico , 2014, PloS one.

[47]  Olivier Bodenreider,et al.  The Unified Medical Language System (UMLS): integrating biomedical terminology , 2004, Nucleic Acids Res..

[48]  P. Sanseau,et al.  Computational Drug Repositioning: From Data to Therapeutics , 2013, Clinical pharmacology and therapeutics.

[49]  Amit P. Sheth,et al.  Context-Driven Automatic Subgraph Creation for Literature-Based Discovery , 2015, J. Biomed. Informatics.

[50]  L. Cardon,et al.  Use of genome-wide association studies for drug repositioning , 2012, Nature Biotechnology.

[51]  Marc Weeber,et al.  Using concepts in literature-based discovery: Simulating Swanson's Raynaud-fish oil and migraine-magnesium discoveries , 2001, J. Assoc. Inf. Sci. Technol..

[52]  Xu Han,et al.  Literature Based Drug Interaction Prediction with Clinical Assessment Using Electronic Medical Records: Novel Myopathy Associated Drug Interactions , 2012, PLoS Comput. Biol..

[53]  Marc Weeber,et al.  Using concepts in literature-based discovery: simulating Swanson's Raynaud-fish oil and migraine-magnesium discoveries , 2001 .

[54]  Marcelo Fiszman,et al.  Semantic MEDLINE for Discovery Browsing: Using Semantic Predications and the Literature-Based Discovery Paradigm to Elucidate a Mechanism for the Obesity Paradox , 2013, AMIA.

[55]  Jonathan D. Wren,et al.  Extending the mutual information measure to rank inferred literature relationships , 2004, BMC Bioinformatics.

[56]  T. Ashburn,et al.  Drug repositioning: identifying and developing new uses for existing drugs , 2004, Nature Reviews Drug Discovery.