Leveraging syntactic and semantic graph kernels to extract pharmacokinetic drug drug interactions from biomedical literature

BackgroundInformation about drug–drug interactions (DDIs) supported by scientific evidence is crucial for establishing computational knowledge bases for applications like pharmacovigilance. Since new reports of DDIs are rapidly accumulating in the scientific literature, text-mining techniques for automatic DDI extraction are critical. We propose a novel approach for automated pharmacokinetic (PK) DDI detection that incorporates syntactic and semantic information into graph kernels, to address the problem of sparseness associated with syntactic-structural approaches. First, we used a novel all-path graph kernel using shallow semantic representation of sentences. Next, we statistically integrated fine-granular semantic classes into the dependency and shallow semantic graphs.ResultsWhen evaluated on the PK DDI corpus, our approach significantly outperformed the original all-path graph kernel that is based on dependency structure. Our system that combined dependency graph kernel with semantic classes achieved the best F-scores of 81.94 % for in vivo PK DDIs and 69.34 % for in vitro PK DDIs, respectively. Further, combining shallow semantic graph kernel with semantic classes achieved the highest precisions of 84.88 % for in vivo PK DDIs and 74.83 % for in vitro PK DDIs, respectively.ConclusionsWe presented a graph kernel based approach to combine syntactic and semantic information for extracting pharmacokinetic DDIs from Biomedical Literature. Experimental results showed that our proposed approach could extract PK DDIs from literature effectively, which significantly enhanced the performance of the original all-path graph kernel based on dependency structure.

[1]  Alessandro Moschitti,et al.  Making Tree Kernels Practical for Natural Language Learning , 2006, EACL.

[2]  Alberto Lavelli,et al.  Exploiting the Scope of Negations and Heterogeneous Features for Relation Extraction: A Case Study for Drug-Drug Interaction Extraction , 2013, HLT-NAACL.

[3]  Emily R. Hajjar,et al.  Polypharmacy in elderly patients. , 2007, The American journal of geriatric pharmacotherapy.

[4]  Chitta Baral,et al.  Discovering drug–drug interactions: a text-mining and reasoning approach based on properties of drug metabolism , 2010, Bioinform..

[5]  R. Altman,et al.  Informatics confronts drug-drug interactions. , 2013, Trends in pharmacological sciences.

[6]  R. Niska,et al.  National Hospital Ambulatory Medical Care Survey: 2007 emergency department summary. , 2010, National health statistics reports.

[7]  Lang Li,et al.  Extraction of drug-drug interactions using all paths graph kernel , 2011 .

[8]  Carol Friedman,et al.  Research Paper: A General Natural-language Text Processor for Clinical Radiology , 1994, J. Am. Medical Informatics Assoc..

[9]  Herrero-ZazoMaría,et al.  The DDI corpus , 2013 .

[10]  Daniel Gildea,et al.  The Proposition Bank: An Annotated Corpus of Semantic Roles , 2005, CL.

[11]  Jun'ichi Tsujii,et al.  Feature Forest Models for Probabilistic HPSG Parsing , 2008, CL.

[12]  K. Bretonnel Cohen,et al.  UColorado_SOM: Extraction of Drug-Drug Interactions from Biomedical Text using Knowledge-rich and Knowledge-poor Features , 2013, SemEval@NAACL-HLT.

[13]  César de Pablo-Sánchez,et al.  Using a shallow linguistic kernel for drug-drug interaction extraction , 2011, J. Biomed. Informatics.

[14]  Isabelle Ragueneau-Majlessi,et al.  A useful tool for drug interaction evaluation: The University of Washington Metabolism and Transport Drug Interaction Database , 2010, Human Genomics.

[15]  Paloma Martínez,et al.  The DDI corpus: An annotated corpus with pharmacological substances and drug-drug interactions , 2013, J. Biomed. Informatics.

[16]  K. Bretonnel Cohen,et al.  Recognizing Sublanguages in Scientific Journal Articles through Closure Properties , 2013, BioNLP@ACL.

[17]  Stephan Oepen,et al.  SemEval 2014 Task 8: Broad-Coverage Semantic Dependency Parsing , 2014, *SEMEVAL.

[18]  Christopher D. Manning,et al.  Generating Typed Dependency Parses from Phrase Structure Parses , 2006, LREC.

[19]  Lisa E Hines,et al.  Recommendations for Generating, Evaluating, and Implementing Drug‐Drug Interaction Evidence , 2012, Pharmacotherapy.

[20]  Amy J Grizzle,et al.  Concordance of severity ratings provided in four drug interaction compendia. , 2004, Journal of the American Pharmacists Association : JAPhA.

[21]  J. Lightwood,et al.  Black Box Warning Contraindicated Comedications: Concordance Among Three Major Drug Interaction Screening Programs , 2010, The Annals of pharmacotherapy.

[22]  Sadid A. Hasan,et al.  Using Syntactic and Shallow Semantic Kernels to Improve Multi-Modality Manifold-Ranking for Topic-Focused Multi-Document Summarization , 2011, IJCNLP.

[23]  Hans C. Boas,et al.  Sign-Based Construction Grammar , 2012 .

[24]  C. Pollard,et al.  Center for the Study of Language and Information , 2022 .

[25]  W. Stigelman,et al.  Goodman and Gilman's the Pharmacological Basis of Therapeutics , 1986 .

[26]  Ira J. Kalet,et al.  Computing with evidence: Part II: An evidential approach to predicting metabolic drug-drug interactions , 2009, J. Biomed. Informatics.

[27]  Shiew-Mei Huang,et al.  Drug interactions evaluation: an integrated part of risk assessment of therapeutics. , 2010, Toxicology and applied pharmacology.

[28]  Paloma Martínez,et al.  SemEval-2013 Task 9 : Extraction of Drug-Drug Interactions from Biomedical Texts (DDIExtraction 2013) , 2013, *SEMEVAL.

[29]  David S. Wishart,et al.  DrugBank: a knowledgebase for drugs, drug actions and drug targets , 2007, Nucleic Acids Res..

[30]  Z. Harris A Theory of Language and Information: A Mathematical Approach , 1991 .

[31]  Xu Han,et al.  An integrated pharmacokinetics ontology and corpus for text mining , 2013, BMC Bioinformatics.

[32]  Olivier Bodenreider,et al.  The Unified Medical Language System (UMLS): integrating biomedical terminology , 2004, Nucleic Acids Res..

[33]  L. Goodman,et al.  The Pharmacological Basis of Therapeutics , 1941 .

[34]  Halil Kilicoglu,et al.  SemMedDB: a PubMed-scale repository of biomedical semantic predications , 2012, Bioinform..

[35]  Isabel Segura-Bedmar,et al.  The 1st DDIExtraction-2011 challenge task: Extraction of Drug-Drug Interactions from biomedical texts , 2011 .

[36]  Shiew-Mei Huang,et al.  Predicting Drug–Drug Interactions: An FDA Perspective , 2009, The AAPS Journal.

[37]  Anna Maria Di Sciullo,et al.  Natural Language Understanding , 2009, SoMeT.

[38]  Razvan C. Bunescu,et al.  A Shortest Path Dependency Kernel for Relation Extraction , 2005, HLT.

[39]  B. Stricker,et al.  Hospitalisations and emergency department visits due to drug–drug interactions: a literature review , 2007, Pharmacoepidemiology and drug safety.

[40]  Richard D. Boyce,et al.  Using natural language processing to identify pharmacokinetic drug-drug interactions described in drug package inserts , 2012 .

[41]  Peter M. A. Sloot,et al.  A novel feature-based approach to extract drug-drug interactions from biomedical text , 2014, Bioinform..

[42]  Supinya Dechanont,et al.  Hospital admissions/visits associated with drug–drug interactions: a systematic review and meta‐analysis , 2014, Pharmacoepidemiology and drug safety.

[43]  Roberto Basili,et al.  Exploiting Syntactic and Shallow Semantic Kernels for Question Answer Classification , 2007, ACL.

[44]  Paloma Martínez,et al.  Lessons learnt from the DDIExtraction-2013 Shared Task , 2014, J. Biomed. Informatics.

[45]  I. Edwards,et al.  Adverse drug reactions: definitions, diagnosis, and management , 2000, The Lancet.

[46]  Alberto Lavelli,et al.  FBK-irst : A Multi-Phase Kernel Based Approach for Drug-Drug Interaction Detection and Classification that Exploits Linguistic Information , 2013, *SEMEVAL.

[47]  Hongfei Lin,et al.  Extracting Drug-Drug Interaction from the Biomedical Literature Using a Stacked Generalization-Based Approach , 2013, PloS one.

[48]  Stephan Oepen,et al.  Broad-Coverage Semantic Dependency Parsing , 2014 .

[49]  Yoshimasa Tsuruoka,et al.  Extraction from Biomedical Literature Using Predicate-Argument Structure Patterns , 2013 .

[50]  James F. Allen Natural language understanding (2nd ed.) , 1995 .

[51]  Thomas C. Wiegers,et al.  A CTD–Pfizer collaboration: manual curation of 88 000 scientific articles text mined for drug–disease and drug–phenotype interactions , 2013, Database J. Biol. Databases Curation.

[52]  M. J. Hall,et al.  National Hospital Discharge Survey: 2007 summary. , 2010, National health statistics reports.

[53]  Carol Friedman,et al.  Two biomedical sublanguages: a description based on the theories of Zellig Harris , 2002, J. Biomed. Informatics.

[54]  Lisa E. Hines,et al.  Ability of pharmacy clinical decision-support software to alert users about clinically important drug-drug interactions , 2011, J. Am. Medical Informatics Assoc..

[55]  Jun'ichi Tsujii,et al.  Towards efficient probabilistic HPSG parsing: integrating semantic and syntactic preference to gu , 2004 .