Automatically classifying the evidence type of drug-drug interaction research papers as a step toward computer supported evidence curation

A longstanding issue with knowledge bases that discuss drug-drug interactions (DDIs) is that they are inconsistent with one another. Computerized support might help experts be more objective in assessing DDI evidence. A requirement for such systems is accurate automatic classification of evidence types. In this pilot study, we developed a hierarchical classifier to classify clinical DDI studies into formally defined evidence types. The area under the ROC curve for sub-classifiers in the ensemble ranged from 0.78 to 0.87. The entire system achieved an F1 of 0.83 and 0.63 on two held-out datasets, the latter consisting focused on completely novel drugs from what the system was trained on. The results suggest that it is feasible to accurately automate the classification of a sub-set of DDI evidence types and that the hierarchical approach shows promise. Future work will test more advanced feature engineering techniques while expanding the system to classify a more complex set of evidence types.

[1]  Harry Hochheiser,et al.  Information needs for making clinical recommendations about potential drug-drug interactions: a synthesis of literature review and interviews , 2017, BMC Medical Informatics and Decision Making.

[2]  Jodi Schneider,et al.  Towards a foundational representation of potential drug-drug interaction knowledge. , 2014, CEUR workshop proceedings.

[3]  Ira J. Kalet,et al.  Modeling Drug Mechanism Knowledge Using Evidence and Truth Maintenance , 2007, IEEE Transactions on Information Technology in Biomedicine.

[4]  Ira J. Kalet,et al.  Computing with evidence: Part I: A drug-mechanism evidence taxonomy oriented toward confidence assignment , 2009, J. Biomed. Informatics.

[5]  Harry Hochheiser,et al.  Testing the face validity and inter-rater agreement of a simple approach to drug-drug interaction evidence assessment , 2020, J. Biomed. Informatics.

[6]  Olivier Bodenreider,et al.  Comparison of three commercial knowledge bases for detection of drug-drug interactions in clinical decision support , 2017, J. Am. Medical Informatics Assoc..

[7]  W. Raub From the National Institutes of Health. , 1990, JAMA.

[8]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[9]  Stan Matwin,et al.  Functional Annotation of Genes Using Hierarchical Text Categorization , 2005 .

[10]  Catia Marzolini,et al.  Development of an evidence evaluation and synthesis system for drug-drug interactions, and its application to a systematic review of HIV and malaria co-infection , 2017, PloS one.

[11]  Daniel Jurafsky,et al.  Distant supervision for relation extraction without labeled data , 2009, ACL.

[12]  Dean F Sittig,et al.  Implementation pearls from a new guidebook on improving medication use and outcomes with clinical decision support. Effective CDS is essential for addressing healthcare performance improvement imperatives. , 2009, Journal of healthcare information management : JHIM.

[13]  Marc Berg,et al.  Research Paper: Turning Off Frequently Overridden Drug Alerts: Limited Opportunities for Doing It Safely , 2008, J. Am. Medical Informatics Assoc..

[14]  Mathias Brochhausen,et al.  Formalizing Knowledge and Evidence about Potential Drug-drug Interactions , 2015, BDM2I@ISWC.

[15]  Kush R. Varshney,et al.  A Natural Language Processing System for Extracting Evidence of Drug Repurposing from Scientific Publications , 2020, AAAI.

[16]  Michael A. Wittie,et al.  Consensus Recommendations for Systematic Evaluation of Drug–Drug Interaction Evidence for Clinical Decision Support , 2015, Drug Safety.

[17]  Tuomas Korhonen,et al.  SFINX—a drug-drug interaction database designed for clinical decision support systems , 2009, European Journal of Clinical Pharmacology.