Using Drug Similarities for Discovery of Possible Adverse Reactions

We propose a new computational method for discovery of possible adverse drug reactions. The method consists of two key steps. First we use openly available resources to semi-automatically compile a consolidated data set describing drugs and their features (e.g., chemical structure, related targets, indications or known adverse reaction). The data set is represented as a graph, which allows for definition of graph-based similarity metrics. The metrics can then be used for propagating known adverse reactions between similar drugs, which leads to weighted (i.e., ranked) predictions of previously unknown links between drugs and their possible side effects. We implemented the proposed method in the form of a software prototype and evaluated our approach by discarding known drug-side effect links from our data and checking whether our prototype is able to re-discover them. As this is an evaluation methodology used by several recent state of the art approaches, we could compare our results with them. Our approach scored best in all widely used metrics like precision, recall or the ratio of relevant predictions present among the top ranked results. The improvement was as much as 125.79% over the next best approach. For instance, the F1 score was 0.5606 (66.35% better than the next best method). Most importantly, in 95.32% of cases, the top five results contain at least one, but typically three correctly predicted side effect (36.05% better than the second best approach).

[1]  C. Tappert,et al.  A Survey of Binary Similarity and Distance Measures , 2010 .

[2]  Yoshihiro Yamanishi,et al.  Drug Side-Effect Prediction Based on the Integration of Chemical and Biological Spaces , 2012, J. Chem. Inf. Model..

[3]  Feng Liu,et al.  Predicting drug side effects by multi-label learning and ensemble learning , 2015, BMC Bioinformatics.

[4]  I. Edwards,et al.  Adverse drug reactions: definitions, diagnosis, and management , 2000, The Lancet.

[5]  Sebastian Schneeweiss,et al.  Admissions caused by adverse drug events to internal medicine and emergency departments in hospitals: a longitudinal population-based study , 2002, European Journal of Clinical Pharmacology.

[6]  Carol Friedman,et al.  Combing signals from spontaneous reports and electronic health records for detection of adverse drug reactions , 2013, J. Am. Medical Informatics Assoc..

[7]  David S. Wishart,et al.  DrugBank: a comprehensive resource for in silico drug discovery and exploration , 2005, Nucleic Acids Res..

[8]  Tie-Yan Liu,et al.  Learning to Rank for Information Retrieval , 2011 .

[9]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[10]  Hinrich Schütze,et al.  Introduction to Information Retrieval: Evaluation in information retrieval , 2008 .

[11]  Evgeniy Gabrilovich,et al.  A Review of Relational Machine Learning for Knowledge Graphs , 2015, Proceedings of the IEEE.

[12]  Peer Bork,et al.  The SIDER database of drugs and side effects , 2015, Nucleic Acids Res..

[13]  Zoubin Ghahramani,et al.  Combining active learning and semi-supervised learning using Gaussian fields and harmonic functions , 2003, ICML 2003.

[14]  Yoshihiro Yamanishi,et al.  Predicting drug side-effect profiles: a chemical fragment-based approach , 2011, BMC Bioinformatics.

[15]  J. Skolnick,et al.  Comprehensive prediction of drug-protein interactions and side effects for the human proteome , 2015, Scientific Reports.

[16]  Cécile Paris,et al.  Text and Data Mining Techniques in Adverse Drug Reaction Detection , 2015, ACM Comput. Surv..

[17]  Roded Sharan,et al.  An Algorithmic Framework for Predicting Side-Effects of Drugs , 2010, RECOMB.

[18]  Nicole Tourigny,et al.  Bio2RDF: Towards a mashup to build bioinformatics knowledge systems , 2008, J. Biomed. Informatics.

[19]  Min-Ling Zhang,et al.  A Review on Multi-Label Learning Algorithms , 2014, IEEE Transactions on Knowledge and Data Engineering.

[20]  D. Bates,et al.  Incidence and preventability of adverse drug events among older persons in the ambulatory setting. , 2003, JAMA.