Measure clinical drug-drug similarity using Electronic Medical Records

OBJECTIVE Quantitative measurement of clinical drug-drug similarity has many potential applications in assessing medication therapy similarity and patient similarity. Currently, most of the methods to measure drug-drug similarity were not directly obtained from clinical data and cannot cover clinical drugs. We sought to propose a computational approach to measure clinical drug-drug similarity based on the Electronic Medical Record (EMR) system. MATERIALS AND METHODS We used the Bonferroni-corrected hypergeometric P value to generate statistically significant associations between drugs and diagnoses in an EMR dataset which contained 812 554 medication records and 339 269 discharge diagnosis codes. Then the Jaccard similarity coefficient was used to measure the distances between drugs. A k-means based bootstrapping method was proposed to generate drug clusters. RESULTS The similarity matrix contains total 1210 clinical drugs used in the hospital was calculated. The clinical drug-drug similarity shows significant correlation with the chemical similarity of drugs and literature-based drug-drug similarity but with unique features. Based on this drug-drug similarity, 36 clinical drug clusters most of which were related to specific clinical conditions were generated. Detail of this drug clusters available at http://kb4md.org:4000/drugcluster. DISCUSSION This method provided a whole new view of the relationship among clinical drugs. Furthermore, it has the potential to evaluate the effectiveness of drug knowledge translation and provide quantitative knowledge resources for many applications such as treatment comparisons and patient similarity. CONCLUSION We proposed a clinical drug-drug similarity measurement that generated from clinical practice data and covers all clinical drugs.

[1]  Yi Pan,et al.  Drug repositioning based on comprehensive similarity measures and Bi-Random walk algorithm , 2016, Bioinform..

[2]  Hong Sun,et al.  Semantic processing of EHR data for clinical research , 2015, J. Biomed. Informatics.

[3]  David Sánchez,et al.  A framework for unifying ontology-based semantic similarity measures: A study in the biomedical domain , 2014, J. Biomed. Informatics.

[4]  Yoshihiro Yamanishi,et al.  Target-Based Drug Repositioning Using Large-Scale Chemical-Protein Interactome Data , 2015, J. Chem. Inf. Model..

[5]  M. Brusco,et al.  Choosing the number of clusters in Κ-means clustering. , 2011, Psychological methods.

[6]  Ping Zhang,et al.  Towards Drug Repositioning: A Unified Computational Framework for Integrating Multiple Aspects of Drug Similarity and Disease Similarity , 2014, AMIA.

[7]  Chirag J. Patel,et al.  MeSHDD: Literature-based drug-drug similarity for drug repositioning , 2016, J. Am. Medical Informatics Assoc..

[8]  David Rogers,et al.  Extended-Connectivity Fingerprints , 2010, J. Chem. Inf. Model..

[9]  Károly Héberger,et al.  Why is Tanimoto index an appropriate choice for fingerprint-based similarity calculations? , 2015, Journal of Cheminformatics.

[10]  Lise Getoor,et al.  A probabilistic approach for collective similarity-based drug-drug interaction prediction , 2016, Bioinform..

[11]  David Sánchez,et al.  Ontology-based information content computation , 2011, Knowl. Based Syst..

[12]  Anil K. Jain Data clustering: 50 years beyond K-means , 2010, Pattern Recognit. Lett..

[13]  Xinmin Liu,et al.  The Classification and Application of Toxic Chinese Materia Medica , 2014, Phytotherapy research : PTR.

[14]  Richard A Armstrong,et al.  When to use the Bonferroni correction , 2014, Ophthalmic & physiological optics : the journal of the British College of Ophthalmic Opticians.

[15]  Reza Safdari,et al.  Computational prediction of drug-drug interactions based on drugs functional similarities , 2017, J. Biomed. Informatics.

[16]  P. Bork,et al.  Drug Target Identification Using Side-Effect Similarity , 2008, Science.

[17]  Michael J. Keiser,et al.  Large Scale Prediction and Testing of Drug Activity on Side-Effect Targets , 2012, Nature.

[18]  Christian Hennig,et al.  Cluster-wise assessment of cluster stability , 2007, Comput. Stat. Data Anal..

[19]  Ross Ihaka,et al.  Gentleman R: R: A language for data analysis and graphics , 1996 .

[20]  Gwan-Su Yi,et al.  Drug Similarity Search Based on Combined Signatures in Gene Expression Profiles , 2014, Healthcare informatics research.

[21]  R. Altman,et al.  Data-Driven Prediction of Drug Effects and Interactions , 2012, Science Translational Medicine.

[22]  David S. Wishart,et al.  DrugBank: a knowledgebase for drugs, drug actions and drug targets , 2007, Nucleic Acids Res..

[23]  Sherry‐Ann Brown Patient Similarity: Emerging Concepts in Systems and Precision Medicine , 2016, Front. Physiol..