Determining molecular predictors of adverse drug reactions with causality analysis based on structure learning

OBJECTIVE Adverse drug reaction (ADR) can have dire consequences. However, our current understanding of the causes of drug-induced toxicity is still limited. Hence it is of paramount importance to determine molecular factors of adverse drug responses so that safer therapies can be designed. METHODS We propose a causality analysis model based on structure learning (CASTLE) for identifying factors that contribute significantly to ADRs from an integration of chemical and biological properties of drugs. This study aims to address two major limitations of the existing ADR prediction studies. First, ADR prediction is mostly performed by assessing the correlations between the input features and ADRs, and the identified associations may not indicate causal relations. Second, most predictive models lack biological interpretability. RESULTS CASTLE was evaluated in terms of prediction accuracy on 12 organ-specific ADRs using 830 approved drugs. The prediction was carried out by first extracting causal features with structure learning and then applying them to a support vector machine (SVM) for classification. Through rigorous experimental analyses, we observed significant increases in both macro and micro F1 scores compared with the traditional SVM classifier, from 0.88 to 0.89 and 0.74 to 0.81, respectively. Most importantly, identified links between the biological factors and organ-specific drug toxicities were partially supported by evidence in Online Mendelian Inheritance in Man. CONCLUSIONS The proposed CASTLE model not only performed better in prediction than the baseline SVM but also produced more interpretable results (ie, biological factors responsible for ADRs), which is critical to discovering molecular activators of ADRs.

[1]  Bin Chen,et al.  Gaining Insight into Off-Target Mediated Effects of Drug Candidates with a Comprehensive Systems Chemical Biology Analysis , 2009, J. Chem. Inf. Model..

[2]  Yoshihiro Yamanishi,et al.  Drug-target interaction prediction from chemical, genomic and pharmacological data in an integrated framework , 2010, Bioinform..

[3]  Trevor Hastie,et al.  Regularization Paths for Generalized Linear Models via Coordinate Descent. , 2010, Journal of statistical software.

[4]  Olivier Bodenreider,et al.  The Unified Medical Language System (UMLS): integrating biomedical terminology , 2004, Nucleic Acids Res..

[5]  David L Veenstra,et al.  Association between CYP2C9 genetic variants and anticoagulation-related outcomes during warfarin therapy. , 2002, JAMA.

[6]  J. Johnson,et al.  Pharmacogenomics: the inherited basis for interindividual differences in drug response. , 2001, Annual review of genomics and human genetics.

[7]  Magnus Ingelman-Sundberg,et al.  Pharmacogenomic biomarkers for prediction of severe adverse drug reactions. , 2008, The New England journal of medicine.

[8]  Ruichu Cai,et al.  Causal gene identification using combinatorial V-structure search , 2013, Neural Networks.

[9]  Judea Pearl,et al.  A Theory of Inferred Causation , 1991, KR.

[10]  Z. Birnbaum,et al.  One-Sided Confidence Contours for Probability Distribution Functions , 1951 .

[11]  M Pirmohamed,et al.  Genetic susceptibility to adverse drug reactions. , 2001, Trends in pharmacological sciences.

[12]  R. N. Macsween,et al.  Co-amoxiclav jaundice: clinical and histological features and HLA class II association , 2000, Gut.

[13]  A. Chiang,et al.  Data‐Driven Methods to Discover Molecular Determinants of Serious Adverse Drug Events , 2009, Clinical pharmacology and therapeutics.

[14]  P. Spirtes,et al.  Causation, Prediction, and Search, 2nd Edition , 2001 .

[15]  Ruichu Cai,et al.  BASSUM: A Bayesian semi-supervised method for classification feature selection , 2011, Pattern Recognit..

[16]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[17]  E. Simpson,et al.  Aromatase deficiency confers paradoxical postischemic cardioprotection. , 2011, Endocrinology.

[18]  R. Collins,et al.  SLCO1B1 variants and statin-induced myopathy--a genomewide study. , 2008, The New England journal of medicine.

[19]  Yanli Wang,et al.  PubChem: Integrated Platform of Small Molecules and Biological Activities , 2008 .

[20]  Hua Xu,et al.  Large-scale prediction of adverse drug reactions using chemical, biological, and phenotypic properties of drugs , 2012, J. Am. Medical Informatics Assoc..

[21]  Carol Friedman,et al.  Enhancing Adverse Drug Event Detection in Electronic Health Records Using Molecular Structure Similarity: Application to Pancreatitis , 2012, PloS one.

[22]  David S. Wishart,et al.  DrugBank 3.0: a comprehensive resource for ‘Omics’ research on drugs , 2010, Nucleic Acids Res..

[23]  Jonathan M. Teich,et al.  Identifying hospital admissions due to adverse drug events using a computer‐based monitor , 2001, AMIA.

[24]  Yong Hu,et al.  Software project risk analysis using Bayesian networks with causality constraints , 2013, Decis. Support Syst..

[25]  Gregory F. Cooper,et al.  Causal discovery from medical textual data , 2000, AMIA.

[26]  P. Bork,et al.  A side effect resource to capture phenotypic effects of drugs , 2010, Molecular systems biology.

[27]  P. Corey,et al.  Incidence of Adverse Drug Reactions in Hospitalized Patients , 2012 .

[28]  E. Brown,et al.  The Medical Dictionary for Regulatory Activities (MedDRA) , 1999, Drug safety.

[29]  P. Spirtes,et al.  Causation, prediction, and search , 1993 .

[30]  Evan Bolton,et al.  An overview of the PubChem BioAssay resource , 2009, Nucleic Acids Res..

[31]  J. Castle,et al.  An integrative genomics approach to infer causal associations between gene expression and disease , 2005, Nature Genetics.

[32]  Peter Bühlmann,et al.  Estimating High-Dimensional Directed Acyclic Graphs with the PC-Algorithm , 2007, J. Mach. Learn. Res..

[33]  U. Meyer Pharmacogenetics and adverse drug reactions , 2000, The Lancet.

[34]  C. Grohé,et al.  Expression of oestrogen receptor alpha and beta in rat heart: role of local oestrogen synthesis. , 1998, The Journal of endocrinology.

[35]  Ann K Daly Pharmacogenomics of adverse drug reactions , 2013, Genome Medicine.

[36]  A. Fliri,et al.  Analysis of drug-induced effect patterns to link structure and side effects of medicines , 2005, Nature chemical biology.

[37]  P. Clayton,et al.  Diagnosis in bile acid-CoA: amino acid N-acyltransferase deficiency. , 2012, World journal of gastroenterology.

[38]  Russ Altman,et al.  Pharmacogenomics: Challenges and Opportunities , 2006, Annals of Internal Medicine.

[39]  Carol Friedman,et al.  Facilitating adverse drug event detection in pharmacovigilance databases using molecular structure similarity: application to rhabdomyolysis , 2011, J. Am. Medical Informatics Assoc..

[40]  Philip E. Bourne,et al.  Drug Discovery Using Chemical Systems Biology: Identification of the Protein-Ligand Binding Network To Explain the Side Effects of CETP Inhibitors , 2009, PLoS Comput. Biol..

[41]  David S. Wishart,et al.  DrugBank: a knowledgebase for drugs, drug actions and drug targets , 2007, Nucleic Acids Res..

[42]  M. Rieder,et al.  A genome-wide scan for common genetic variants with a large influence on warfarin maintenance dose. , 2008, Blood.

[43]  Ben Y. Reis,et al.  Predicting Adverse Drug Events Using Pharmacological Network Models , 2011, Science Translational Medicine.

[44]  Barry C. Jones,et al.  DRUG-DRUG INTERACTIONS FOR UDP-GLUCURONOSYLTRANSFERASE SUBSTRATES: A PHARMACOKINETIC EXPLANATION FOR TYPICALLY OBSERVED LOW EXPOSURE (AUCI/AUC) RATIOS , 2004, Drug Metabolism and Disposition.

[45]  M. Milik,et al.  Mapping adverse drug reactions in chemical space. , 2009, Journal of medicinal chemistry.

[46]  P. Bork,et al.  Network Neighbors of Drug Targets Contribute to Drug Side-Effect Similarity , 2011, PloS one.

[47]  M. Schuemie,et al.  Combining electronic healthcare databases in Europe to allow for large‐scale drug safety monitoring: the EU‐ADR Project , 2011, Pharmacoepidemiology and drug safety.

[48]  A. Butte,et al.  Predicting Adverse Drug Reactions Using Publicly Available PubChem BioAssay Data , 2011, Clinical pharmacology and therapeutics.

[49]  Hisashi Kashima,et al.  Side Effect Prediction Using Cooperative Pathways , 2009, 2009 IEEE International Conference on Bioinformatics and Biomedicine.

[50]  Teresa M. Przytycka,et al.  Identifying Causal Genes and Dysregulated Pathways in Complex Diseases , 2011, PLoS Comput. Biol..

[51]  E. Simpson,et al.  Relative expression of aromatase cytochrome P450 in human fetal tissues as determined by competitive polymerase chain reaction amplification. , 1992, The Journal of clinical endocrinology and metabolism.

[52]  J. Overhage,et al.  Advancing the Science for Active Surveillance: Rationale and Design for the Observational Medical Outcomes Partnership , 2010, Annals of Internal Medicine.

[53]  Michael J. Keiser,et al.  Large Scale Prediction and Testing of Drug Activity on Side-Effect Targets , 2012, Nature.

[54]  中尾 光輝,et al.  KEGG(Kyoto Encyclopedia of Genes and Genomes)〔和文〕 (特集 ゲノム医学の現在と未来--基礎と臨床) -- (データベース) , 2000 .

[55]  P. Bork,et al.  Drug Target Identification Using Side-Effect Similarity , 2008, Science.

[56]  J. Cossman,et al.  Drug Development and the FDA's Critical Path Initiative , 2007, Clinical pharmacology and therapeutics.

[57]  Yoshihiro Yamanishi,et al.  Predicting drug side-effect profiles: a chemical fragment-based approach , 2011, BMC Bioinformatics.

[58]  David S. Wishart,et al.  DrugBank: a comprehensive resource for in silico drug discovery and exploration , 2005, Nucleic Acids Res..

[59]  Bin Chen,et al.  PubChem as a Source of Polypharmacology , 2009, J. Chem. Inf. Model..

[60]  Russell A. Wilke,et al.  Identifying genetic risk factors for serious adverse drug reactions: current progress and challenges , 2008, Nature Reviews Drug Discovery.

[61]  Kiyoko F. Aoki-Kinoshita,et al.  From genomics to chemical genomics: new developments in KEGG , 2005, Nucleic Acids Res..

[62]  C Helma,et al.  Prediction of Adverse Drug Reactions Using Decision Tree Modeling , 2010, Clinical pharmacology and therapeutics.

[63]  Snigdhansu Chatterjee,et al.  Causality and pathway search in microarray time series experiment , 2007, Bioinform..

[64]  Susumu Goto,et al.  KEGG for representation and analysis of molecular networks involving diseases and drugs , 2009, Nucleic Acids Res..

[65]  M. F. Fuller,et al.  Practical Nonparametric Statistics; Nonparametric Statistical Inference , 1973 .

[66]  A. Bender,et al.  Analysis of Pharmacology Data and the Prediction of Adverse Drug Reactions and Off‐Target Effects from Chemical Structure , 2007, ChemMedChem.

[67]  Pradip Kumar Saha,et al.  Evaluation of clinical and immunogenetic risk factors for the development of hepatotoxicity during antituberculosis treatment. , 2002, American journal of respiratory and critical care medicine.