Drug Side-Effect Profiles Prediction: From Empirical to Structural Risk Minimization

The identification of drug side-effects is considered to be an important step in drug design, which could not only shorten the time but also reduce the cost of drug development. In this paper, we investigate the relationship between the potential side-effects of drug candidates and their chemical structures. The preliminary Regularized Regression (RR) model for drug side-effects prediction has promising features in the efficiency of model training and the existence of a closed form solution. It performs better than other state-of-the-art methods, in terms of minimum accuracy and average accuracy. In order to dig inside how drug structure will associate with side effect, we further propose weighted GTS (Generalized T-Student Kernel: WGTS) SVM model from a structural risk minimization perspective. The SVM model proposed in this paper provides a better understanding of drug side-effects in the process of drug development. The usefulness of the WGTS model lies in the superior performance in a cross validation setting on 888 approved drugs with 1385 side-effects profiling from SIDER database. This work is expected to shed light on intriguing studies that predict potential un-identifying side-effects and suggest how we can avoid drug side-effects by the removal of some distinguished chemical structures.

[1]  Hisashi Kashima,et al.  Side Effect Prediction Using Cooperative Pathways , 2009, 2009 IEEE International Conference on Bioinformatics and Biomedicine.

[2]  A. Barabasi,et al.  Drug—target network , 2007, Nature Biotechnology.

[3]  Dinesh P. Mital,et al.  Prediction of the serious adverse drug reactions using an artificial neural network model , 2011, Int. J. Medical Eng. Informatics.

[4]  Jens Sadowski,et al.  Comparison of Support Vector Machine and Artificial Neural Network Systems for Drug/Nondrug Classification , 2003, J. Chem. Inf. Comput. Sci..

[5]  P. Bork,et al.  A side effect resource to capture phenotypic effects of drugs , 2010, Molecular systems biology.

[6]  Marie-Dominique Devignes,et al.  Integrative relational machine-learning for understanding drug side-effect profiles , 2013, BMC Bioinformatics.

[7]  Jean-Marc Schwartz,et al.  A global view of drug-therapy interactions , 2007, BMC pharmacology.

[8]  Baris E. Suzek,et al.  The Universal Protein Resource (UniProt) in 2010 , 2009, Nucleic Acids Res..

[9]  Feng Liu,et al.  Drug side effect prediction through linear neighborhoods and multiple data source integration , 2016, 2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[10]  D. Butina,et al.  Predicting ADME properties in silico: methods and models. , 2002, Drug discovery today.

[11]  Roy J. Vaz,et al.  Characterization of HERG potassium channel inhibition using CoMSiA 3D QSAR and homology modeling approaches. , 2003, Bioorganic & medicinal chemistry letters.

[12]  Jean-Philippe Tarel,et al.  Non-Mercer Kernels for SVM Object Recognition , 2004, BMVC.

[13]  E. Uriarte,et al.  Simple Stochastic Fingerprints Towards Mathematical Modeling in Biology and Medicine 2. Unifying Markov Model for Drugs Side Effects , 2006, Bulletin of mathematical biology.

[14]  Hua Xu,et al.  Large-scale prediction of adverse drug reactions using chemical, biological, and phenotypic properties of drugs , 2012, J. Am. Medical Informatics Assoc..

[15]  Thomas Lengauer,et al.  A fast flexible docking method using an incremental construction algorithm. , 1996, Journal of molecular biology.

[16]  Bin Chen,et al.  PubChem as a Source of Polypharmacology , 2009, J. Chem. Inf. Model..

[17]  L. Holm,et al.  The Pfam protein families database , 2005, Nucleic Acids Res..

[18]  Sherry L. Jenkins,et al.  Network analysis of FDA approved drugs and their targets. , 2007, The Mount Sinai journal of medicine, New York.

[19]  Gene H. Golub,et al.  Matrix computations (3rd ed.) , 1996 .

[20]  Edda Klipp,et al.  Biochemical network-based drug-target prediction. , 2010, Current opinion in biotechnology.

[21]  Yasuo Tabei,et al.  Inferring protein domains associated with drug side effects based on drug-target interaction network , 2013, BMC Systems Biology.

[22]  M. Milik,et al.  Mapping adverse drug reactions in chemical space. , 2009, Journal of medicinal chemistry.

[23]  Feng Liu,et al.  Predicting drug side effects by multi-label learning and ensemble learning , 2015, BMC Bioinformatics.

[24]  Roded Sharan,et al.  An Algorithmic Framework for Predicting Side-Effects of Drugs , 2010, RECOMB.

[25]  J. Bajorath,et al.  Docking and scoring in virtual screening for drug discovery: methods and applications , 2004, Nature Reviews Drug Discovery.

[26]  Yoshihiro Yamanishi,et al.  Drug-target interaction prediction from chemical, genomic and pharmacological data in an integrated framework , 2010, Bioinform..

[27]  María Martín,et al.  The Universal Protein Resource (UniProt) in 2010 , 2010 .

[28]  E. Birney,et al.  Pfam: the protein families database , 2013, Nucleic Acids Res..

[29]  Gisbert Schneider,et al.  A Virtual Screening Method for Prediction of the hERG Potassium Channel Liability of Compound Libraries , 2002, Chembiochem : a European journal of chemical biology.

[30]  Yoshihiro Yamanishi,et al.  Predicting drug side-effect profiles: a chemical fragment-based approach , 2011, BMC Bioinformatics.