Predicting potential side effects of drugs by recommender methods and ensemble learning

Drugs provide help and promise for human health, but they usually come with side effects. Predicting side effects of drugs is a critical issue for the drug discovery. Although several machine-learning methods were proposed to predict the drug side effects, it remains the space for the improvement. To the best of our knowledge, many side effects are not detectable in clinical trials until drugs are approved, thus predicting potential or missing side effects based on the known side effects is important for the postmarketing surveillance. In order to solve this specific problem, we formulate approved drugs, side effect terms and drug-side effect associations as a recommender system, and transform the problem of predicting side effects into a recommender task. Two recommender methods, i.e. the integrated neighborhood-based method and the restricted Boltzmann machine-based method, are designed to make predictions. Further, in order to achieve better performances, we combine proposed methods and existing methods of the same type to develop ensemble models. Compared with benchmark methods, the proposed methods and the ensemble method lead to better performances, and the statistical analysis demonstrates the improvements are significant (p-value<0.05). In conclusion, the integrated neighborhood-based method, the restricted Boltzmann machine-based method and the ensemble method are promising tools for the side effect prediction. The source codes and datasets are provided as the supplementary. Predicting side effects of drugs is a critical issue for the drug discovery.We transform approved drugs, side effect terms and drug-side effect associations as a recommender system.We design two recommender methods, i.e. the integrated neighborhood-based method and the restricted Boltzmann machine-based method, to make predictions.Further, we combine proposed methods and existing methods of the same type to develop ensemble models.

[1]  Von-Wun Soo,et al.  Q ( QVHPEOH 5 DQN / HDUQLQJ $ SSURDFK IRU * HQH 3 ULRULWL ] DWLRQ , 2022 .

[2]  David S. Wishart,et al.  DrugBank 3.0: a comprehensive resource for ‘Omics’ research on drugs , 2010, Nucleic Acids Res..

[3]  P. Bork,et al.  A side effect resource to capture phenotypic effects of drugs , 2010, Molecular systems biology.

[4]  Meng Zhao,et al.  Prediction of conformational B-cell epitopes from 3D structures by random forests with a distance-based feature , 2011, BMC Bioinformatics.

[5]  Yoshihiro Yamanishi,et al.  Relating drug–protein interaction network with drug side effects , 2012, Bioinform..

[6]  Yoshihiro Yamanishi,et al.  Drug Side-Effect Prediction Based on the Integration of Chemical and Biological Spaces , 2012, J. Chem. Inf. Model..

[7]  Hua Xu,et al.  Large-scale prediction of adverse drug reactions using chemical, biological, and phenotypic properties of drugs , 2012, J. Am. Medical Informatics Assoc..

[8]  J. Bobadilla,et al.  Recommender systems survey , 2013, Knowl. Based Syst..

[9]  Marie-Dominique Devignes,et al.  Integrative relational machine-learning for understanding drug side-effect profiles , 2013, BMC Bioinformatics.

[10]  Randi Karlsen,et al.  Challenges and Opportunities of Using Recommender Systems for Personalized Health Education , 2009, MIE.

[11]  Susumu Goto,et al.  KEGG for representation and analysis of molecular networks involving diseases and drugs , 2009, Nucleic Acids Res..

[12]  Alberto Cortés-Martín,et al.  Study of LZ-Based Location Prediction and Its Application to Transportation Recommender Systems , 2012, Sensors.

[13]  C Helma,et al.  Prediction of Adverse Drug Reactions Using Decision Tree Modeling , 2010, Clinical pharmacology and therapeutics.

[14]  Yoshihiro Yamanishi,et al.  Predicting drug side-effect profiles: a chemical fragment-based approach , 2011, BMC Bioinformatics.

[15]  Jie Shen,et al.  Adverse Drug Events: Database Construction and in Silico Prediction , 2013, J. Chem. Inf. Model..

[16]  R. Polikar,et al.  Ensemble based systems in decision making , 2006, IEEE Circuits and Systems Magazine.

[17]  Ke Zhang,et al.  Predicting immunogenic T-cell epitopes by combining various sequence-derived features , 2013, 2013 IEEE International Conference on Bioinformatics and Biomedicine.

[18]  J. Chen,et al.  Predicting adverse side effects of drugs , 2011, BMC Genomics.

[19]  John Riedl,et al.  An Empirical Analysis of Design Choices in Neighborhood-Based Collaborative Filtering Algorithms , 2002, Information Retrieval.

[20]  David S. Wishart,et al.  DrugBank: a comprehensive resource for in silico drug discovery and exploration , 2005, Nucleic Acids Res..

[21]  J. Chen,et al.  Predicting adverse drug reaction profiles by integrating protein interaction networks with drug structures , 2013, Proteomics.

[22]  Yi-Cheng Zhang,et al.  Bipartite network projection and personal recommendation. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[23]  Taghi M. Khoshgoftaar,et al.  A Survey of Collaborative Filtering Techniques , 2009, Adv. Artif. Intell..

[24]  A. Bender,et al.  Analysis of Pharmacology Data and the Prediction of Adverse Drug Reactions and Off‐Target Effects from Chemical Structure , 2007, ChemMedChem.

[25]  Paul Smolensky,et al.  Information processing in dynamical systems: foundations of harmony theory , 1986 .

[26]  Jianhua Ruan,et al.  An ensemble approach for drug side effect prediction , 2013, 2013 IEEE International Conference on Bioinformatics and Biomedicine.

[27]  S. Bryant,et al.  PubChem as a public resource for drug discovery. , 2010, Drug discovery today.

[28]  A. Fliri,et al.  Analysis of drug-induced effect patterns to link structure and side effects of medicines , 2005, Nature chemical biology.

[29]  David S. Wishart,et al.  DrugBank: a knowledgebase for drugs, drug actions and drug targets , 2007, Nucleic Acids Res..

[30]  Yanli Wang,et al.  PubChem: a public information system for analyzing bioactivities of small molecules , 2009, Nucleic Acids Res..

[31]  David S. Wishart,et al.  DrugBank 4.0: shedding new light on drug metabolism , 2013, Nucleic Acids Res..

[32]  Christodoulos A. Floudas,et al.  CONCORD: a consensus method for protein secondary structure prediction via mixed integer linear optimization , 2012, Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[33]  Geoffrey E. Hinton,et al.  Restricted Boltzmann machines for collaborative filtering , 2007, ICML '07.

[34]  Hisashi Kashima,et al.  Side Effect Prediction Using Cooperative Pathways , 2009, 2009 IEEE International Conference on Bioinformatics and Biomedicine.

[35]  Bo Yao,et al.  EPSVR and EPMeta: prediction of antigenic epitopes using support vector regression and multiple server results , 2010, BMC Bioinformatics.

[36]  Jun Hu,et al.  Determining molecular predictors of adverse drug reactions with causality analysis based on structure learning , 2014, J. Am. Medical Informatics Assoc..