Predicting Drug-Disease Treatment Associations Based on Topological Similarity and Singular Value Decomposition

To assist drug development, many computational methods have been proposed to identify potential drug-disease treatment associations before wet experiments. Based on the assumption that similar drugs may treat similar diseases, most methods need the similarities of drugs and diseases, and they will not work if the biological or chemical features for computing similarities are missing. Besides, being lack of validated negative samples in the drug-disease associations data, most methods simply select some unlabeled samples as negative ones, which may introduce noises. Herein, we propose a new method (TS-SVD) which only uses those known drug-protein, disease-protein and drug-disease interactions to predict the potential drug-disease associations. In a constructed drug-protein-disease heterogeneous network, we consider the common neighbors of drugs and diseases to obtain the topological similarity. Then the topological similarity matrix of drugs (diseases) will be used to get the low dimensional embedding representations of drug-disease pairs. Finally, a Random Forest classifier is trained to do the prediction. To train a more reasonable model, we select out some reliable negative samples based on the k-step neighbors relationships between drugs and diseases. Compared with some state-of-the-art methods, we use less information but achieve better or comparable performance. Meanwhile, our strategy for selecting reliable negative samples can improve the performances of these methods.

[1]  Juan Liu,et al.  Semi-supervised graph cut algorithm for drug repositioning by integrating drug, disease and genomic associations , 2016, 2016 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[2]  Juan Liu,et al.  Prediction of drug-disease associations based on ensemble meta paths and singular value decomposition , 2019, BMC Bioinformatics.

[3]  Yi Pan,et al.  Drug repositioning based on comprehensive similarity measures and Bi-Random walk algorithm , 2016, Bioinform..

[4]  David S. Wishart,et al.  DrugBank 5.0: a major update to the DrugBank database for 2018 , 2017, Nucleic Acids Res..

[5]  Juan Liu,et al.  Prediction of drug-disease treatment relations based on positive and unlabeled samples , 2018, J. Intell. Fuzzy Syst..

[6]  Juan Liu,et al.  Predicting drug-disease interactions by semi-supervised graph cut algorithm and three-layer data integration , 2017, BMC Medical Genomics.

[7]  Alan F. Scott,et al.  Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders , 2002, Nucleic Acids Res..

[8]  Armando Blanco,et al.  DrugNet: Network-based drug-disease prioritization by integrating heterogeneous data , 2015, Artif. Intell. Medicine.

[9]  Xiang Zhang,et al.  Drug repositioning by integrating target information through a heterogeneous network model , 2014, Bioinform..

[10]  G. Vriend,et al.  A text-mining analysis of the human phenome , 2006, European Journal of Human Genetics.

[11]  Juan Wang,et al.  The computational prediction of drug-disease interactions using the dual-network L2,1-CMF method , 2018, BMC Bioinformatics.

[12]  R. Sharan,et al.  PREDICT: a method for inferring novel drug indications with application to personalized medicine , 2011, Molecular systems biology.

[13]  Alan Talevi,et al.  Computer-Aided Drug Design: An Overview. , 2018, Methods in molecular biology.

[14]  Feng Liu,et al.  Predicting drug-disease associations by using similarity constrained matrix factorization , 2018, BMC Bioinformatics.

[15]  Chris Morley,et al.  Open Babel: An open chemical toolbox , 2011, J. Cheminformatics.

[16]  The UniProt Consortium,et al.  UniProt: a worldwide hub of protein knowledge , 2018, Nucleic Acids Res..

[17]  Ying Fu,et al.  LRSSL: predict and interpret drug‐disease associations based on data integration using sparse subspace learning , 2017, Bioinform..