HeteroDualNet: A Dual Convolutional Neural Network With Heterogeneous Layers for Drug-Disease Association Prediction via Chou’s Five-Step Rule

Identifying new treatments for existing drugs can help reduce drug development costs and explore novel indications of drugs. The prediction of associations between drugs and diseases is challenging because their similarities and relations are complicated and non-linear. We propose a HeteroDualNet model to address this issue. Firstly, three types of matrices are extracted to represent intra-drug similarities, intra-disease similarity and drug-disease associations. The intra-drug similarities consider three drug features and a newly introduced drug-related disease correlation. Secondly, an embedding mechanism is proposed to integrate these matrices in a heterogenous drug-disease association layer (hetero-layer). Further, a neighbouring heterogeneous layer (hetero-layer-N) is constructed to incorporate the biological premise that similar drugs can often treat related diseases. Finally, a dual convolutional neural network is built with hetero-layer and hetero-layer-N as two branches to learn from characteristics of drug-disease and the relations of their neighbours simultaneously. HeteroDualNet outperformed the other four methods in comparison over a public dataset of 763 drugs and 681 diseases in terms of Areas Under the Curves of Receiver Operating Characteristics and Precision-Recall, and recall rate at top k. Case study of five drugs further proved the capacity of HeteroDualNet in finding reliable disease candidates of drugs as validated by database records or literature. Our findings show that the embedded heterogenous layers of original and neighbouring drug-disease representations in a dual neural network improved the association prediction performance.

[1]  Kuo-Chen Chou,et al.  pLoc_bal-mGpos: Predict subcellular localization of Gram-positive bacterial proteins by quasi-balancing training dataset and PseAAC. , 2019, Genomics.

[2]  K. Chou Some remarks on protein attribute prediction and pseudo amino acid composition , 2010, Journal of Theoretical Biology.

[3]  Jun O. Liu,et al.  Recent Advances in Drug Repositioning for the Discovery of New Anticancer Drugs , 2014, International journal of biological sciences.

[4]  R. Sharan,et al.  PREDICT: a method for inferring novel drug indications with application to personalized medicine , 2011, Molecular systems biology.

[5]  B. Padhy,et al.  Drug repositioning: re-investigating existing drugs for new therapeutic indications. , 2011, Journal of postgraduate medicine.

[6]  K. Chou,et al.  REVIEW : Recent advances in developing web-servers for predicting protein attributes , 2009 .

[7]  K. Chou Advance in predicting subcellular localization of multi-label proteins and its implication for developing multi-target drugs. , 2019, Current medicinal chemistry.

[8]  Ping Zhang,et al.  Exploring the associations between drug side-effects and therapeutic indications , 2014, J. Biomed. Informatics.

[9]  H. Grabowski,et al.  Are the economics of pharmaceutical research and development changing? , 2012, PharmacoEconomics.

[10]  Alexander A. Morgan,et al.  Discovery and Preclinical Validation of Drug Indications Using Compendia of Public Gene Expression Data , 2011, Science Translational Medicine.

[11]  Chao Wu,et al.  Computational drug repositioning through heterogeneous network clustering , 2013, BMC Systems Biology.

[12]  Zhiyong Lu,et al.  A survey of current trends in computational drug repositioning , 2016, Briefings Bioinform..

[13]  Van V. Brantner,et al.  Estimating the cost of new drug development: is it really 802 million dollars? , 2006, Health affairs.

[14]  Yongcui Wang,et al.  Drug Repositioning by Kernel-Based Integration of Molecular Structure, Molecular Activity, and Phenotype Data , 2013, PloS one.

[15]  Xiang Zhang,et al.  Drug repositioning by integrating target information through a heterogeneous network model , 2014, Bioinform..

[16]  Man Mohan Mehndiratta,et al.  Drug repositioning , 2016, International Journal of Epilepsy.

[17]  Feng Liu,et al.  Predicting drug-disease associations by using similarity constrained matrix factorization , 2018, BMC Bioinformatics.

[18]  Charles C. Persinger,et al.  How to improve R&D productivity: the pharmaceutical industry's grand challenge , 2010, Nature Reviews Drug Discovery.

[19]  P. Sanseau,et al.  Computational Drug Repositioning: From Data to Therapeutics , 2013, Clinical pharmacology and therapeutics.

[20]  P. Sanseau,et al.  Drug repurposing: progress, challenges and recommendations , 2018, Nature Reviews Drug Discovery.

[21]  K. Chou Progresses in Predicting Post-translational Modification , 2019, International Journal of Peptide Research and Therapeutics.

[22]  N. Tamimi,et al.  Drug Development: From Concept to Marketing! , 2009, Nephron Clinical Practice.

[23]  Ping Xuan,et al.  Drug repositioning through integration of prior knowledge and projections of drugs and diseases , 2019, Bioinform..

[24]  Kuo-Chen Chou,et al.  iPhosH-PseAAC: Identify Phosphohistidine Sites in Proteins by Blending Statistical Moments and Position Relative Features According to the Chou's 5-Step Rule and General Pseudo Amino Acid Composition , 2019, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[25]  Kuo-Chen Chou,et al.  An Unprecedented Revolution in Medicinal Chemistry Driven by the Progress of Biological Science. , 2017, Current topics in medicinal chemistry.

[26]  Kuo-Chen Chou,et al.  iHyd-PseAAC (EPSV): Identifying Hydroxylation Sites in Proteins by Extracting Enhanced Position and Sequence Variant Feature via Chou's 5-Step Rule and General Pseudo Amino Acid Composition , 2019, Current genomics.

[27]  Yanli Wang,et al.  PubChem: a public information system for analyzing bioactivities of small molecules , 2009, Nucleic Acids Res..

[28]  Huaiyu Mi,et al.  The InterPro protein families database: the classification resource after 15 years , 2014, Nucleic Acids Res..

[29]  Oakland J. Peters,et al.  Predicting new indications for approved drugs using a proteochemometric method. , 2012, Journal of medicinal chemistry.

[30]  Kuo-Chen Chou,et al.  pLoc_bal‐mAnimal: predict subcellular localization of animal proteins by balancing training dataset and PseAAC , 2018, Bioinform..

[31]  Michael Hay,et al.  Clinical development success rates for investigational drugs , 2014, Nature Biotechnology.

[32]  Lin Gao,et al.  Inferring drug-disease associations based on known protein complexes , 2015, BMC Medical Genomics.

[33]  Jayne-Louise E. Pritchard,et al.  Enhancing the Promise of Drug Repositioning through Genetics , 2017, Front. Pharmacol..

[34]  Ying Fu,et al.  LRSSL: predict and interpret drug‐disease associations based on data integration using sparse subspace learning , 2017, Bioinform..

[35]  Xing Chen,et al.  NLLSS: Predicting Synergistic Drug Combinations Based on Semi-supervised Learning , 2016, PLoS Comput. Biol..

[36]  Yi Pan,et al.  Drug repositioning based on comprehensive similarity measures and Bi-Random walk algorithm , 2016, Bioinform..

[37]  A. Speciale,et al.  Resveratrol role in Staphylococcus aureus-induced corneal inflammation. , 2013, Pathogens and Disease.

[38]  A. Chiang,et al.  Systematic Evaluation of Drug–Disease Relationships to Identify Leads for Novel Drug Uses , 2009, Clinical pharmacology and therapeutics.

[39]  Zuping Zhang,et al.  Network-Based Inference Methods for Drug Repositioning , 2015, Comput. Math. Methods Medicine.

[40]  K. Chou Impacts of bioinformatics to medicinal chemistry. , 2015, Medicinal chemistry (Shariqah (United Arab Emirates)).

[41]  Yoshihiro Yamanishi,et al.  Systematic Drug Repositioning for a Wide Range of Diseases with Integrative Analyses of Phenotypic and Molecular Data , 2015, J. Chem. Inf. Model..

[42]  Dong Wang,et al.  Inferring the human microRNA functional similarity and functional network based on microRNA-associated diseases , 2010, Bioinform..

[43]  Nicola Nosengo Can you teach old drugs new tricks? , 2016, Nature.

[44]  Kuo-Chen Chou,et al.  pLoc-mHum: predict subcellular localization of multi-location human proteins via general PseAAC to winnow out the crucial GO information , 2018, Bioinform..

[45]  Pankaj Agarwal,et al.  Systematic Drug Repositioning Based on Clinical Side-Effects , 2011, PloS one.

[46]  Armando Blanco,et al.  DrugNet: Network-based drug-disease prioritization by integrating heterogeneous data , 2015, Artif. Intell. Medicine.

[47]  Kuo-Chen Chou,et al.  SPalmitoylC-PseAAC: A sequence-based model developed via Chou's 5-steps rule and general PseAAC for identifying S-palmitoylation sites in proteins. , 2019, Analytical biochemistry.

[48]  Baris E. Suzek,et al.  The Universal Protein Resource (UniProt) in 2010 , 2009, Nucleic Acids Res..