MCANet: shared-weight-based MultiheadCrossAttention network for drug-target interaction prediction

Accurate and effective drug-target interaction (DTI) prediction can greatly shorten the drug development lifecycle and reduce the cost of drug development. In the deep-learning-based paradigm for predicting DTI, robust drug and protein feature representations and their interaction features play a key role in improving the accuracy of DTI prediction. Additionally, the class imbalance problem and the overfitting problem in the drug-target dataset can also affect the prediction accuracy, and reducing the consumption of computational resources and speeding up the training process are also critical considerations. In this paper, we propose shared-weight-based MultiheadCrossAttention, a precise and concise attention mechanism that can establish the association between target and drug, making our models more accurate and faster. Then, we use the cross-attention mechanism to construct two models: MCANet and MCANet-B. In MCANet, the cross-attention mechanism is used to extract the interaction features between drugs and proteins for improving the feature representation ability of drugs and proteins, and the PolyLoss loss function is applied to alleviate the overfitting problem and the class imbalance problem in the drug-target dataset. In MCANet-B, the robustness of the model is improved by combining multiple MCANet models and prediction accuracy further increases. We train and evaluate our proposed methods on six public drug-target datasets and achieve state-of-the-art results. In comparison with other baselines, MCANet saves considerable computational resources while maintaining accuracy in the leading position; however, MCANet-B greatly improves prediction accuracy by combining multiple models while maintaining a balance between computational resource consumption and prediction accuracy.

[1]  Yaohang Li,et al.  IIFDTI: predicting drug-target interactions through interactive and independent features based on attention mechanism , 2022, Bioinform..

[2]  Mingxing Tan,et al.  PolyLoss: A Polynomial Expansion Perspective of Classification Loss Functions , 2022, ICLR.

[3]  Guohua Wang,et al.  Supervised graph co-contrastive learning for drug-target interaction prediction , 2022, Bioinform..

[4]  Hojung Nam,et al.  Sequence-based prediction of protein binding regions and drug–target interactions , 2022, Journal of Cheminformatics.

[5]  Zhenhua Feng,et al.  CPInformer for Efficient and Robust Compound-Protein Interaction Prediction , 2022, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[6]  Yaohang Li,et al.  BACPI: a bi-directional attention neural network for compound-protein interaction and binding affinity prediction , 2022, Bioinform..

[7]  Qichang Zhao,et al.  HyperAttentionDTI: improving drug-protein interaction prediction by sequence-based deep learning with attention mechanism , 2021, Bioinform..

[8]  Guohua Wang,et al.  Drug-target interaction predication via multi-channel graph neural networks , 2021, Briefings Bioinform..

[9]  Zhiqiang Wei,et al.  MCN-CPI: Multiscale Convolutional Network for Compound–Protein Interaction Prediction , 2021, Biomolecules.

[10]  Congmin Jia,et al.  Quantitative prediction model for affinity of drug–target interactions based on molecular vibrations and overall system of ligand-receptor , 2021, BMC Bioinformatics.

[11]  Guohua Wang,et al.  Evaluating disease similarity based on gene network reconstruction and representation , 2021, Bioinform..

[12]  Khushnood Abbas,et al.  A survey on deep learning and its applications , 2021, Comput. Sci. Rev..

[13]  Jie Zhang,et al.  BridgeDPI: A Novel Graph Neural Network for Predicting Drug-Protein Interactions , 2021, ArXiv.

[14]  Parvin Razzaghi,et al.  DeepCDA: deep cross-domain compound-protein affinity prediction through LSTM and convolutional neural networks , 2020, Bioinform..

[15]  Arne Elofsson,et al.  TransformerCPI: improving compound-protein interaction prediction by sequence-based deep learning with self-attention mechanism and label reversal experiments , 2020, Bioinform..

[16]  Jiajie Peng,et al.  Identifying drug-target interactions based on graph convolutional network and deep neural network , 2020, Briefings Bioinform..

[17]  Jimeng Sun,et al.  MolTrans: Molecular Interaction Transformer for drug–target interaction prediction , 2020, Bioinform..

[18]  Natalia Gimelshein,et al.  PyTorch: An Imperative Style, High-Performance Deep Learning Library , 2019, NeurIPS.

[19]  Ming Du,et al.  Computer vision algorithms and hardware implementations: A survey , 2019, Integr..

[20]  Hojung Nam,et al.  DeepConv-DTI: Prediction of drug-target interactions via deep learning with convolution on protein sequences , 2018, PLoS Comput. Biol..

[21]  Jugal K. Kalita,et al.  A Survey of the Usages of Deep Learning for Natural Language Processing , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[22]  Asif Ali Laghari,et al.  Machine Learning in Computer Vision: A Review , 2018, EAI Endorsed Trans. Scalable Inf. Syst..

[23]  Jun Sese,et al.  Compound‐protein interaction prediction with end‐to‐end learning of neural networks for graphs and sequences , 2018, Bioinform..

[24]  Yong Zhou,et al.  Prediction of Drug–Target Interaction Networks from the Integration of Protein Sequences and Drug Chemical Structures , 2017, Molecules.

[25]  Shuigeng Zhou,et al.  Boosting compound-protein interaction prediction by deep learning , 2015, 2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[26]  Yang Zhang,et al.  GLASS: a comprehensive database for experimentally validated GPCR-ligand associations , 2015, Bioinform..

[27]  Tao Xu,et al.  Making Sense of Large-Scale Kinase Inhibitor Bioactivity Data Sets: A Comparative and Integrative Analysis , 2014, J. Chem. Inf. Model..

[28]  Alexander Golbraikh,et al.  Application of Quantitative Structure–Activity Relationship Models of 5-HT1A Receptor Binding to Virtual Screening Identifies Novel and Potent 5-HT1A Ligands , 2014, J. Chem. Inf. Model..

[29]  Mindy I. Davis,et al.  Comprehensive analysis of kinase inhibitor selectivity , 2011, Nature Biotechnology.

[30]  David Rogers,et al.  Extended-Connectivity Fingerprints , 2010, J. Chem. Inf. Model..

[31]  Robert B. Russell,et al.  SuperTarget and Matador: resources for exploring drug-target relationships , 2007, Nucleic Acids Res..

[32]  Van V. Brantner,et al.  Estimating the cost of new drug development: is it really 802 million dollars? , 2006, Health affairs.

[33]  Kiyoko F. Aoki-Kinoshita,et al.  From genomics to chemical genomics: new developments in KEGG , 2005, Nucleic Acids Res..

[34]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[35]  Andrew L. Maas Rectifier Nonlinearities Improve Neural Network Acoustic Models , 2013 .

[36]  Antje Chang,et al.  BRENDA , the enzyme database : updates and major new developments , 2003 .

[37]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[38]  BIOINFORMATICS ORIGINAL PAPER doi:10.1093/bioinformatics/btm580 Systems biology , 2022 .