Predict the Protein-protein Interaction between Virus and Host through Hybrid Deep Neural Network

Viral infection, in which protein-protein interactions (PPIs) between viruses and hosts are involved, has long been considered a threat to human health. Studying virus-host PPIs helps to clarify the mechanism of viral infection and supports the development of new drugs. Most existing sequence-based studies focus only on extracting features from the original amino acid sequences and neglect the redundancy and noise in those features. In this paper, we employ L1-regularized logistic regression to select effective PPI-related sequence features without losing accuracy or generalization. We then design a hybrid deep learning framework that combines a convolutional neural network with a long short-term memory network to extract latent high-level features. Experimental results show that the proposed framework outperforms existing advanced frameworks on both benchmark data and independent tests, and that it is promising for identifying virus-host interactions.
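
To illustrate the feature-selection idea, the following is a minimal sketch of L1-regularized logistic regression fitted by proximal gradient descent. It is not the authors' implementation: the synthetic data, the function names (`fit_l1_logistic`, `select_features`), and the hyperparameters are assumptions chosen for illustration. In practice the input columns would be descriptors computed from virus and host amino acid sequences.

```python
import math
import random


def soft_threshold(value, threshold):
    """Proximal operator of the L1 penalty: shrink a value toward zero."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def fit_l1_logistic(X, y, l1_strength=0.1, learning_rate=0.5, n_iter=500):
    """Fit logistic regression with an L1 penalty via proximal gradient steps."""
    n_samples, n_features = len(X), len(X[0])
    weights = [0.0] * n_features
    bias = 0.0
    for _ in range(n_iter):
        grad_w = [0.0] * n_features
        grad_b = 0.0
        for row, label in zip(X, y):
            score = sum(w * x for w, x in zip(weights, row)) + bias
            error = sigmoid(score) - label
            grad_b += error
            for j, x in enumerate(row):
                grad_w[j] += error * x
        # Plain gradient step on the (unpenalized) bias.
        bias -= learning_rate * grad_b / n_samples
        # Gradient step followed by soft-thresholding on the weights.
        weights = [
            soft_threshold(w - learning_rate * g / n_samples,
                           learning_rate * l1_strength)
            for w, g in zip(weights, grad_w)
        ]
    return weights, bias


def select_features(weights):
    """Return indices of features whose weight was not shrunk to zero."""
    return [j for j, w in enumerate(weights) if w != 0.0]


# Synthetic stand-in for sequence features (assumption): 6 columns,
# of which only columns 0 and 1 determine the interaction label.
random.seed(0)
n_features = 6
X = [[random.gauss(0.0, 1.0) for _ in range(n_features)] for _ in range(300)]
y = [1 if row[0] - row[1] > 0 else 0 for row in X]

weights, bias = fit_l1_logistic(X, y)
selected = select_features(weights)
print("selected feature indices:", selected)
```

The L1 penalty drives the weights of uninformative columns to exactly zero, so the surviving indices form the reduced feature set that a downstream model, such as the hybrid network described above, would receive.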
