Comprehensive Prediction and Interpretation of Viral Protein Subcellular Localization

Determining the subcellular localization of viral proteins is indispensable for understanding viral activity and inferring viral protein functions. Although predictors of viral protein subcellular localization have been developed in previous studies, they often have the following disadvantages: (i) they focus on only a subset of the proteins of a species, (ii) they do not consider multi-location proteins, and (iii) their results lack interpretability. To address these problems, this paper is the first to predict the subcellular localization of the entire viral proteome in UniProtKB while providing interpretable results. Using the FUEL-mLoc predictor, we achieve high prediction accuracy for both single-location and multi-location viral proteins. More importantly, we perform an in-depth analysis and interpretation of the subcellular localization of all viral proteins. Finally, we identify essential Gene Ontology (GO) terms that help explain the results and are significant in predicting the subcellular localization of viral proteins.
