Identifying Protein Phosphorylation Sites with Kinase Substrate Specificity on Human Viruses

Viruses infect humans and progress inside the body leading to various diseases and complications. The phosphorylation of viral proteins catalyzed by host kinases plays crucial regulatory roles in enhancing replication and inhibition of normal host-cell functions. Due to its biological importance, there is a desire to identify the protein phosphorylation sites on human viruses. However, the use of mass spectrometry-based experiments is proven to be expensive and labor-intensive. Furthermore, previous studies which have identified phosphorylation sites in human viruses do not include the investigation of the responsible kinases. Thus, we are motivated to propose a new method to identify protein phosphorylation sites with its kinase substrate specificity on human viruses. The experimentally verified phosphorylation data were extracted from virPTM – a database containing 301 experimentally verified phosphorylation data on 104 human kinase-phosphorylated virus proteins. In an attempt to investigate kinase substrate specificities in viral protein phosphorylation sites, maximal dependence decomposition (MDD) is employed to cluster a large set of phosphorylation data into subgroups containing significantly conserved motifs. The experimental human phosphorylation sites are collected from Phospho.ELM, grouped according to its kinase annotation, and compared with the virus MDD clusters. This investigation identifies human kinases such as CK2, PKB, CDK, and MAPK as potential kinases for catalyzing virus protein substrates as confirmed by published literature. Profile hidden Markov model is then applied to learn a predictive model for each subgroup. A five-fold cross validation evaluation on the MDD-clustered HMMs yields an average accuracy of 84.93% for Serine, and 78.05% for Threonine. Furthermore, an independent testing data collected from UniProtKB and Phospho.ELM is used to make a comparison of predictive performance on three popular kinase-specific phosphorylation site prediction tools. In the independent testing, the high sensitivity and specificity of the proposed method demonstrate the predictive effectiveness of the identified substrate motifs and the importance of investigating potential kinases for viral protein phosphorylation sites.

[1]  Hsien-Da Huang,et al.  RegPhos: a system to explore the protein kinase–substrate phosphorylation network in humans , 2010, Nucleic Acids Res..

[2]  Joachim Selbig,et al.  PhosPhAt: a database of phosphorylation sites in Arabidopsis thaliana and a plant-specific phosphorylation site predictor , 2007, Nucleic Acids Res..

[3]  B. McManus,et al.  Protein Kinase B/Akt Regulates Coxsackievirus B3 Replication through a Mechanism Which Is Not Caspase Dependent , 2004, Journal of Virology.

[4]  R. Zell,et al.  Impact of global warming on viral diseases: what is the evidence? , 2008, Current Opinion in Biotechnology.

[5]  D. Litchfield,et al.  Evidence for Regulation of Mitotic Progression through Temporal Phosphorylation and Dephosphorylation of CK2α , 2009, Molecular and Cellular Biology.

[6]  Cathy H. Wu,et al.  UniProt: the Universal Protein knowledgebase , 2004, Nucleic Acids Res..

[7]  Hanno Steen,et al.  Phosphorylation Analysis by Mass Spectrometry , 2006, Molecular & Cellular Proteomics.

[8]  C. López-Otín,et al.  Location of phosphorylated residues in human respiratory syncytial virus phosphoprotein. , 1991, The Journal of general virology.

[9]  Hsien-Da Huang,et al.  SNOSite: Exploiting Maximal Dependence Decomposition to Identify Cysteine S-Nitrosylation with Substrate Site Specificity , 2011, PloS one.

[10]  L. Ratner,et al.  Phosphorylation of Human Immunodeficiency Virus Type 1 Vpr Regulates Cell Cycle Arrest , 2000, Journal of Virology.

[11]  G. Pauli,et al.  Human-immundeficiency-virus-type-1-encoded Vpu protein is phosphorylated by casein kinase II , 1992 .

[12]  N. Blom,et al.  Sequence and structure-based prediction of eukaryotic protein phosphorylation sites. , 1999, Journal of molecular biology.

[13]  P. Bork,et al.  Linear Motif Atlas for Phosphorylation-Dependent Signaling , 2008, Science Signaling.

[14]  Shih-Shin Chang,et al.  Effect of phosphorylation on the transactivation activity of Epstein-Barr virus BMRF1, a major target of the viral BGLF4 kinase. , 2008, The Journal of general virology.

[15]  M. Katze,et al.  High-Throughput Screening of the Yeast Kinome: Identification of Human Serine/Threonine Protein Kinases That Phosphorylate the Hepatitis C Virus NS5A Protein , 2004, Journal of Virology.

[16]  G. Crooks,et al.  WebLogo: a sequence logo generator. , 2004, Genome research.

[17]  Jorng-Tzong Horng,et al.  Incorporating support vector machine for identifying protein tyrosine sulfation sites , 2009, J. Comput. Chem..

[18]  L. Pinna,et al.  One‐thousand‐and‐one substrates of protein kinase CK2? , 2003, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[19]  R. Berro,et al.  RNA interference directed to CDK2 inhibits HIV-1 transcription. , 2005, Virology.

[20]  G. Pauli,et al.  Human-immunodeficiency-virus-type-1-encoded Vpu protein is phosphorylated by casein kinase II. , 1992, European journal of biochemistry.

[21]  Yu Xue,et al.  A summary of computational resources for protein phosphorylation. , 2010, Current protein & peptide science.

[22]  Nikolaj Blom,et al.  Phospho.ELM: A database of experimentally verified phosphorylation sites in eukaryotic proteins , 2004, BMC Bioinformatics.

[23]  L. Meijer,et al.  Pharmacological Cyclin-Dependent Kinase Inhibitors Inhibit Replication of Wild-Type and Drug-Resistant Strains of Herpes Simplex Virus and Human Immunodeficiency Virus Type 1 by Targeting Cellular, Not Viral, Proteins , 2002, Journal of Virology.

[24]  Bostjan Kobe,et al.  The Predikin webserver: improved prediction of protein kinase peptide specificity using structural information , 2008, Nucleic Acids Res..

[25]  R. Elliott,et al.  CDK/ERK-mediated phosphorylation of the human influenza A virus NS1 protein at threonine-215. , 2009, Virology.

[26]  Yu Xue,et al.  GPS: a comprehensive www server for phosphorylation sites prediction , 2005, Nucleic Acids Res..

[27]  P. McIntosh,et al.  Phosphorylation of the Human Papillomavirus Type 16 E1^E4 Protein at T57 by ERK Triggers a Structural Change That Enhances Keratin Binding and Protein Stability , 2009, Journal of Virology.

[28]  B. Roizman,et al.  Protein Kinase B/Akt Is Present in Activated Form throughout the Entire Replicative Cycle of ΔUS3 Mutant Virus but Only at Early Times after Infection with Wild-Type Herpes Simplex Virus 1 , 2006, Journal of Virology.

[29]  Ivan Mijakovic,et al.  NetPhosBac – A predictor for Ser/Thr phosphorylation sites in bacterial proteins , 2009, Proteomics.

[30]  M. Bukrinsky,et al.  Phosphorylation of Vpr regulates HIV type 1 nuclear import and macrophage infection. , 2002, AIDS research and human retroviruses.

[31]  S. Lele,et al.  Euclidean distance matrix analysis: a coordinate-free approach for comparing biological shapes using landmark data. , 1991, American journal of physical anthropology.

[32]  Nikolaj Blom,et al.  NetPhosYeast: prediction of protein phosphorylation sites in yeast , 2007, Bioinform..

[33]  Yu Xue,et al.  GPS 2.1: enhanced prediction of kinase-specific phosphorylation sites with an algorithm of motif length selection. , 2011, Protein engineering, design & selection : PEDS.

[34]  Hsien-Da Huang,et al.  KinasePhos 2.0: a web server for identifying protein kinase-specific phosphorylation sites based on sequences and coupling patterns , 2007, Nucleic Acids Res..

[35]  Jorng-Tzong Horng,et al.  Incorporating structural characteristics for identification of protein methylation sites , 2009, J. Comput. Chem..

[36]  Yu Xue,et al.  PPSP: prediction of PK-specific phosphorylation site with Bayesian decision theory , 2006, BMC Bioinformatics.

[37]  Michael B. Yaffe,et al.  Scansite 2.0: proteome-wide prediction of cell signaling interactions using short sequence motifs , 2003, Nucleic Acids Res..

[38]  Sean R. Eddy,et al.  Profile hidden Markov models , 1998, Bioinform..

[39]  Tzong-Yi Lee,et al.  PlantPhos: using maximal dependence decomposition to identify plant phosphorylation sites with substrate site specificity , 2011, BMC Bioinformatics.

[40]  S. Ruben,et al.  Functional significance of phosphorylation to the human immunodeficiency virus Rev protein , 1989, Journal of virology.

[41]  Tzong-Yi Lee,et al.  Carboxylator: incorporating solvent-accessible surface area for identifying protein carboxylation sites , 2011, J. Comput. Aided Mol. Des..

[42]  Bostjan Kobe,et al.  Substrate specificity of protein kinases and computational prediction of substrates. , 2005, Biochimica et biophysica acta.

[43]  Jorng-Tzong Horng,et al.  KinasePhos: a web tool for identifying protein kinase-specific phosphorylation sites , 2005, Nucleic Acids Res..

[44]  George M Church,et al.  Collection and Motif-Based Prediction of Phosphorylation Sites in Human Viruses , 2010, Science Signaling.

[45]  Sergei Nekhai,et al.  HIV-1 Tat Interaction with RNA Polymerase II C-terminal Domain (CTD) and a Dynamic Association with CDK2 Induce CTD Phosphorylation and Transcription from HIV-1 Promoter* , 2002, The Journal of Biological Chemistry.

[46]  Eric Chevet,et al.  Phosphoprotein analysis: from proteins to proteomes , 2006, Proteome Science.

[47]  J. Piette,et al.  Varicella-Zoster Virus IE63 Protein Phosphorylation by Roscovitine-sensitive Cyclin-dependent Kinases Modulates Its Cellular Localization and Activity* , 2005, Journal of Biological Chemistry.

[48]  Michele Tinti,et al.  VirusMINT: a viral protein interaction database , 2008, Nucleic Acids Res..

[49]  S. Gygi,et al.  An iterative statistical approach to the identification of protein phosphorylation motifs from large-scale data sets , 2005, Nature Biotechnology.

[50]  George M. Church,et al.  Predicting Protein Post-translational Modifications Using Meta-analysis of Proteome Scale Data Sets*S , 2009, Molecular & Cellular Proteomics.

[51]  D. Laune,et al.  Phosphorylation of the HTLV-1 matrix L-domain-containing protein by virus-associated ERK-2 kinase. , 2006, Virology.

[52]  Aftab J. Ahmed,et al.  Phosphorylation of the Rex protein of human T-cell leukemia virus type I. , 1992, The Journal of biological chemistry.

[53]  Victor Neduva,et al.  Peptides mediating interaction networks: new leads at last. , 2006, Current opinion in biotechnology.

[54]  A. Marotta,et al.  Protein kinase CK2 is involved in G2 arrest and apoptosis following spindle damage in epithelial cells , 2002, Oncogene.

[55]  S. Karlin,et al.  Prediction of complete gene structures in human genomic DNA. , 1997, Journal of molecular biology.

[56]  Hans Bitter,et al.  Identification of a Kinase Profile that Predicts Chromosome Damage Induced by Small Molecule Kinase Inhibitors , 2009, PLoS Comput. Biol..

[57]  Gregory Shakhnarovich,et al.  Discovery of phosphorylation motif mixtures in phosphoproteomics data , 2008, Bioinform..

[58]  N. Blom,et al.  Prediction of post‐translational glycosylation and phosphorylation of proteins from the amino acid sequence , 2004, Proteomics.

[59]  B. Roizman,et al.  Protein kinase B/Akt is present in activated form throughout the entire replicative cycle of deltaU(S)3 mutant virus but only at early times after infection with wild-type herpes simplex virus 1. , 2006, Journal of virology.

[60]  Tzong-Yi Lee,et al.  Exploiting maximal dependence decomposition to identify conserved motifs from a group of aligned signal sequences , 2011, Bioinform..

[61]  F. Kashanchi,et al.  Inhibition of HTLV-1 transcription by cyclin dependent kinase inhibitors , 2002, Molecular and Cellular Biochemistry.