Proteomic analysis and prediction of human phosphorylation sites in subcellular level reveal subcellular specificity

MOTIVATION Protein phosphorylation is the most common post-translational modification (PTM) regulating major cellular processes through highly dynamic and complex signaling pathways. Large-scale comparative phosphoproteomic studies have frequently been done on whole cells or organs by conventional bottom-up mass spectrometry approaches, i.e at the phosphopeptide level. Using this approach, there is no way to know from where the phosphopeptide signal originated. Also, as a consequence of the scale of these studies, important information on the localization of phosphorylation sites in subcellular compartments (SCs) is not surveyed. RESULTS Here, we present a first account of the emerging field of subcellular phosphoproteomics where a support vector machine (SVM) approach was combined with a novel algorithm of discrete wavelet transform (DWT) to facilitate the identification of compartment-specific phosphorylation sites and to unravel the intricate regulation of protein phosphorylation. Our data reveal that the subcellular phosphorylation distribution is compartment type dependent and that the phosphorylation displays site-specific sequence motifs that diverge between SCs. AVAILABILITY AND IMPLEMENTATION The method and database both are available as a web server at: http://bioinfo.ncu.edu.cn/SubPhos.aspx. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

[1]  Mathias Dreger,et al.  Subcellular proteomics , 2021, Nature Reviews Methods Primers.

[2]  Michelle S. Scott,et al.  Global Survey of Organ and Organelle Protein Expression in Mouse: Combined Proteomic and Transcriptomic Profiling , 2006, Cell.

[3]  Xiaowei Zhao,et al.  Prediction of Protein Phosphorylation Sites by Using the Composition of k-Spaced Amino Acid Pairs , 2012, PloS one.

[4]  P. Rakic,et al.  Requirement of the JIP1 scaffold protein for stress-induced JNK activation. , 2001, Genes & development.

[5]  S. Gygi,et al.  An iterative statistical approach to the identification of protein phosphorylation motifs from large-scale data sets , 2005, Nature Biotechnology.

[6]  M. Mann,et al.  Global, In Vivo, and Site-Specific Phosphorylation Dynamics in Signaling Networks , 2006, Cell.

[7]  Fidel Ramírez,et al.  Computing topological parameters of biological networks , 2008, Bioinform..

[8]  Etienne Gagnon,et al.  Organelle proteomics: looking at less to see more. , 2004, Drug discovery today.

[9]  I. Chung,et al.  Discovery of Protein Phosphorylation Motifs through Exploratory Data Analysis , 2011, PloS one.

[10]  Pierre Thibault,et al.  Subcellular phosphoproteomics. , 2010, Mass spectrometry reviews.

[11]  P. Shannon,et al.  Cytoscape: a software environment for integrated models of biomolecular interaction networks. , 2003, Genome research.

[12]  J. Yates,et al.  Proteomics of organelles and large cellular structures , 2005, Nature Reviews Molecular Cell Biology.

[13]  Marc D. H. Hansen,et al.  Spatio-temporal regulation of Rac1 localization and lamellipodia dynamics during epithelial cell-cell adhesion. , 2002, Developmental cell.

[14]  S. Gammeltoft,et al.  Phosphoproteomics toolbox: Computational biology, protein chemistry and mass spectrometry , 2006, FEBS letters.

[15]  Dong Xu,et al.  Musite, a Tool for Global Prediction of General and Kinase-specific Phosphorylation Sites* , 2010, Molecular & Cellular Proteomics.

[16]  Anthony J. Kusalik,et al.  Computational prediction of eukaryotic phosphorylation sites , 2011, Bioinform..

[17]  T. Hunter,et al.  The Protein Kinase Complement of the Human Genome , 2002, Science.

[18]  K. Yoshioka,et al.  A Novel Mechanism of JNK1 Activation , 1997, The Journal of Biological Chemistry.

[19]  B. Snel,et al.  In-depth Qualitative and Quantitative Profiling of Tyrosine Phosphorylation Using a Combination of Phosphopeptide Immunoaffinity Purification and Stable Isotope Dimethyl Labeling* , 2009, Molecular & Cellular Proteomics.

[20]  R. Murphy,et al.  Automated subcellular location determination and high-throughput microscopy. , 2007, Developmental cell.

[21]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[22]  P. H. Cameron,et al.  Organelle-specific phosphorylation. Identification of unique membrane phosphoproteins of the endoplasmic reticulum and endosomal apparatus. , 1993, The Journal of biological chemistry.

[23]  M. Gerstein,et al.  A Bayesian Networks Approach for Predicting Protein-Protein Interactions from Genomic Data , 2003, Science.

[24]  M. Graham,et al.  Differential phosphorylation of dynamin I isoforms in subcellular compartments demonstrates the hidden complexity of phosphoproteomes. , 2010, Journal of proteome research.

[25]  Daniel Figeys,et al.  Analysis of the subcellular phosphoproteome using a novel phosphoproteomic reactor. , 2010, Journal of proteome research.

[26]  Vladimir Vapnik,et al.  An overview of statistical learning theory , 1999, IEEE Trans. Neural Networks.

[27]  E. Krebs,et al.  Consensus sequences as substrate specificity determinants for protein kinases and protein phosphatases. , 1991, The Journal of biological chemistry.

[28]  T. Barrette,et al.  Probabilistic model of the human protein-protein interaction network , 2005, Nature Biotechnology.

[29]  Nathalie Japkowicz,et al.  The class imbalance problem: A systematic study , 2002, Intell. Data Anal..

[30]  Zhang Qiang,et al.  Maximum Spectrum of Continuous Wavelet Transform and Its Application in Resolving an Overlapped Signal , 2004, J. Chem. Inf. Model..

[31]  Li Zhang,et al.  Identify submitochondria and subchloroplast locations with pseudo amino acid composition: approach from the strategy of discrete wavelet transform feature extraction. , 2011, Biochimica et biophysica acta.

[32]  Damian Szklarczyk,et al.  STRING v9.1: protein-protein interaction networks, with increased coverage and integration , 2012, Nucleic Acids Res..

[33]  Jian-Ding Qiu,et al.  Using support vector machines for prediction of protein structural classes based on discrete wavelet transform , 2009, J. Comput. Chem..

[34]  Y. Ueno,et al.  Prediction of spalling on a ball bearing by applying the discrete wavelet transform to vibration signals , 1996 .