SNOSite: Exploiting Maximal Dependence Decomposition to Identify Cysteine S-Nitrosylation with Substrate Site Specificity

S-nitrosylation, the covalent attachment of a nitric oxide to (NO) the sulfur atom of cysteine, is a selective and reversible protein post-translational modification (PTM) that regulates protein activity, localization, and stability. Despite its implication in the regulation of protein functions and cell signaling, the substrate specificity of cysteine S-nitrosylation remains unknown. Based on a total of 586 experimentally identified S-nitrosylation sites from SNAP/L-cysteine-stimulated mouse endothelial cells, this work presents an informatics investigation on S-nitrosylation sites including structural factors such as the flanking amino acids composition, the accessible surface area (ASA) and physicochemical properties, i.e. positive charge and side chain interaction parameter. Due to the difficulty to obtain the conserved motifs by conventional motif analysis, maximal dependence decomposition (MDD) has been applied to obtain statistically significant conserved motifs. Support vector machine (SVM) is applied to generate predictive model for each MDD-clustered motif. According to five-fold cross-validation, the MDD-clustered SVMs could achieve an accuracy of 0.902, and provides a promising performance in an independent test set. The effectiveness of the model was demonstrated on the correct identification of previously reported S-nitrosylation sites of Bos taurus dimethylarginine dimethylaminohydrolase 1 (DDAH1) and human hemoglobin subunit beta (HBB). Finally, the MDD-clustered model was adopted to construct an effective web-based tool, named SNOSite (http://csb.cse.yzu.edu.tw/SNOSite/), for identifying S-nitrosylation sites on the uncharacterized protein sequences.

[1]  M. Oobatake,et al.  An analysis of non-bonded energy of proteins. , 1977, Journal of theoretical biology.

[2]  Chih-Jen Lin,et al.  Combining SVMs with Various Feature Selection Strategies , 2006, Feature Extraction.

[3]  T. Iwasaka,et al.  [Establishment and characterization of a cell line derived from mucinous cystadenocarcinoma of human ovary]. , 1985, Nihon Sanka Fujinka Gakkai zasshi.

[4]  C. Chothia The nature of the accessible and buried surfaces in proteins. , 1976, Journal of molecular biology.

[5]  J. Stamler,et al.  The SNO-proteome: causation and classifications. , 2011, Current opinion in chemical biology.

[6]  M. Wilkins,et al.  Surface accessibility of protein post-translational modifications. , 2007, Journal of proteome research.

[7]  S. Rackovsky,et al.  Empirical Studies of Hydrophobicity. 1. Effect of Protein Size on the Hydrophobic Behavior of Amino Acids , 1980 .

[8]  G. Fasman,et al.  Protein conformational prediction. , 1989, Trends in biochemical sciences.

[9]  Shuk-Mei Ho,et al.  Comprehensive Identification and Modified-Site Mapping of S-Nitrosylated Targets in Prostate Epithelial Cells , 2010, PloS one.

[10]  Leonardo Nogueira,et al.  Proteomic analysis of S-nitrosylation and denitrosylation by resin-assisted capture , 2009, Nature Biotechnology.

[11]  E. Sheta,et al.  Proteomic analysis reveals novel protein targets of S-nitrosylation in mesangial cells. , 2004, Contributions to nephrology.

[12]  A. Finkelstein,et al.  Physical reasons for secondary structure stability: alpha-helices in short peptides. , 1991, Proteins.

[13]  O. Ptitsyn,et al.  Physical reasons for secondary structure stability: α‐Helices in short peptides , 1991 .

[14]  W. R. Krigbaum,et al.  Local interactions as a structure determinant for globular proteins. , 1971, Biochimica et biophysica acta.

[15]  S. Gross,et al.  Balancing reactivity against selectivity: the evolution of protein S-nitrosylation as an effector of cell signaling by nitric oxide. , 2007, Cardiovascular research.

[16]  Wei-Chi Ku,et al.  S-alkylating labeling strategy for site-specific identification of the s-nitrosoproteome. , 2010, Journal of proteome research.

[17]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[18]  S. Karlin,et al.  Prediction of complete gene structures in human genomic DNA. , 1997, Journal of molecular biology.

[19]  R. Sack,et al.  Zn(II)-free Dimethylargininase-1 (DDAH-1) Is Inhibited upon Specific Cys-S-Nitrosylation* , 2003, The Journal of Biological Chemistry.

[20]  M. Oobatake,et al.  Molecular orientation of plastocyanin on spinach thylakoid membranes as determined by acetylation of lysine residues. , 1985, Journal of biochemistry.

[21]  Paul Tempst,et al.  Protein S-nitrosylation: a physiological signal for neuronal nitric oxide , 2001, Nature Cell Biology.

[22]  Thomas L. Madden,et al.  BLAST 2 Sequences, a new tool for comparing protein and nucleotide sequences. , 1999, FEMS microbiology letters.

[23]  K. Chou,et al.  Recent progress in protein subcellular location prediction. , 2007, Analytical biochemistry.

[24]  A. L. Weber,et al.  Genetic code correlations: Amino acids and their anticodon nucleotides , 1978, Journal of Molecular Evolution.

[25]  P. Karplus,et al.  Prediction of chain flexibility in proteins , 1985, Naturwissenschaften.

[26]  Eric J. Toone,et al.  (S)NO Signals: Translocation, Regulation, and a Consensus Motif , 1997, Neuron.

[27]  Shandar Ahmad,et al.  RVP-net: online prediction of real valued accessible surface area of proteins from single sequences , 2003, Bioinform..

[28]  G. Crooks,et al.  WebLogo: a sequence logo generator. , 2004, Genome research.

[29]  H. E. Marshall,et al.  Protein S-nitrosylation: purview and parameters , 2005, Nature Reviews Molecular Cell Biology.

[30]  S. Gygi,et al.  An iterative statistical approach to the identification of protein phosphorylation motifs from large-scale data sets , 2005, Nature Biotechnology.

[31]  E. Sheta,et al.  Proteomic Analysis of S-Nitrosylated Proteins in Mesangial Cells * , 2003, Molecular & Cellular Proteomics.

[32]  H. Guy Amino acid side-chain partition energies and distribution of residues in soluble proteins. , 1985, Biophysical journal.

[33]  A. Godzik,et al.  S-Nitrosylation of Drp1 Mediates β-Amyloid-Related Mitochondrial Fission and Neuronal Injury , 2009, Science.

[34]  Vladimir Vacic,et al.  Two Sample Logo: a graphical representation of the differences between two sets of sequence alignments , 2006, Bioinform..

[35]  Tzong-Yi Lee,et al.  Exploiting maximal dependence decomposition to identify conserved motifs from a group of aligned signal sequences , 2011, Bioinform..

[36]  M A Roseman,et al.  Hydrophilicity of polar amino acid side-chains is markedly reduced by flanking peptide bonds. , 1988, Journal of molecular biology.

[37]  A. Maritan,et al.  A knowledge‐based scale for amino acid membrane propensity , 2002, Proteins.

[38]  M. Gromiha,et al.  Real value prediction of solvent accessibility from amino acid sequence , 2003, Proteins.

[39]  J. Stamler,et al.  S-nitrosohaemoglobin: a dynamic activity of blood involved in vascular control , 1996, Nature.

[40]  S. Gross,et al.  S-Nitrosylation Is Emerging as a Specific and Fundamental Posttranslational Protein Modification: Head-to-Head Comparison with O-Phosphorylation , 2001, Science's STKE.

[41]  Minoru Kanehisa,et al.  AAindex: amino acid index database, progress report 2008 , 2007, Nucleic Acids Res..

[42]  S. Rackovsky,et al.  Differential geometry and polymer conformation. 4. Conformational and nucleation properties of individual amino acids , 1982 .

[43]  Allan Doctor,et al.  S-nitrosothiol signaling in respiratory biology. , 2006, American journal of respiratory and critical care medicine.

[44]  Ebru Karpuzoglu,et al.  Estrogen regulation of nitric oxide and inducible nitric oxide synthase (iNOS) in immune cells: implications for immunity, autoimmune diseases, and apoptosis. , 2006, Nitric oxide : biology and chemistry.

[45]  Fabien Campagne,et al.  SNOSID, a proteomic method for identification of cysteine S-nitrosylation sites in complex protein mixtures. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[46]  Takashi Uehara,et al.  Nitrosative stress linked to sporadic Parkinson's disease: S-nitrosylation of parkin regulates its E3 ubiquitin ligase activity. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[47]  A. Komoriya,et al.  Local interactions as a structure determinant for protein molecules: III. , 1979, Biochimica et biophysica acta.

[48]  M. Ashburner,et al.  Gene Ontology: tool for the unification of biology , 2000, Nature Genetics.

[49]  T. D. Schneider,et al.  Sequence logos: a new way to display consensus sequences. , 1990, Nucleic acids research.

[50]  M. Levitt,et al.  Conformation of amino acid side-chains in proteins. , 1978, Journal of molecular biology.

[51]  P. Tripathi Nitric oxide and immune response. , 2007, Indian journal of biochemistry & biophysics.

[52]  M. Levitt A simplified representation of protein conformations for rapid simulation of protein folding. , 1976, Journal of molecular biology.

[53]  Jorng-Tzong Horng,et al.  Incorporating support vector machine for identifying protein tyrosine sulfation sites , 2009, J. Comput. Chem..

[54]  Jörg Durner,et al.  Proteomic Identification of S-Nitrosylated Proteins in Arabidopsis1[w] , 2005, Plant Physiology.

[55]  Daniel C Liebler,et al.  Identification of S-nitrosylation motifs by site-specific mapping of the S-nitrosocysteine proteome in human vascular smooth muscle cells. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[56]  L. Kier,et al.  Amino acid side chain parameters for correlation studies in biology and pharmacology. , 2009, International journal of peptide and protein research.

[57]  A. Komoriya,et al.  Local interactions as a structure determinant for protein molecules: II. , 1979, Biochimica et biophysica acta.

[58]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[59]  V. Gladyshev,et al.  Structural analysis of cysteine S-nitrosylation: a modified acid-based motif and the emerging role of trans-nitrosylation. , 2010, Journal of molecular biology.

[60]  Stuart A. Lipton,et al.  Cell death: protein misfolding and neurodegenerative diseases , 2009, Apoptosis.

[61]  Jorng-Tzong Horng,et al.  Incorporating structural characteristics for identification of protein methylation sites , 2009, J. Comput. Chem..

[62]  P. Roepstorff,et al.  Proteomic analysis of S‐nitrosylated proteins in Arabidopsis thaliana undergoing hypersensitive response , 2008, Proteomics.

[63]  Santiago Lamas,et al.  Nitrosylation The Prototypic Redox-Based Signaling Mechanism , 2001, Cell.

[64]  Shinn-Ying Ho,et al.  Computational identification of ubiquitylation sites from protein sequences , 2008, BMC Bioinformatics.

[65]  Zexian Liu,et al.  GPS-SNO: Computational Prediction of Protein S-Nitrosylation Sites with a Modified GPS Algorithm , 2010, PloS one.