Deep learning based prediction of species-specific protein S-glutathionylation sites.

As a widespread and reversible post-translational modification of proteins, S-glutathionylation specifically generates the mixed disulfides between cysteine residues and glutathione, which regulates various biological processes including oxidative stress, nitrosative stress and signal transduction. The identification of proteins and specific sites that undergo S-glutathionylation is crucial for understanding the underlying mechanisms and regulatory effects of S-glutathionylation. Experimental identification of S-glutathionylation sites is laborious and time-consuming, whereas computational predictions are more attractive due to their high speed and convenience. Here, we developed a novel computational framework DeepGSH (http://deepgsh.cancerbio.info/) for species-specific S-glutathionylation sites prediction, based on deep learning and particle swarm optimization algorithms. 5-fold cross validation indicated that DeepGSH was able to achieve an AUC of 0.8393 and 0.8458 for Homo sapiens and Mus musculus. According to critical evaluation and comparison, DeepGSH showed excellent robustness and better performance than existing tools in both species, demonstrating DeepGSH was suitable for S-glutathionylation prediction. The prediction results of DeepGSH might provide guidance for experimental validation of S-glutathionylation sites and helpful information to understand the intrinsic mechanisms.

[1]  Tomas Bergman,et al.  Determination of site-specificity of S-glutathionylated cellular proteins. , 2005, Biochemical and biophysical research communications.

[2]  Yu Li,et al.  Deep learning in bioinformatics: Introduction, application, and perspective in the big data era. , 2019, Methods.

[3]  Zsuzsanna Dosztányi,et al.  IUPred: web server for the prediction of intrinsically unstructured regions of proteins based on estimated energy content , 2005, Bioinform..

[4]  Daniel Rueckert,et al.  Deep learning cardiac motion analysis for human survival prediction , 2018, Nature Machine Intelligence.

[5]  P. Ghezzi Protein glutathionylation in health and disease. , 2013, Biochimica et biophysica acta.

[6]  Xiaobo Zhou,et al.  Deep learning of the splicing (epi)genetic code reveals a novel candidate mechanism linking histone modifications to ESC fate decision , 2017, bioRxiv.

[7]  Q. Ning,et al.  Identification of S-glutathionylation sites in species-specific proteins by incorporating five sequence-derived features into the general pseudo-amino acid composition. , 2016, Journal of theoretical biology.

[8]  J. Mieyal,et al.  Posttranslational modification of cysteine in redox signaling and oxidative stress: Focus on s-glutathionylation. , 2012, Antioxidants & redox signaling.

[9]  T. Petersen,et al.  A generic method for assignment of reliability scores applied to solvent accessibility predictions , 2009, BMC Structural Biology.

[10]  María Martín,et al.  UniProt: A hub for protein information , 2015 .

[11]  Chuan-Yun Li,et al.  KOBAS 2.0: a web server for annotation and identification of enriched pathways and diseases , 2011, Nucleic Acids Res..

[12]  A. Pastore,et al.  S-Glutathionylation signaling in cell biology: progress and prospects. , 2012, European journal of pharmaceutical sciences : official journal of the European Federation for Pharmaceutical Sciences.

[13]  D. Giustarini,et al.  Protein S-glutathionylation: a regulatory device from bacteria to humans. , 2009, Trends in biochemical sciences.

[14]  Yu Xue,et al.  DeepPhagy: a deep learning framework for quantitatively measuring autophagy activity in Saccharomyces cerevisiae , 2020, Autophagy.

[15]  Tzong-Yi Lee,et al.  Incorporating Distant Sequence Features and Radial Basis Function Networks to Identify Ubiquitin Conjugation Sites , 2011, PloS one.

[16]  Yu-Ju Chen,et al.  dbGSH: a database of S-glutathionylation , 2014, Bioinform..

[17]  Roberto Colombo,et al.  Molecular mechanisms and potential clinical significance of S-glutathionylation. , 2008, Antioxidants & redox signaling.

[18]  Minghao Yin,et al.  PGluS: prediction of protein S-glutathionylation sites with multiple features and analysis. , 2015, Molecular bioSystems.

[19]  Xing-Ming Zhao,et al.  Prediction of S-Glutathionylation Sites Based on Protein Sequences , 2013, PloS one.

[20]  Geoffrey I. Webb,et al.  Cascleave: towards more accurate prediction of caspase substrate cleavage sites , 2010, Bioinform..

[21]  Vladimir Vacic,et al.  Two Sample Logo: a graphical representation of the differences between two sets of sequence alignments , 2006, Bioinform..

[22]  May D. Wang,et al.  DeeperBind: Enhancing Prediction of Sequence Specificities of DNA Binding Proteins , 2016, bioRxiv.

[23]  V. Vacic,et al.  Identification, analysis, and prediction of protein ubiquitination sites , 2010, Proteins.

[24]  Xiaowei Zhao,et al.  Prediction of Protein Phosphorylation Sites by Using the Composition of k-Spaced Amino Acid Pairs , 2012, PloS one.

[25]  Tzong-Yi Lee,et al.  GSHSite: Exploiting an Iteratively Statistical Method to Identify S-Glutathionylation Sites with Substrate Specificity , 2015, PloS one.

[26]  A. Holmgren,et al.  Identification of S-glutathionylated cellular proteins during oxidative stress and constitutive metabolism by affinity purification and proteomic analysis. , 2002, Archives of biochemistry and biophysics.

[27]  Hsien-Da Huang,et al.  dbPTM in 2019: exploring disease association and cross-talk of post-translational modifications , 2018, Nucleic Acids Res..

[28]  M. Gallogly,et al.  Mechanisms of reversible protein glutathionylation in redox signaling and oxidative stress. , 2007, Current opinion in pharmacology.