Computational Modeling of Lysine Post-Translational Modification: An Overview

Living organisms have a magnificent ordered and complex structure. In regulating the cellular functions, post-translational modifications (PTMs) are critical molecular measures. They alter protein conformation, modulating their activity, stability and localization. Up to date, more than 300 types of PTMs are experimentally discovered in vivo and in vitro pathways [1,2]. Major and common PTMs are methylation, ubiquitination, succinylation, phosphorylation, glycosylation, acetylation, and sumoylation. PTM is a biological mechanism common to both prokaryotic and eukaryotic organisms, which controls the protein functions and stability or the proteolytic cleavage of regulatory subunits and affects all aspects of cellular life. The PTM of a protein can also determine the cell signaling state, turnover, localization, and interactions with other proteins [3]. Therefore, the analysis of proteins and their PTMs are particularly important for the study of heart disease, cancer, neurodegenerative diseases and diabetes [4,5]. Since the characterization of PTMs gets invaluable insight into the cellular functions in etiological processes, there are still challenges. Specifically, the major challenges in studying PTMs are the development of specific detection and purification methods.

[1]  N. Blom,et al.  Prediction of post‐translational glycosylation and phosphorylation of proteins from the amino acid sequence , 2004, Proteomics.

[2]  D. Umlauf,et al.  Site-specific analysis of histone methylation and acetylation. , 2004, Methods in molecular biology.

[3]  Liam J. McGuffin,et al.  The PSIPRED protein structure prediction server , 2000, Bioinform..

[4]  Dongsup Kim,et al.  SCUD: Saccharomyces Cerevisiae Ubiquitination Database , 2008, BMC Genomics.

[5]  Ashok Sharma,et al.  Type 2 diabetes mellitus: phylogenetic motifs for predicting protein functional sites , 2007, Journal of Biosciences.

[6]  Jinyan Li,et al.  Computational Identification of Protein Pupylation Sites by Using Profile-Based Composition of k-Spaced Amino Acid Pairs , 2015, PloS one.

[7]  K. Resing,et al.  Mapping protein post-translational modifications with mass spectrometry , 2007, Nature Methods.

[8]  Y. Zhang,et al.  Influence of succinylation on physicochemical property of yak casein micelles. , 2016, Food chemistry.

[9]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[10]  Changjiang Jin,et al.  CSS-Palm 2.0: an updated software for palmitoylation sites prediction. , 2008, Protein engineering, design & selection : PEDS.

[11]  Zexian Liu,et al.  GPS-YNO2: computational prediction of tyrosine nitration sites in proteins. , 2011, Molecular bioSystems.

[12]  Yu Xue,et al.  CPLM: a database of protein lysine modifications , 2013, Nucleic Acids Res..

[13]  Chun-Yuan Chen,et al.  Covalent Small Ubiquitin-like Modifier (SUMO) Modification of Maf1 Protein Controls RNA Polymerase III-dependent Transcription Repression* , 2013, The Journal of Biological Chemistry.

[14]  Hsien-Da Huang,et al.  dbPTM 2016: 10-year anniversary of a resource for post-translational modification of proteins , 2015, Nucleic Acids Res..

[15]  Wei Li,et al.  SysPTM 2.0: an updated systematic resource for post-translational modification , 2014, Database J. Biol. Databases Curation.

[16]  Avram Hershko,et al.  The ubiquitin system for protein degradation and some of its roles in the control of the cell-division cycle (Nobel lecture). , 2005, Angewandte Chemie.

[17]  Katalin F Medzihradszky,et al.  Peptide sequence analysis. , 2005, Methods in enzymology.

[18]  R. Sheppard,et al.  Feline gastrin. An example of peptide sequence analysis by mass spectrometry. , 1969, Journal of the American Chemical Society.

[19]  Alejandro Garcia,et al.  UbiProt: a database of ubiquitylated proteins , 2007, BMC Bioinformatics.

[20]  J. Boeke,et al.  Lysine Succinylation and Lysine Malonylation in Histones* , 2012, Molecular & Cellular Proteomics.

[21]  Jaclyn R. Gareau,et al.  Determinants of Small Ubiquitin-like Modifier 1 (SUMO1) Protein Specificity, E3 Ligase, and SUMO-RanGAP1 Binding Activities of Nucleoporin RanBP2* , 2011, The Journal of Biological Chemistry.

[22]  C. Sander,et al.  Correlated mutations and residue contacts in proteins , 1994, Proteins.

[23]  Vikram Pudi,et al.  RBNBC: Repeat Based Naive Bayes Classifier for Biological Sequences , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[24]  Richard W. Aldrich,et al.  A perturbation-based method for calculating explicit likelihood of evolutionary co-variance in multiple sequence alignments , 2004, Bioinform..

[25]  Ming Lu,et al.  hUbiquitome: a database of experimentally verified ubiquitination cascades in humans , 2011, Database J. Biol. Databases Curation.

[26]  Zhiqiang Ma,et al.  Accurate in silico identification of protein succinylation sites using an iterative semi-supervised learning technique. , 2015, Journal of theoretical biology.

[27]  B. Rost,et al.  Identifying cysteines and histidines in transition‐metal‐binding sites using support vector machines and neural networks , 2006, Proteins.

[28]  Philippe Bogaerts,et al.  Fast and accurate predictions of protein stability changes upon mutations using statistical potentials and neural networks: PoPMuSiC-2.0 , 2009, Bioinform..

[29]  Chun-Wei Tung,et al.  PupDB: a database of pupylated proteins , 2012, BMC Bioinformatics.

[30]  Shinn-Ying Ho,et al.  Computational identification of ubiquitylation sites from protein sequences , 2008, BMC Bioinformatics.

[31]  Christodoulos A. Floudas,et al.  Proteome-wide post-translational modification statistics: frequency analysis and curation of the swiss-prot database , 2011, Scientific reports.

[32]  Moinuddin,et al.  Glycated Lysine Residues: A Marker for Non-Enzymatic Protein Glycation in Age-Related Diseases , 2011, Disease markers.

[33]  Sebastian A. Wagner,et al.  Lysine succinylation is a frequently occurring modification in prokaryotes and eukaryotes and extensively overlaps with acetylation. , 2013, Cell reports.

[34]  A. Burlingame,et al.  Electron transfer dissociation (ETD): The mass spectrometric breakthrough essential for O‐GlcNAc protein site assignments—a study of the O‐GlcNAcylated protein Host Cell Factor C1 , 2013, Proteomics.

[35]  Byung-Jun Yoon,et al.  Hidden Markov Models and their Applications in Biological Sequence Analysis , 2009, Current genomics.

[36]  Anders Krogh,et al.  Hidden Markov models for sequence analysis: extension and analysis of the basic method , 1996, Comput. Appl. Biosci..

[37]  C. Sander,et al.  Correlated Mutations and Residue Contacts , 1994 .

[38]  Ping Wang,et al.  Structural Insight into Coordinated Recognition of Trimethylated Histone H3 Lysine 9 (H3K9me3) by the Plant Homeodomain (PHD) and Tandem Tudor Domain (TTD) of UHRF1 (Ubiquitin-like, Containing PHD and RING Finger Domains, 1) Protein* , 2012, The Journal of Biological Chemistry.

[39]  Kunihiko Fukushima,et al.  Cognitron: A self-organizing multilayered neural network , 1975, Biological Cybernetics.

[40]  Eran Segal,et al.  Proteome-wide prediction of acetylation substrates , 2009, Proceedings of the National Academy of Sciences.

[41]  Wei Liu,et al.  First succinyl-proteome profiling of extensively drug-resistant Mycobacterium tuberculosis revealed involvement of succinylation in cellular physiology. , 2015, Journal of proteome research.

[42]  Md. Nurul Haque Mollah,et al.  SuccinSite: a computational tool for the prediction of protein succinylation sites by exploiting the amino acid patterns and properties. , 2016, Molecular bioSystems.

[43]  Lennart Martens,et al.  Protein structure as a means to triage proposed PTM sites , 2013, Proteomics.

[44]  Hiroyuki Kurata,et al.  Computational identification of protein S-sulfenylation sites by incorporating the multiple sequence features information. , 2017, Molecular bioSystems.

[45]  D. Hand,et al.  Idiot's Bayes—Not So Stupid After All? , 2001 .

[46]  M. Mann,et al.  Decoding signalling networks by mass spectrometry-based proteomics , 2010, Nature Reviews Molecular Cell Biology.

[47]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[48]  Dianjing Guo,et al.  A systematic identification of species-specific protein succinylation sites using joint element features information , 2017, International journal of nanomedicine.

[49]  Yong-Zi Chen,et al.  GANNPhos: a new phosphorylation site predictor based on a genetic algorithm integrated neural network. , 2007, Protein engineering, design & selection : PEDS.

[50]  A. Mirsky,et al.  ACETYLATION AND METHYLATION OF HISTONES AND THEIR POSSIBLE ROLE IN THE REGULATION OF RNA SYNTHESIS. , 1964, Proceedings of the National Academy of Sciences of the United States of America.

[51]  S. Berger,et al.  The emerging field of dynamic lysine methylation of non-histone proteins. , 2008, Current opinion in genetics & development.

[52]  Ronenn Roubenoff,et al.  Convergent Random Forest predictor: methodology for predicting drug response from genome-scale data applied to anti-TNF response. , 2009, Genomics.

[53]  M. Kirschner,et al.  Post‐Translational Modification Profiling—a High‐Content Assay for Identifying Protein Modifications in Mammalian Cellular Systems , 2014, Current protocols in protein science.

[54]  Melanie Y. White,et al.  Functional decorations: post-translational modifications and heart disease delineated by targeted proteomics , 2013, Genome Medicine.

[55]  J. Shabanowitz,et al.  Peptide and protein sequence analysis by electron transfer dissociation mass spectrometry. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[56]  Zhihong Zhang,et al.  Identification of lysine succinylation as a new post-translational modification. , 2011, Nature chemical biology.

[57]  G. Nelsestuen,et al.  Amino-terminal alanine functions in a calcium-specific process essential for membrane binding by prothrombin fragment 1. , 1988, Biochemistry.

[58]  A. Burlingame,et al.  Mass Spectrometry-Based Detection and Assignment of Protein Posttranslational Modifications , 2014, ACS chemical biology.

[59]  Hodong Lee,et al.  E3Net: A System for Exploring E3-mediated Regulatory Networks of Cellular Functions* , 2011, Molecular & Cellular Proteomics.

[60]  Olaf Wolkenhauer,et al.  Naïve Bayes classifier predicts functional microRNA target interactions in colorectal cancer. , 2015, Molecular bioSystems.

[61]  M. Sutter,et al.  Bacterial ubiquitin-like modifier Pup is deamidated and conjugated to substrates by distinct but homologous enzymes , 2009, Nature Structural &Molecular Biology.

[62]  P. Thompson,et al.  Chemical and biological methods to detect post‐translational modifications of arginine , 2014, Biopolymers.

[63]  R. Ranganathan,et al.  Evolutionarily conserved pathways of energetic connectivity in protein families. , 1999, Science.

[64]  Edward L. Huttlin,et al.  Systematic and quantitative assessment of the ubiquitin-modified proteome. , 2011, Molecular cell.

[65]  Takeshi Tomonaga,et al.  Identification of missing proteins in the neXtProt database and unregistered phosphopeptides in the PhosphoSitePlus database as part of the Chromosome-centric Human Proteome Project. , 2013, Journal of proteome research.

[66]  P. Radivojac,et al.  Evaluation of features for catalytic residue prediction in novel folds , 2007 .

[67]  G. Demartino PUPylation: something old, something new, something borrowed, something Glu. , 2009, Trends in biochemical sciences.

[68]  Paul Tempst,et al.  Protein S-nitrosylation: a physiological signal for neuronal nitric oxide , 2001, Nature Cell Biology.

[69]  B. Cravatt,et al.  Functional Lysine Modification by an Intrinsically Reactive Primary Glycolytic Metabolite , 2013, Science.

[70]  Senén Barro,et al.  Do we need hundreds of classifiers to solve real world classification problems? , 2014, J. Mach. Learn. Res..

[71]  Shao-Ping Shi,et al.  SuccFind: a novel succinylation sites online prediction tool via enhanced characteristic strategy , 2015, Bioinform..

[72]  Tao Zhou,et al.  mUbiSiDa: A Comprehensive Database for Protein Ubiquitination Sites in Mammals , 2014, PloS one.