A method to distinguish between lysine acetylation and lysine methylation from protein sequences.

Lysine acetylation and methylation are two major post-translational modifications of lysine residues. They play vital roles in both biological and pathological processes. Specific lysine residues in H3 histone protein tails appear to be targeted for either acetylation or methylation. Hence it is very challenging to distinguish between acetylated and methylated lysine residues using computational methods. This work presents a method that incorporates protein sequence information, secondary structure and amino acid properties to differentiate acetyl-lysine from methyl-lysine. We apply an encoding scheme based on grouped weight and position weight amino acid composition to extract sequence information and physicochemical properties around lysine sites. The proposed method achieves an accuracy of 93.3% using a jackknife test. Feature analysis demonstrates that the prediction model with multiple features can take full advantage of the supplementary information from different features to improve classification performance and prediction robustness. Analysis of the characteristics of lysine residues which can be either methylated or acetylated shows that they are more similar to methyl-lysine than to acetyl-lysine.

[1]  Brian C. Smith,et al.  Chemical mechanisms of histone lysine and arginine modifications. , 2009, Biochimica et biophysica acta.

[2]  Rachael P. Huntley,et al.  QuickGO: a web-based tool for Gene Ontology searching , 2009, Bioinform..

[3]  Hsien-Da Huang,et al.  N‐Ace: Using solvent accessibility and physicochemical properties to identify protein N‐acetylation sites , 2010, J. Comput. Chem..

[4]  Brian K. Kennedy,et al.  Sirtuins in Aging and Age-Related Disease , 2006, Cell.

[5]  Dong Xu,et al.  Computational Identification of Protein Methylation Sites through Bi-Profile Bayes Feature Extraction , 2009, PloS one.

[6]  Jaques Reifman,et al.  Support vector machines with selective kernel scaling for protein classification and identification of key amino acid positions , 2002, Bioinform..

[7]  Jorng-Tzong Horng,et al.  Incorporating structural characteristics for identification of protein methylation sites , 2009, J. Comput. Chem..

[8]  Takehiro Suzuki,et al.  Mitochondrial localization of fission yeast manganese superoxide dismutase is required for its lysine acetylation and for cellular stress resistance and respiratory growth. , 2011, Biochemical and biophysical research communications.

[9]  Eran Segal,et al.  Proteome-wide prediction of acetylation substrates , 2009, Proceedings of the National Academy of Sciences.

[10]  Ronald Breslow,et al.  Dimethyl sulfoxide to vorinostat: development of this histone deacetylase inhibitor as an anticancer drug , 2007, Nature Biotechnology.

[11]  J. Nieto,et al.  Use of fuzzy clustering technique and matrices to classify amino acids and its impact to Chou's pseudo amino acid composition. , 2009, Journal of theoretical biology.

[12]  Zhen-Hui Zhang,et al.  A novel method for apoptosis protein subcellular localization prediction combining encoding based on grouped weight and support vector machine , 2006, FEBS letters.

[13]  Qing Zhang,et al.  The RCSB Protein Data Bank: a redesigned query system and relational database based on the mmCIF schema , 2004, Nucleic Acids Res..

[14]  Steven P. Gygi,et al.  Stress-Dependent Regulation of FOXO Transcription Factors by the SIRT1 Deacetylase , 2004, Science.

[15]  Li Zhang,et al.  Identify submitochondria and subchloroplast locations with pseudo amino acid composition: approach from the strategy of discrete wavelet transform feature extraction. , 2011, Biochimica et biophysica acta.

[16]  Jianding Qiu,et al.  Prediction of G-protein-coupled receptor classes based on the concept of Chou's pseudo amino acid composition: an approach from discrete wavelet transform. , 2009, Analytical biochemistry.

[17]  Minoru Yoshida,et al.  Proteomic analysis of organ‐specific post‐translational lysine‐acetylation and ‐methylation in mice by use of anti‐acetyllysine and ‐methyllysine mouse monoclonal antibodies , 2005, Proteomics.

[18]  Yu Shyr,et al.  Improved prediction of lysine acetylation by support vector machines. , 2009, Protein and peptide letters.

[19]  M. Mann,et al.  Lysine Acetylation Targets Protein Complexes and Co-Regulates Major Cellular Functions , 2009, Science.

[20]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[21]  Brian D Strahl,et al.  Role of protein methylation in regulation of transcription. , 2005, Endocrine reviews.

[22]  R. Marmorstein,et al.  Structure and function of histone acetyltransferases , 2001, Cellular and Molecular Life Sciences CMLS.

[23]  J. Denu,et al.  The Sirtuin family: therapeutic targets to treat diseases of aging. , 2008, Current opinion in chemical biology.

[24]  E. Kalkhoven,et al.  Aberrant forms of histone acetyltransferases in human disease. , 2007, Sub-cellular biochemistry.

[25]  G. Crooks,et al.  WebLogo: a sequence logo generator. , 2004, Genome research.

[26]  Xiang-Jiao Yang,et al.  Lysine acetylation and the bromodomain: a new partnership for signaling , 2004, BioEssays : news and reviews in molecular, cellular and developmental biology.

[27]  Predrag Radivojac,et al.  Intrinsic Disorder and Prote in Modifications: Building an SVM Predictor for Methylation , 2005, 2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology.

[28]  Kuo-Chen Chou,et al.  Support vector machines for prediction of protein signal sequences and their cleavage sites , 2003, Peptides.

[29]  N. Grishin,et al.  Substrate and functional diversity of lysine acetylation revealed by a proteomics survey. , 2006, Molecular cell.

[30]  Mark Gerstein,et al.  Systematic evaluation of variability in ChIP-chip experiments using predefined DNA targets. , 2008, Genome research.

[31]  Changjiang Jin,et al.  Prediction of N e -acetylation on internal lysines implemented in Bayesian Discriminant Method , 2006 .

[32]  M. Ismond,et al.  Effects of acetylation and succinylation on the physicochemical properties of the canola 12S globulin. Part I , 1997 .

[33]  Yixue Li,et al.  Regulation of Cellular Metabolism by Protein Lysine Acetylation , 2010, Science.

[34]  Yu Xue,et al.  MeMo: a web tool for prediction of protein methylation modifications , 2006, Nucleic Acids Res..

[35]  Minoru Yoshida,et al.  HDAC6 a new cellular stress surveillance factor , 2008, Cell cycle.

[36]  K. Chou,et al.  Cell-PLoc: a package of Web servers for predicting subcellular localization of proteins in various organisms , 2008, Nature Protocols.

[37]  Shao-Ping Shi,et al.  PLMLA: prediction of lysine methylation and lysine acetylation by combining multiple features. , 2012, Molecular bioSystems.

[38]  C. Allis,et al.  Histone methylation versus histone acetylation: new insights into epigenetic regulation. , 2001, Current opinion in cell biology.

[39]  Florian Gnad,et al.  Predicting post-translational lysine acetylation using support vector machines , 2010, Bioinform..

[40]  Loris Nanni,et al.  An ensemble of reduced alphabets with protein encoding based on grouped weight for predicting DNA-binding proteins , 2008, Amino Acids.

[41]  J. Ausió,et al.  Acetylation Increases the α-Helical Content of the Histone Tails of the Nucleosome* , 2000, The Journal of Biological Chemistry.

[42]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[43]  M. Wilkins,et al.  Surface accessibility of protein post-translational modifications. , 2007, Journal of proteome research.

[44]  Jun Ding,et al.  Lysine acetylation sites prediction using an ensemble of support vector machine classifiers. , 2010, Journal of theoretical biology.

[45]  George M Whitesides,et al.  Lysine acetylation can generate highly charged enzymes with increased resistance toward irreversible inactivation , 2008, Protein science : a publication of the Protein Society.

[46]  K. Chou,et al.  Prediction of protein structural classes. , 1995, Critical reviews in biochemistry and molecular biology.

[47]  Nathan T. Ross,et al.  Synthetic mimetics of protein secondary structure domains , 2010, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[48]  Q Gu,et al.  Prediction of G-protein-coupled receptor classes in low homology using Chou's pseudo amino acid composition with approximate entropy and hydrophobicity patterns. , 2010, Protein and peptide letters.

[49]  E. Seto,et al.  Lysine acetylation: codified crosstalk with other posttranslational modifications. , 2008, Molecular cell.

[50]  Daniel W. A. Buchan,et al.  Protein annotation and modelling servers at University College London , 2010, Nucleic Acids Res..

[51]  E. Seto,et al.  The Rpd3/Hda1 family of lysine deacetylases: from bacteria and yeast to mice and men , 2008, Nature Reviews Molecular Cell Biology.

[52]  K. Chou Some remarks on protein attribute prediction and pseudo amino acid composition , 2010, Journal of Theoretical Biology.

[53]  L. Corbo,et al.  Protein arginine methylation in estrogen signaling and estrogen-related cancers , 2010, Trends in Endocrinology & Metabolism.

[54]  Cyrus Martin,et al.  The diverse functions of histone lysine methylation , 2005, Nature Reviews Molecular Cell Biology.

[55]  Li Xue,et al.  A Novel Approach Predicting the Signal Peptides and Their Cleavage Sites , 2007, 2007 1st International Conference on Bioinformatics and Biomedical Engineering.

[56]  Adam Godzik,et al.  Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences , 2006, Bioinform..

[57]  F. Sherman,et al.  Methylation of proteins involved in translation , 2007, Molecular microbiology.

[58]  Zhongyi Cheng,et al.  Bioinformatic Analysis and Post-Translational Modification Crosstalk Prediction of Lysine Acetylation , 2011, PloS one.

[59]  E L Gershey,et al.  Chemical studies of histone acetylation. The distribution of epsilon-N-acetyllysine in calf thymus histones. , 1968, The Journal of biological chemistry.

[60]  P. Reddy,et al.  Neuronal protection by sirtuins in Alzheimer's disease , 2006, Journal of neurochemistry.

[61]  K. Chou Prediction of protein cellular attributes using pseudo‐amino acid composition , 2001, Proteins.

[62]  Tony Kouzarides,et al.  Reversing histone methylation , 2005, Nature.

[63]  Hao Lin The modified Mahalanobis Discriminant for predicting outer membrane proteins by using Chou's pseudo amino acid composition. , 2008, Journal of theoretical biology.

[64]  K. Chou Pseudo Amino Acid Composition and its Applications in Bioinformatics, Proteomics and System Biology , 2009 .

[65]  T. Kouzarides Chromatin Modifications and Their Function , 2007, Cell.

[66]  C. Allis,et al.  Specificity of the Chromodomain Y Chromosome Family of Chromodomains for Lysine-methylated ARK(S/T) Motifs* , 2008, Journal of Biological Chemistry.

[67]  Andrew J. Bannister,et al.  Unsafe SETs: histone lysine methyltransferases and cancer. , 2002, Trends in biochemical sciences.