BIOINFORMATICS APPLICATIONS NOTE Sequence analysis NetAcet: prediction of N-terminal acetylation sites

We present here a neural network based method for prediction of N-terminal acetylation-by far the most abundant post-translational modification in eukaryotes. The method was developed on a yeast dataset for N-acetyltransferase A (NatA) acetylation, which is the type of N-acetylation for which most examples are known and for which orthologs have been found in several eukaryotes. We obtain correlation coefficients close to 0.7 on yeast data and a sensitivity up to 74% on mammalian data, suggesting that the method is valid for eukaryotic NatA orthologs.

[1]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[2]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[3]  M. Perrot,et al.  Two‐dimensional gel protein database of Saccharomyces cerevisiae , 1996, Electrophoresis.

[4]  A. Shevchenko,et al.  Two‐dimensional gel protein database of Saccharomyces cerevisiae (update 1999) , 1999, Electrophoresis.

[5]  F. Sherman,et al.  N-terminal acetyltransferases and sequence requirements for N-terminal acetylation of eukaryotic proteins. , 2003, Journal of molecular biology.

[6]  Cathy H. Wu,et al.  UniProt: the Universal Protein knowledgebase , 2004, Nucleic Acids Res..

[7]  Finn Wold,et al.  How much sequence information is needed for the regulation of amino-terminal acetylation of eukaryotic proteins? , 1986 .

[8]  F. Sherman,et al.  Nα-terminal Acetylation of Eukaryotic Proteins* , 2000, The Journal of Biological Chemistry.

[9]  N. Blom,et al.  Cleavage site analysis in picornaviral polyproteins: Discovering cellular targets by neural networks , 1996, Protein science : a publication of the Protein Society.

[10]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[11]  T. D. Schneider,et al.  Sequence logos: a new way to display consensus sequences. , 1990, Nucleic acids research.

[12]  B. Matthews Comparison of the predicted and observed secondary structure of T4 phage lysozyme. , 1975, Biochimica et biophysica acta.

[13]  S. Brunak,et al.  SHORT COMMUNICATION Identification of prokaryotic and eukaryotic signal peptides and prediction of their cleavage sites , 1997 .