Prediction of palmitoylation sites using the composition of k-spaced amino acid pairs.

Palmitoylation is an important hydrophobic protein modification that participates in many cellular processes, including signaling, neuronal transmission, and membrane trafficking. Identifying palmitoylated proteins and their modification sites is therefore an important problem. Compared with expensive and time-consuming biochemical experiments, computational methods have attracted much attention because they predict palmitoylation sites well. In this paper, we develop a novel automated computational method for this task. Each sequence segment of a given protein is encoded using the composition of k-spaced amino acid pairs (CKSAAP), and a support vector machine is then used as the predictor. The proposed prediction model, CKSAAP-Palm, outperforms the existing method CSS-Palm2.0 both in cross-validation experiments and on independent test data sets. These results suggest that CKSAAP-Palm is able to predict more potential palmitoylation sites and can increase research productivity in palmitoylation site discovery. The corresponding software can be freely downloaded from http://www.aporc.org/doc/wiki/CKSAAP-Palm.
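
To make the CKSAAP encoding concrete, the following is a minimal sketch of how a single sequence segment could be turned into a feature vector. It is an illustration under stated assumptions, not the CKSAAP-Palm implementation: the function name `cksaap_encode`, the 20-letter alphabet, the default `k_max=2`, and the normalization by the number of pair positions are all choices made here for the example and are not taken from the abstract. For each spacing k from 0 to `k_max`, the sketch counts every ordered residue pair separated by k positions and divides by the number of such positions in the segment.

```python
from itertools import product

# Assumption: the 20 standard amino acids; other symbols are simply ignored.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

def cksaap_encode(segment, k_max=2):
    """Return a CKSAAP-style feature vector for one sequence segment.

    The vector has 400 entries per spacing k (one per ordered residue
    pair), concatenated for k = 0 .. k_max.
    """
    pairs = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]
    index = {pair: i for i, pair in enumerate(pairs)}
    features = []
    for k in range(k_max + 1):
        counts = [0] * len(pairs)
        # Number of positions i for which residue i + k + 1 exists.
        n_positions = len(segment) - k - 1
        for i in range(max(n_positions, 0)):
            pair = segment[i] + segment[i + k + 1]
            if pair in index:
                counts[index[pair]] += 1
        # Normalize counts to a composition; guard against empty windows.
        total = max(n_positions, 1)
        features.extend(c / total for c in counts)
    return features
```

Vectors produced this way for positive and negative segments could then be passed to any support vector machine implementation for training; that step is omitted here because the abstract does not specify the kernel or parameters used.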
