Artificial Neural Network Model for Predicting Protein Subcellular Location

The function of a protein is closely correlated to its subcellular location. Is it possible to utilize a bioinformatics method to predict the protein subcellular location? To explore this problem, proteins are classified into 12 groups (Protein Eng. 12 (1999) 107-118) according to their subcellular location: (1) chloroplast, (2) cytoplasm, (3) cytoskeleton, (4) endoplasmic reticulum, (5) extracellular, (6) Golgi apparatus, (7) lysosome, (8) mitochondria, (9) nucleus, (10) peroxisome, (11) plasma membrane and (12) vacuole. In this paper, the neural network method was proposed to predict the subcellular location of a protein according to its amino acid composition. Results obtained through self-consistency, cross-validation and independent dataset tests are quite high. Accordingly, the present method can serve as a complement tool for the existing prediction methods in this area.

[1]  K Nishikawa,et al.  Discrimination of intracellular and extracellular proteins using amino acid composition and residue-pair frequencies. , 1994, Journal of molecular biology.

[2]  D. Signorini,et al.  Neural networks , 1995, The Lancet.

[3]  Teuvo Kohonen,et al.  An introduction to neural computing , 1988, Neural Networks.

[4]  K. Chou,et al.  Protein subcellular location prediction. , 1999, Protein engineering.

[5]  R. Quatrano Genomics , 1998, Plant Cell.

[6]  P. Aloy,et al.  Relation between amino acid composition and cellular location of proteins. , 1997, Journal of molecular biology.

[7]  Rolf Apweiler,et al.  The SWISS-PROT protein sequence data bank and its supplement TrEMBL , 1997, Nucleic Acids Res..

[8]  K. Chou,et al.  Prediction of protein structural classes. , 1995, Critical reviews in biochemistry and molecular biology.

[9]  M. Kanehisa,et al.  A knowledge base for predicting protein localization sites in eukaryotic cells , 1992, Genomics.