An optimization approach to predicting protein structural class from amino acid composition

Proteins are generally classified into four structural classes: all‐α proteins, all‐β proteins, α+β proteins, and α/β proteins. In this article, a protein is expressed as a vector of 20‐dimensional space, in which its 20 components are defined by the composition of its 20 amino acids. Based on this, a new method, the so‐called maximum component coefficient method, is proposed for predicting the structural class of a protein according to its amino acid composition. In comparison with the existing methods, the new method yields a higher general accuracy of prediction. Especially for the all‐α proteins, the rate of correct prediction obtained by the new method is much higher than that by any of the existing methods. For instance, for the 19 all‐α proteins investigated previously by P.Y. Chou, the rate of correct prediction by means of his method was 84.2%, but the correct rate when predicted with the new method would be 100%! Furthermore, the new method is characterized by an explicable physical picture. This is reflected by the process in which the vector representing a protein to be predicted is decomposed into four component vectors, each of which corresponds to one of the norms of the four protein structural classes.

[1]  Harold A. Scheraga Conformational Analysis of Polypeptides and Proteins for the Study of Protein Folding, Molecular Recognition, and Molecular Design , 1987 .

[2]  G. Fasman Prediction of Protein Structure and the Principles of Protein Conformation , 2012, Springer US.

[3]  J. Garnier,et al.  Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteins. , 1978, Journal of molecular biology.

[4]  Kuo-Chen Chou,et al.  Energetics of interactions of regular structural elements in proteins , 1990 .

[5]  Y. Okamoto,et al.  A prediction of tertiary structures of peptide by the Monte Carlo simulated annealing method. , 1989, Protein engineering.

[6]  P. Y. Chou,et al.  Conformational parameters for amino acids in helical, beta-sheet, and random coil regions calculated from proteins. , 1974, Biochemistry.

[7]  P. Y. Chou,et al.  Prediction of Protein Structural Classes from Amino Acid Compositions , 1989 .

[8]  Kuo-Chen Chou,et al.  Energy of stabilization of the right-handed βαβ crossover in proteins☆ , 1989 .

[9]  K Nishikawa,et al.  The folding type of a protein is relevant to the amino acid composition. , 1986, Journal of biochemistry.

[10]  C. Chothia,et al.  Structural patterns in globular proteins , 1976, Nature.

[11]  K. Chou,et al.  Energy-optimized structure of antifreeze protein and its binding mechanism. , 1992, Journal of molecular biology.

[12]  K. Chou,et al.  Simulated annealing approach to the study of protein structures. , 1991, Protein engineering.

[13]  S. Wilson,et al.  Applications of simulated annealing to peptides , 1990, Biopolymers.

[14]  H. Scheraga,et al.  Effect of amino acid composition on the twist and the relative stability of parallel and antiparallel .beta.-sheets , 1983 .

[15]  Kuo-Chen Chou,et al.  Energetic approach to the folding of α/β barrels , 1991 .

[16]  P. Klein,et al.  Prediction of protein structural class by discriminant analysis. , 1986, Biochimica et biophysica acta.

[17]  P. Y. Chou,et al.  Prediction of protein conformation. , 1974, Biochemistry.

[18]  V. Lim Structural principles of the globular organization of protein chains. A stereochemical theory of globular protein secondary structure. , 1974, Journal of molecular biology.

[19]  Lamberto Cesari,et al.  Optimization-Theory And Applications , 1983 .

[20]  J. Szulmajster Protein folding , 1988, Bioscience reports.

[21]  J. Richardson,et al.  Principles and Patterns of Protein Conformation , 1989 .

[22]  T Gojobori,et al.  Codon usage tabulated from the GenBank Genetic Sequence Data. , 1988, Nucleic acids research.

[23]  G M Maggiora,et al.  A heuristic approach to predicting the tertiary structure of bovine somatotropin. , 1991, Biochemistry.

[24]  H A Scheraga,et al.  Origin of the right-handed twist of beta-sheets of poly(LVal) chains. , 1982, Proceedings of the National Academy of Sciences of the United States of America.

[25]  C. DeLisi,et al.  Prediction of protein structural class from the amino acid sequence , 1986, Biopolymers.