The limits of protein secondary structure prediction accuracy from multiple sequence alignment.

The expected best residue-by-residue accuracies for secondary structure prediction from multiple protein sequence alignments have been determined by analysis of known protein structural families. The results show that substantial variation is possible among homologous protein structures, and that 100% agreement between a consensus prediction and any one member of a protein structural family is unlikely. The study provides the range of agreement to be expected between a perfect secondary structure prediction from a multiple alignment and each protein within the alignment. These results overcome the difficulties inherent in using residue-by-residue accuracy to assess the quality of consensus secondary structure predictions. The accuracies of recent consensus predictions for the annexins, SH2 domains and SH3 domains fall within the expected range for a perfect prediction.
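The residue-by-residue comparison described above can be illustrated with a minimal sketch. It is not the procedure used in the study: the three aligned secondary structure strings are invented toy data (H = helix, E = strand, C = coil, "-" = alignment gap), and a simple majority vote per alignment column is assumed here as a stand-in for a "perfect" consensus prediction. The function names `consensus` and `residue_accuracy` are hypothetical.

```python
from collections import Counter

GAP = "-"  # assumed gap symbol in the aligned secondary structure strings


def consensus(states):
    """Majority-vote secondary structure per alignment column, ignoring gaps.

    This is an assumed stand-in for a 'perfect' consensus prediction.
    """
    length = len(states[0])
    if any(len(s) != length for s in states):
        raise ValueError("all aligned strings must have the same length")
    result = []
    for column in zip(*states):
        counts = Counter(c for c in column if c != GAP)
        result.append(counts.most_common(1)[0][0] if counts else GAP)
    return "".join(result)


def residue_accuracy(predicted, observed):
    """Percentage of comparable (non-gap) positions where the states agree."""
    pairs = [(p, o) for p, o in zip(predicted, observed)
             if p != GAP and o != GAP]
    if not pairs:
        raise ValueError("no comparable positions")
    return 100.0 * sum(p == o for p, o in pairs) / len(pairs)


# Toy family: invented observed secondary structures, already aligned.
family = {
    "member_a": "CHHHHHHCEEEECC",
    "member_b": "CCHHHHCCEEEEEC",
    "member_c": "CHHHHCCCCEEECC",
}

cons = consensus(list(family.values()))
scores = {name: residue_accuracy(cons, ss) for name, ss in family.items()}

for name, score in scores.items():
    print(f"{name}: {score:.1f}% agreement with consensus")
```

In this toy family no member agrees with the consensus at every position, even though the consensus is derived from the observed structures themselves. The spread of the per-member scores is the kind of expected range against which a real consensus prediction would be judged.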
