Sequence Conservation and Correlation Measures in Protein Structure Prediction

The rapid elucidation of protein sequences has allowed multiple sequence alignments to be calculated for a wide variety of proteins. Such alignments reveal positions that exhibit amino acid conservation--either of specific chemical groups in active and binding sites or of the more chemically inert hydrophobic residues that contribute to the protein core. The latter can provide constraints on the position of the protein chain and any local periodicity can suggest the type of secondary structure. Conservation measures, however, cannot provide specific pairwise packing information (each conserved hydrophobic position might pack against any other). However, if correlated changes between positions were observed then specific pairs of residue could be identified as interacting and therefore probably spatially adjacent. Most 'observations' of correlated changes have been anecdotal and of the few systematic studies that have been made, most have mistakenly incorporated a strong bias towards selecting conserved positions. When the conservation effect is separated (as best as possible) then little correlation signal remains to help identify adjacent positions.

[1]  William R. Taylor,et al.  A structural model for the retroviral proteases , 1987, Nature.

[2]  William R. Taylor,et al.  Protein Structure Prediction From Sequence , 1993, Comput. Chem..

[3]  W R Taylor,et al.  Structural and mechanistic implications of the amino acid sequence of calcium-transporting ATPases. , 1986, Ciba Foundation symposium.

[4]  M J Sternberg,et al.  On the conformation of proteins: the handedness of the connection between parallel beta-strands. , 1977, Journal of molecular biology.

[5]  M. Sternberg,et al.  On the prediction of protein structure: The significance of the root-mean-square deviation. , 1980, Journal of molecular biology.

[6]  W. Taylor,et al.  Folding polypeptide α‐carbon backbones by distance geometry methods , 1994 .

[7]  D. T. Jones,et al.  A new approach to protein fold recognition , 1992, Nature.

[8]  A. Lesk,et al.  Correlation of co-ordinated amino acid substitutions with function in viruses related to tobacco mosaic virus. , 1987, Journal of molecular biology.

[9]  K. Nagai,et al.  Coordinated amino acid changes in homologous protein families. , 1988, Protein engineering.

[10]  W R Taylor,et al.  A template based method of pattern matching in protein sequences. , 1989, Progress in biophysics and molecular biology.

[11]  W R Taylor,et al.  The predicted secondary structures of the nucleotide-binding sites of six cation-transporting ATPases lead to a probable tertiary fold. , 1989, European journal of biochemistry.

[12]  Timothy F. Havel,et al.  A new method for building protein conformations from sequence alignments with homologues of known structure. , 1991, Journal of molecular biology.

[13]  M. Nilges,et al.  Computational challenges for macromolecular structure determination by X-ray crystallography and solution NMRspectroscopy , 1993, Quarterly Reviews of Biophysics.

[14]  M. Sternberg,et al.  Prediction of protein secondary structure and active sites using the alignment of homologous sequences. , 1987, Journal of molecular biology.

[15]  M. Sippl Calculation of conformational ensembles from potentials of mean force. An approach to the knowledge-based prediction of local structures in globular proteins. , 1990, Journal of molecular biology.

[16]  W. Taylor,et al.  A structural model for the nucleotide binding domains of the flavocytochrome b–245 β‐chain , 1993, Protein science : a publication of the Protein Society.

[17]  S. Benner,et al.  Patterns of divergence in homologous proteins as indicators of secondary and tertiary structure: a prediction of the structure of the catalytic domain of protein kinases. , 1991, Advances in enzyme regulation.

[18]  P. Argos,et al.  Analysis of insertions/deletions in protein structures. , 1992, Journal of molecular biology.