Correlated mutations and residue contacts in proteins

The maintenance of protein function and structure constrains the evolution of amino acid sequences. This fact can be exploited to interpret correlated mutations observed in a sequence family as an indication of probable physical contact in three dimensions. Here we present a simple and general method to analyze correlations in mutational behavior between different positions in a multiple sequence alignment. We then use these correlations to predict contact maps for each of 11 protein families and compare the result with the contacts determined by crystallography. For the most strongly correlated residue pairs predicted to be in contact, the prediction accuracy ranges from 37 to 68% and the improvement ratio relative to a random prediction from 1.4 to 5.1. Predicted contact maps can be used as input for the calculation of protein tertiary structure, either from sequence information alone or in combination with experimental information. © 1994 John Wiley & Sons, Inc.

[1]  A. Mclachlan Tests for comparing related amino-acid sequences. Cytochrome c and cytochrome c 551 . , 1971, Journal of molecular biology.

[2]  M A Rodionov,et al.  [Calculation of the tertiary structure of proteins on the basis of an analysis of the matrix contacts between amino acid residues]. , 1980, Biofizika.

[3]  A. Lesk,et al.  How different amino acid sequences determine similar protein structures: the structure and evolutionary dynamics of the globins. , 1980, Journal of molecular biology.

[4]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[5]  M. Volkenstein,et al.  Protein structure and neutral theory of evolution. , 1986, Journal of biomolecular structure & dynamics.

[6]  A. Lesk,et al.  Correlation of co-ordinated amino acid substitutions with function in viruses related to tobacco mosaic virus. , 1987, Journal of molecular biology.

[7]  A. McPherson,et al.  The crystal structure of ribonuclease B at 2.5-A resolution. , 1988, The Journal of biological chemistry.

[8]  K. Nagai,et al.  Coordinated amino acid changes in homologous protein families. , 1988, Protein engineering.

[9]  J. Wells,et al.  Additivity of mutational effects in proteins. , 1990, Biochemistry.

[10]  C. Sander,et al.  Database of homology‐derived protein structures and the structural meaning of sequence alignment , 1991, Proteins.

[11]  M. Levitt,et al.  Accurate prediction of the stability and activity effects of site-directed mutagenesis on a protein core , 1991, Nature.

[12]  R. Sauer,et al.  Additivity of mutant effects assessed by binomial mutagenesis. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[13]  K Nishikawa,et al.  A geometrical constraint approach for reproducing the native backbone conformation of a protein , 1993, Proteins.

[14]  C. Sander,et al.  Can three-dimensional contacts in protein structures be predicted by analysis of correlated mutations? , 1994, Protein engineering.

[15]  C. Chothia,et al.  Volume changes in protein evolution. , 1994, Journal of molecular biology.