Formal analysis of protein sequences. I. Specific long-range constraints in pair associations of amino acids.

Abstract: A thorough formal investigation of pair correlations between amino acids in various protein sequences is performed, using a particularly sensitive method of statistical analysis. Evidence is given for the existence of correlations between amino acids separated, along the polypeptide chain, by 5 and 6 peptide bonds and by 28 and 29 peptide bonds. We discuss the chemical nature of these correlations and their possible relation to the three-dimensional structure of proteins.
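To make the idea of a pair correlation at a fixed separation concrete, the following is a minimal sketch. It is not the authors' method, which the abstract does not specify. It assumes a sequence given as a string of one-letter residue codes and uses an ordinary Pearson chi-square statistic as a stand-in measure of association. The names `pair_counts` and `chi_square_at_gap` are hypothetical.

```python
from collections import Counter


def pair_counts(seq, gap):
    """Count ordered residue pairs (seq[i], seq[i + gap]).

    `gap` is the number of peptide bonds separating the two residues.
    """
    return Counter(zip(seq, seq[gap:]))


def chi_square_at_gap(seq, gap):
    """Pearson chi-square statistic for association at a given separation.

    Assumption: this is a generic contingency-table test, used here only
    as an illustration; it is not the statistic used in the paper.
    """
    n = len(seq) - gap  # number of residue pairs at this separation
    if gap < 1 or n <= 0:
        raise ValueError("gap must be at least 1 and shorter than the sequence")
    observed = pair_counts(seq, gap)
    first = Counter(seq[:n])     # residues in the first position of a pair
    second = Counter(seq[gap:])  # residues in the second position of a pair
    chi2 = 0.0
    for a, count_a in first.items():
        for b, count_b in second.items():
            # Expected count if the two positions were independent.
            expected = count_a * count_b / n
            chi2 += (observed.get((a, b), 0) - expected) ** 2 / expected
    return chi2


if __name__ == "__main__":
    toy = "ABABABAB"  # toy sequence, not real protein data
    for gap in (1, 2):
        print(gap, chi_square_at_gap(toy, gap))
```

Scanning `gap` over a range of separations and comparing the statistic against its null distribution would be one way to look for distinguished separations such as those reported above. Real sequences with 20 residue types give sparse tables, so a plain chi-square test would need care.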
