Homology in protein sequences expressed by correlation coefficients.

Abstract Internal homologies in an amino acid sequence of a protein and in amino acid sequences of two different proteins are examined, using correlation coefficients calculated from the sequences when residues are replaced by various quantitative properties of the amino acids such as hydrophobicity. To improve the signal-noise ratio the average correlation coefficient is used to detect homology because the correlation depends on the property considered. In this way, any sequence repetition in a protein and the extent of the similarity and difference among proteins can be estimated quantitatively. The procedure was applied first to the sequences of proteins which have been assumed on other grounds to contain some internal sequence repetitions, α-tropomyosin from rabbit skeletal muscle, calmodulin from bovine brain, troponin C from skeletal and cardiac muscle, and then to the sequences of calcium binding proteins, calmodulin, troponin C, and L2 light chain of myosin. The results show that α-tropomyosin has a markedly periodic sequence at intervals of multiples of seven residues throughout the whole sequence, and calmodulin and skeletal troponin C contain two homologous sequences, the homology of troponin C being weaker than that of calmodulin. Candidates for the calcium binding regions of both troponin C, calmodulin, and L2 light chain are the homologous parts having a high average correlation coefficient (about 0·5) with respect to the sequences of the CD and EF hand regions of carp parvalbumin. The procedure may be a useful method for searching for homologous segments in amino acid sequences.

[1]  Kenji Takahashi,et al.  Determination of the complete amino acid sequence of bovine cardiac troponin C. , 1976 .

[2]  D. Mercola,et al.  Calcium binding by troponin-C and homologs is correlated with the position and linear density of "beta-turn forming" residues. , 1979, Journal of theoretical biology.

[3]  Y. M. Lin,et al.  Cyclic 3':5'-nucleotide phosphodiesterase. Purification, characterization, and active form of the protein activator from bovine brain. , 1974, The Journal of biological chemistry.

[4]  M. O. Dayhoff,et al.  Atlas of protein sequence and structure , 1965 .

[5]  R. Hodges,et al.  Synthetic analog of a high affinity calcium binding site in rabbit skeletal troponin C. , 1980, The Journal of biological chemistry.

[6]  S. B. Needleman,et al.  A general method applicable to the search for similarities in the amino acid sequence of two proteins. , 1970, Journal of molecular biology.

[7]  M. Oobatake,et al.  An analysis of non-bonded energy of proteins. , 1977, Journal of theoretical biology.

[8]  L. Smillie,et al.  The amino acid sequence of rabbit skeletal alpha-tropomyosin. The NH2-terminal half and complete sequence. , 1978, The Journal of biological chemistry.

[9]  A. Mclachlan,et al.  The 14-fold periodicity in α-tropomyosin and the interaction with actin , 1976 .

[10]  R. Bradshaw,et al.  Carp muscle calcium-binding protein. I. Characterization of the tryptic peptides and the complete amino acid sequence of component B. , 1973, The Journal of biological chemistry.

[11]  G. Matsuda,et al.  Amino acid sequence of the L-2 light chain of rabbit skeletal muscle myosin. , 1977, Journal of biochemistry.

[12]  T. Vanaman,et al.  The complete amino acid sequence of the Ca2+-dependent modulator protein (calmodulin) of bovine brain. , 1980, The Journal of biological chemistry.

[13]  P. Y. Chou,et al.  Prediction of protein conformation. , 1974, Biochemistry.

[14]  J. Stull,et al.  Calcium binding properties of beef cardiac troponin. , 1978, The Journal of biological chemistry.

[15]  J. M. Zimmerman,et al.  The characterization of amino acid sequences in proteins by statistical methods. , 1968, Journal of theoretical biology.

[16]  J. H. Collins,et al.  The amino acid sequence of rabbit skeletal muscle troponin C: Gene replication and homology with calcium ‐binding proteins from carp and hake muscle , 1973, FEBS letters.

[17]  M. Tanaka,et al.  The amino acid sequence of Clostridium pasteurianum ferredoxin. , 1966, Biochemical and biophysical research communications.

[18]  T. Vanaman,et al.  Structural similarities between the Ca2+-dependent regulatory proteins of 3':5'-cyclic nucleotide phosphodiesterase and actomyosin ATPase. , 1976, The Journal of biological chemistry.

[19]  Divalent cation binding properties of slow skeletal muscle troponin in comparison with those of cardiac and fast skeletal muscle troponins. , 1979, Journal of biochemistry.

[20]  J. H. Collins Homology of myosin DTNB light chain with alkali light chains, troponin C and parvalbumin , 1976, Nature.

[21]  R. Kretsinger,et al.  Carp muscle calcium-binding protein. II. Structure determination and general description. , 1973, The Journal of biological chemistry.

[22]  D. D. Jones,et al.  Amino acid properties and side-chain orientation in proteins: a cross correlation appraoch. , 1975, Journal of theoretical biology.

[23]  J. H. Collins Homology of myosin light chains, troponin-C and parvalbumins deduced from comparison of their amino acid sequences. , 1974, Biochemical and biophysical research communications.

[24]  A. Mclachlan Tests for comparing related amino-acid sequences. Cytochrome c and cytochrome c 551 . , 1971, Journal of molecular biology.