Nucleotide sequences of complementary deoxyribonucleic acids for the pro alpha 1 chain of human type I procollagen. Statistical evaluation of structures that are conserved during evolution.

Nucleotide sequences were determined for two cloned cDNAs encoding for over three-fourths of the pro alpha 1 (I) chain of type I procollagen from man. Comparison with previously published data on amino acid sequences of the alpha 1 (I) chain of type I collagen made it possible to examine mutations in the transcribed products of the gene which have occurred during the evolution of man, calf, rat, mouse, and chick. Comparison of the nucleotide sequences with the corresponding sequences of cDNAs from chick [Fuller, F., & Boedtker, H. (1981) Biochemistry 20, 996] and with cDNAs for the pro alpha 2(I) chain from man [Bernard, M.P., Myers, J. C., Chu, M.-L., Ramirez, F., Eikenberry, E. F., & Prockop, D. J. (1983) Biochemistry 22, 1139] demonstrated that selective pressure during evolution for 250 million or more years acted more strongly on the structure of the pro alpha 1 (I) chain than on the pro alpha 2(I) chain. To improve the reliability of the comparison, the nucleotide sequences were examined with a modification of previous procedures for evaluating mutations in replacement sites and silent sites. The corrected divergence for replacement sites between the alpha 1 (I) chains was 6 +/- 0.8% whereas it was 15 +/- 1.9% for the alpha 2(I) chains. The C-propeptide domain of the pro alpha 1 (I) chain was also highly conserved with a corrected divergence at replacement sites of 5 +/- 0.9%, a value that was not distinguishable from the value previously found for the C-propeptide of the pro alpha 2(I) chain. Therefore, a large part of the structure of both C-propeptides appears to be under selective pressure. Inspection of changes in the C-propeptide of the pro alpha 1 (I) chain suggested that there was a highly conserved region around the carbohydrate attachment site similar to the highly conserved region of 37 amino acids previously found in the C-propeptide of the pro alpha 2(I) chain. Two statistical tests, however, were unable to confirm nonrandom distribution of changes in the C-propeptide of the pro alpha 1(I) chain. The same tests established the presence of a nonrandom distribution in nucleotide changes of the C-propeptide of the pro alpha 2(I) chain. The 3'-noncoding region of the cDNA for pro alpha 1(I) of human type I procollagen showed no homology with the same region in the chick.(ABSTRACT TRUNCATED AT 400 WORDS)

[1]  H. Furthmayr,et al.  Comparative sequence studies on alpha2-CB2 from calf, human, rabbit and pig-skin collagen. , 1974, European journal of biochemistry.

[2]  I. Pastan,et al.  The exon/intron structure of the 3'-region of the pro alpha 2(I) collagen gene. , 1981, The Journal of biological chemistry.

[3]  A. Wilson,et al.  The untranslated regions of β-globin mRNA evolve at a functional rate in higher primates , 1981, Cell.

[4]  Walter Gilbert,et al.  The evolution of genes: the chicken preproinsulin gene , 1980, Cell.

[5]  A. Kang,et al.  Amino acid sequence of cyanogen bromide peptides from the amino-terminal region of chick skicollagen. , 1970, Biochemistry.

[6]  D. Hanahan,et al.  Structure of the pro α2(I) collagen gene , 1981, Nature.

[7]  S. Faro,et al.  Cloning a cDNA for the pro-alpha 2 chain of human type I collagen. , 1981, Proceedings of the National Academy of Sciences of the United States of America.

[8]  W. Gilbert,et al.  Sequencing end-labeled DNA with base-specific chemical cleavages. , 1980, Methods in enzymology.

[9]  K. Kivirikko,et al.  [Biosynthesis of collagen and its disorders]. , 1979, Duodecim; laaketieteellinen aikakauskirja.

[10]  A. Kang,et al.  The amino acid sequence of chick skin collagen alpha1-CB7. , 1975, Biochemistry.

[11]  I. Pastan,et al.  Construction of a recombinant bacterial plasmid containing pro-alpha 1(I) collagen DNA sequences. , 1980, The Journal of biological chemistry.

[12]  Tom Maniatis,et al.  The structure and evolution of the human β-globin gene family , 1980, Cell.

[13]  P. Cloud Atmospheric and hydrospheric evolution on the primitive earth. Both secular accretion and biological and geochemical processes have affected earth's volatile envelope. , 1968, Science.

[14]  M L Chu,et al.  Structure of a cDNA for the pro alpha 2 chain of human type I procollagen. Comparison with chick cDNA for pro alpha 2(I) identifies structurally conserved features of the protein and the gene. , 1983, Biochemistry.

[15]  J. Seyer,et al.  Covalent structure of collagen: amino-acid sequence of chymotryptic peptides from the carboxyl-terminal region of alpha2-CB3 of chick-skin collagen. , 1977, European journal of biochemistry.

[16]  H. Hofmann,et al.  The role of polar and hydrophobic interactions for the molecular packing of type I collagen: a three-dimensional evaluation of the amino acid sequence. , 1978, Journal of molecular biology.

[17]  E. Fuchs,et al.  Complementary DNA sequence of a human cytoplasmic actin. Interspecies divergence of 3' non-coding regions. , 1983, Journal of molecular biology.

[18]  P. Bornstein,et al.  Structurally distinct collagen types. , 1980, Annual review of biochemistry.

[19]  H. Boedtker,et al.  Sequence determination and analysis of the 3' region of chicken pro-alpha 1(I) and pro-alpha 2(I) collagen messenger ribonucleic acids including the carboxy-terminal propeptide sequences. , 1981, Biochemistry.

[20]  J. Seyer,et al.  Covalent structure of collagen: isolation of chymotryptic peptides and amino acid sequence of the amino-terminal region of alpha2-CB3 from chick skin. , 1977, European journal of biochemistry.

[21]  J. Myers,et al.  Cloning and characterization of five overlapping cDNAs specific for the human pro alpha 1(I) collagen chain. , 1982, Nucleic acids research.

[22]  B. Mccarthy,et al.  DNA sequence analysis of a mouse pro alpha 1 (I) procollagen gene: evidence for a mouse B1 element within the gene , 1982, Molecular and cellular biology.

[23]  K. Kühn,et al.  The covalent structure of collagen: Amino acid sequence of the N‐terminal region of α2‐CB5 from rat skin collagen , 1973, FEBS letters.