Location of structural domains in proteins

We use surface area measurements based on atomic positions to give a quantitative definition of structural domains in proteins. Segments of the polypeptide chain making a minimum of interactions with the rest of the protein structure are identified on interface area scans, where the area B of the interface between a N-terminal segment of i residues and the complementary C-terminal segment is plotted as a function of i. Domain boundaries appear as minima of B in the scans. The procedure may be iterated to build a hierarchy of subdomains. It detects only continuous domains made of a single stretch of polypeptide chain but may be extended to detect such domains in the presence of discontinuous ones. Domains defined from interface area scans fit very well with globular structural regions identified by inspection of protein models [Wetlaufer, D. B. (1973) Proc. Natl. Acad. Sci. U.S.A. 70, 697-701]. They do not in general correspond to the repeated structural units observed in some proteins by superposition studies. In hemoglobin and hen lysozyme, the domains do not correspond to the coding sequences separated by introns in the genes. © 1981 American Chemical Society.

[1]  Michael G. Rossmann,et al.  Chemical and biological evolution of a nucleotide-binding protein , 1974, Nature.

[2]  C. Chothia,et al.  Hydrophobic bonding and accessible surface area in proteins , 1974, Nature.

[3]  D. Phillips,et al.  THE HEN EGG-WHITE LYSOZYME MOLECULE , 1967 .

[4]  C. Craik,et al.  O2 binding properties of the product of the central exon of β-globin gene , 1981, Nature.

[5]  C. Chothia,et al.  Structural patterns in globular proteins , 1976, Nature.

[6]  A. E. Sippel,et al.  Exons encode functional and structural units of chicken lysozyme. , 1980, Proceedings of the National Academy of Sciences of the United States of America.

[7]  S J Remington,et al.  A general method to assess similarity of protein structures, with applications to T4 bacteriophage lysozyme. , 1978, Proceedings of the National Academy of Sciences of the United States of America.

[8]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1977, Journal of molecular biology.

[9]  B. Matthews,et al.  Common precursor of lysozymes of hen egg-white and bacteriophage T4 , 1981, Nature.

[10]  S. Sprang,et al.  The structure of glycogen phosphorylase a at 2.5 Å resolution , 1979 .

[11]  C. Craik,et al.  Characterization of globin domains: heme binding to the central exon product. , 1980, Proceedings of the National Academy of Sciences of the United States of America.

[12]  Alexander A. Rashin,et al.  Location of domains in globular proteins , 1981, Nature.

[13]  O. Ptitsyn,et al.  A model of myoglobin self-organization. , 1975, Biophysical chemistry.

[14]  R. Tanguay,et al.  Heat-shock induced proteins present in the cell nucleus of Chironomus tentans salivary gland , 1979, Nature.

[15]  Anders Liljas,et al.  2 Evolutionary and Structural Relationships among Dehydrogenases , 1975 .

[16]  Robert B. Hermann,et al.  Theory of hydrophobic bonding. II. Correlation of hydrocarbon solubility in water with solvent cavity surface area , 1972 .

[17]  J. Jenkins,et al.  Crystallographic studies on the activity of glycogen phosphorylase b , 1978, Nature.

[18]  D. Teller Accessible area, packing volumes and interaction surfaces of globular proteins , 1976, Nature.

[19]  A. Mclachlan Gene duplications in the structural evolution of chymotrypsin. , 1979, Journal of molecular biology.

[20]  G M Crippen,et al.  The tree structural organization of proteins. , 1978, Journal of molecular biology.

[21]  G. Rose,et al.  Hierarchic organization of domains in globular proteins. , 1979, Journal of molecular biology.

[22]  J. Janin,et al.  Surface area of globular proteins. , 1976, Journal of molecular biology.

[23]  C. Chothia Structural invariants in protein folding , 1975, Nature.

[24]  Gerald M. Edelman,et al.  THE COVALENT STRUCTURE OF AN ENTIRE γG IMMUNOGLOBULIN MOLECULE , 1969 .

[25]  Frederic M. Richards,et al.  Packing of α-helices: Geometrical constraints and contact areas☆ , 1978 .

[26]  C. Blake,et al.  Exons encode protein functional units , 1979, Nature.

[27]  M. Go Correlation of DNA exonic regions with protein structural units in haemoglobin , 1981, Nature.

[28]  William A. Eaton,et al.  The relationship between coding sequences and function in haemoglobin , 1980, Nature.

[29]  P Argos,et al.  Exploring structural homology of proteins. , 1976, Journal of molecular biology.

[30]  Anders Liljas,et al.  Recognition of structural domains in globular proteins , 1974 .

[31]  T. Blundell,et al.  Homology among acid proteases: comparison of crystal structures at 3A resolution of acid proteases from Rhizopus chinensis and Endothia parasitica. , 1977, Proceedings of the National Academy of Sciences of the United States of America.

[32]  W. Gilbert Why genes in pieces? , 1978, Nature.

[33]  P Argos,et al.  A comparison of the heme binding pocket in globins and cytochrome b5. , 1975, The Journal of biological chemistry.

[34]  J. Walker,et al.  Sequence and structure of D-glyceraldehyde 3-phosphate dehydrogenase from Bacillus stearothermophilus , 1977, Nature.

[35]  T. Blundell,et al.  Four-fold structural repeat in the acid proteases. , 1979, Biochimica et biophysica acta.

[36]  M. Levitt A simplified representation of protein conformations for rapid simulation of protein folding. , 1976, Journal of molecular biology.

[37]  William R. Taylor,et al.  Analysis and prediction of protein β-sheet structures by a combinatorial approach , 1980, Nature.

[38]  B. Matthews,et al.  The conformation of thermolysin. , 1974, The Journal of biological chemistry.

[39]  D. Wetlaufer Nucleation, rapid folding, and globular intrachain regions in proteins. , 1973, Proceedings of the National Academy of Sciences of the United States of America.

[40]  A. Gustchina,et al.  On the supersecondary structure of acid proteases. , 1979, Biochemical and biophysical research communications.

[41]  B. Lee,et al.  The interpretation of protein structures: estimation of static accessibility. , 1971, Journal of molecular biology.

[42]  M G Rossmann,et al.  Comparison of super-secondary structures in proteins. , 1973, Journal of molecular biology.

[43]  Michael G. Rossmann,et al.  X-Ray Studies of Protein Interactions , 1974 .

[44]  S. Sprang,et al.  Solvent accessibility properties of complex proteins , 1979, Nature.

[45]  J. Janin,et al.  Computer analysis of protein-protein interaction. , 1978, Journal of molecular biology.

[46]  C. Chothia,et al.  Principles of protein–protein recognition , 1975, Nature.

[47]  J. Janin,et al.  Analytical approximation to the accessible surface area of proteins. , 1980, Proceedings of the National Academy of Sciences of the United States of America.