Hierarchy of the interaction energy distribution in the spatial structure of globular proteins and the problem of domain definition.

An algorithm for determining of protein domain structure is proposed. Domain structures resulted from the algorithm application have been obtained and compared with available data. The method is based on entirely physical model of van der Waals interactions that reflects as illustrated in this work the distribution of electron density. Various levels of hierarchy in the protein spatial structure are discerned by analysis of the energy interaction between structural units of different scales. Thus the level of energy hierarchy plays role of sole parameter, and the method obviates the use of complicated geometrical criteria with numerous fitting parameters. The algorithm readily and accurately locates domains formed by continuous segments of the protein chain as well as those comprising non-sequential segments, sets no limit to the number of segments in a domain. We have analyzed 309 protein structures. Among 277 structures for which our results could be compared with the domain definitions made in other works, 243 showed complete or partial coincidence, and only in 34 cases the domain structures proved substantially different. The domains delineated with our approach may coincide with reference definition at different levels of the globule hierarchy. Along with defining the domain structure, our approach allows one to consider the protein spatial structure in terms of the spatial distribution of the interaction energy in order to establish the correspondence between the hierarchy of energy distribution and the hierarchy of structural elements.

[1]  B. Tidor,et al.  Do salt bridges stabilize proteins? A continuum electrostatic analysis , 1994, Protein science : a publication of the Protein Society.

[2]  C. Chothia,et al.  Principles of protein–protein recognition , 1975, Nature.

[3]  M Karplus,et al.  Enthalpic contribution to protein stability: insights from atom-based calculations and statistical mechanics. , 1995, Advances in protein chemistry.

[4]  Z. Derewenda,et al.  The occurrence of C-H...O hydrogen bonds in proteins. , 1995, Journal of molecular biology.

[5]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[6]  Harold A. Scheraga,et al.  Energy parameters in polypeptides. 8. Empirical potential energy algorithm for the conformational analysis of large molecules , 1978 .

[7]  J Moult,et al.  An analysis of protein folding pathways. , 1991, Biochemistry.

[8]  Alexander A. Rashin,et al.  Location of domains in globular proteins , 1981, Nature.

[9]  W. Fuller Structural aspects of recognition and assembly in biological macromolecules , 1982 .

[10]  David C. Jones,et al.  CATH--a hierarchic classification of protein domain structures. , 1997, Structure.

[11]  A. Fersht,et al.  Surface electrostatic interactions contribute little of stability of barnase. , 1991, Journal of molecular biology.

[12]  D. Wetlaufer Nucleation, rapid folding, and globular intrachain regions in proteins. , 1973, Proceedings of the National Academy of Sciences of the United States of America.

[13]  M. H. Zehfus,et al.  Improved calculations of compactness and a reevaluation of continuous compact units , 1993, Proteins.

[14]  R. Doolittle The multiplicity of domains in proteins. , 1995, Annual review of biochemistry.

[15]  F M Richards,et al.  Areas, volumes, packing and protein structure. , 1977, Annual review of biophysics and bioengineering.

[16]  C. Chothia The nature of the accessible and buried surfaces in proteins. , 1976, Journal of molecular biology.

[17]  J M Thornton,et al.  Domain assignment for protein structures using a consensus approach: Characterization and analysis , 1998, Protein science : a publication of the Protein Society.

[18]  G. Rose,et al.  Rigid domains in proteins: An algorithmic approach to their identification , 1995, Proteins.

[19]  N. Esipova,et al.  Distribution of charges in Bacillus intermedius 7P ribonuclease determines the number of cooperatively melting regions of the globule. , 1987, Journal of biomolecular structure & dynamics.

[20]  László Patthy,et al.  Introns and exons , 1994 .

[21]  I D Campbell,et al.  The structure and function of protein modules. , 1991, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[22]  C. Chothia,et al.  Hydrophobic bonding and accessible surface area in proteins , 1974, Nature.

[23]  M G Rossmann,et al.  Comparison of super-secondary structures in proteins. , 1973, Journal of molecular biology.

[24]  J. Janin,et al.  Structural domains in proteins and their role in the dynamics of protein function. , 1983, Progress in biophysics and molecular biology.

[25]  P. Privalov,et al.  Energetics of protein structure. , 1995, Advances in protein chemistry.

[26]  Domains and modules of proteins , 1992 .

[27]  V. Tumanyan,et al.  Representation of amino acid sequences in terms of interaction energy in protein globules , 1997, FEBS letters.

[28]  T. Noguti,et al.  Localization of hydrogen‐bonds within modules in barnase , 1993, Proteins.

[29]  G M Crippen,et al.  The tree structural organization of proteins. , 1978, Journal of molecular biology.

[30]  A M Lesk,et al.  Folding units in globular proteins. , 1981, Proceedings of the National Academy of Sciences of the United States of America.

[31]  G. Rose,et al.  Compact units in proteins. , 1986, Biochemistry.

[32]  H. Casimir,et al.  The Influence of Retardation on the London-van der Waals Forces , 1948 .

[33]  J. Janin,et al.  Computer analysis of protein-protein interaction. , 1978, Journal of molecular biology.

[34]  M. Karplus,et al.  The hinge-bending mode in lysozyme , 1976, Nature.

[35]  M Go,et al.  Modular structural units, exons, and function in chicken lysozyme. , 1983, Proceedings of the National Academy of Sciences of the United States of America.

[36]  R. Porter The hydrolysis of rabbit y-globulin and antibodies with crystalline papain. , 1959, The Biochemical journal.

[37]  M. H. Zehfus,et al.  Binary discontinuous compact protein domains. , 1994, Protein engineering.

[38]  T. Noguti,et al.  Protein anatomy: spontaneous formation of filamentous helical structures from the N-terminal module of barnase. , 1993, Biochemistry.

[39]  G J Barton,et al.  Continuous and discontinuous domains: An algorithm for the automatic generation of reliable protein domain definitions , 1995, Protein science : a publication of the Protein Society.

[40]  C. Sander,et al.  Parser for protein folding units , 1994, Proteins.

[41]  Frederic M. Richards,et al.  Packing of α-helices: Geometrical constraints and contact areas☆ , 1978 .

[42]  J. Drenth,et al.  Structure of Papain , 1968, Nature.

[43]  B. Wittmann-Liebold Methods in Protein Sequence Analysis , 1989, Springer Berlin Heidelberg.

[44]  E. Baker,et al.  Hydrogen bonding in globular proteins. , 1984, Progress in biophysics and molecular biology.

[45]  C. Pace,et al.  Hydrogen bonding stabilizes globular proteins. , 1996, Biophysical journal.

[46]  Nobuhiko Saitô,et al.  Tertiary Structure of Proteins. I. : Representation and Computation of the Conformations , 1972 .

[47]  M. H. Zehfus,et al.  Identification of compact, hydrophobically stabilized domains and modules containing multiple peptide chains , 1997, Protein science : a publication of the Protein Society.

[48]  Tatsuo Ooi,et al.  Tertiary Structure of Proteins. II. Freedom of Dihedral Angles and Energy Calculation , 1972 .

[49]  Peer Bork,et al.  Mobile modules and motifs , 1992, Current Biology.

[50]  A. Fersht,et al.  Strength and co-operativity of contributions of surface salt bridges to protein stability. , 1990, Journal of molecular biology.

[51]  F. Richards,et al.  Identification of regions of potential flexibility in protein structures: Folding units and correlations with intron positions , 1988, Biopolymers.

[52]  S. G. Anderson International Symposium on Interferon and Interferon Inducers , 1971 .

[53]  J. Thornton,et al.  Ion-pairs in proteins. , 1983, Journal of molecular biology.

[54]  C. Pace,et al.  Forces contributing to the conformational stability of proteins , 1996, FASEB journal : official publication of the Federation of American Societies for Experimental Biology.

[55]  B Honig,et al.  Free energy balance in protein folding. , 1995, Advances in protein chemistry.

[56]  H. Scheraga,et al.  Energy parameters in polypeptides. 9. Updating of geometrical parameters, nonbonded interactions, and hydrogen bond interactions for the naturally occurring amino acids , 1983 .

[57]  M J Sternberg,et al.  Identification and analysis of domains in proteins. , 1995, Protein engineering.

[58]  K. Dill Dominant forces in protein folding. , 1990, Biochemistry.

[59]  G. Rose,et al.  Hierarchic organization of domains in globular proteins. , 1979, Journal of molecular biology.

[60]  K. Dill,et al.  Hydrogen bonding in globular proteins. , 1992, Journal of molecular biology.

[61]  C. Chothia Structural invariants in protein folding , 1975, Nature.

[62]  James E. Bray,et al.  The CATH Database provides insights into protein structure/function relationships , 1999, Nucleic Acids Res..

[63]  B. Lee,et al.  The interpretation of protein structures: estimation of static accessibility. , 1971, Journal of molecular biology.

[64]  M Go,et al.  Protein anatomy: functional roles of barnase module. , 1993, The Journal of biological chemistry.