Overcrossing spectra of protein backbones: Characterization of three‐dimensional molecular shape and global structural homologies

A procedure is developed and applied to characterize the global shape and folding features of the backbone of a chain molecule. The methodology is based on the following concept: the probability of observing a rigid placement of a backbone in 3‐space as a projected curve with N overcrossings. The numerical computation of these probabilities allows one to construct the overcrossing spectrum of a macromolecule at a given configuration. Although the spectrum is built from the knowledge of the nuclear geometry of the main‐chain atoms, the shape descriptor overlooks local geometrical features (such as distances and contacts) and provides a characterization of essential (topological) features of the overall fold, such as its compactness and degree of entanglement. In contrast with other shape descriptors, the present approach gives an absolute characterization of the configuration considered, and not one that is relative to an arbitrarily chosen reference structure. Moreover, it is possible to discriminate between folding features that otherwise may not be distinguished when using other geometrical or topological global descriptors. The overcrossing spectrum is proposed as a tool that complements current structural analyses of macromolecules, especially when monitoring structural homologies within a group of related or unrelated polymers. In this work, we apply the methodology to the analysis of proteins having the globin fold. The results are compared with those of other proteins exhibiting similar size and number of residues. Some basic properties of the spectra as a function of the chain length are also discussed. © 1993 John Wiley & Sons, Inc.

[1]  P. Mezey,et al.  Implementing knot-theoretical characterization methods to analyze the backbone structure of proteins: application to CTF L7/L12 and carboxypeptidase A inhibitor proteins. , 1991, Journal of molecular graphics.

[2]  M. Volkenstein,et al.  Statistical mechanics of chain molecules , 1969 .

[3]  P. Argos,et al.  Recognition of distantly related protein sequences using conserved motifs and neural networks. , 1992, Journal of molecular biology.

[4]  M Karplus,et al.  Temperature dependence of the structure and dynamics of myoglobin. A simulation approach. , 1990, Journal of molecular biology.

[5]  Janet M. Thornton,et al.  Lessons from analyzing protein structures , 1992 .

[6]  Topological aspects of the conformational transformations in polypeptides and proteins , 1983, Biopolymers.

[7]  C. Branden,et al.  Introduction to protein structure , 1991 .

[8]  P. Mezey,et al.  A method for the characterization of foldings in protein ribbon models. , 1990, Journal of molecular graphics.

[9]  K. Dill,et al.  Hydrogen bonding in globular proteins. , 1992, Journal of molecular biology.

[10]  Stieltjes integration and differential geometry: A model for enzyme recognition, discrimination, and catalysis , 1984, Bulletin of mathematical biology.

[11]  Joachim Selbig,et al.  Analysis of protein sheet topologies by graph theoretical methods , 1992, Proteins.

[12]  N. Cozzarelli,et al.  The helical repeat of double-stranded DNA varies as a function of catenation and supercoiling , 1988, Nature.

[13]  Kenneth C. Millett,et al.  A new polynomial invariant of knots and links , 1985 .

[14]  K. Humbel,et al.  Chemical Applications of Topology and Graph Theory, R.B. King (Ed.). Elsevier Science Publishers, Amsterdam (1983), (ISBN 0-444-42244-7). XII + 494 p. Price Dfl. 275.00 , 1985 .

[15]  G M Crippen Topology of globular proteins. II. , 1975, Journal of theoretical biology.

[16]  N. Goel,et al.  Computer Simulations of Self-Organization in Biological Systems , 1988 .

[17]  A H Louie,et al.  Differential geometry of proteins: a structural and dynamical representation of patterns. , 1982, Journal of theoretical biology.

[18]  G. Marsaglia Choosing a Point from the Surface of a Sphere , 1972 .

[19]  Trends in applied theoretical chemistry , 1992 .

[20]  P Willett,et al.  Use of techniques derived from graph theory to compare secondary structure motifs in proteins. , 1990, Journal of molecular biology.

[21]  M. L. Bret Catastrophic variation of twist and writhing of circular DNAs with constraint , 1979 .

[22]  G. Crippen,et al.  Contact potential that recognizes the correct folding of globular proteins. , 1992, Journal of molecular biology.

[23]  Hiromi Yamakawa,et al.  Modern Theory of Polymer Solutions , 1971 .

[24]  D Thirumalai,et al.  The nature of folded states of globular proteins , 1992, Biopolymers.

[25]  I. Kuntz,et al.  Linked and threaded loops in proteins , 1980, Biopolymers.

[26]  N. Cozzarelli,et al.  Helical repeat and linking number of surface-wrapped DNA. , 1988, Science.

[27]  O. Tapia,et al.  Electrostatic forces and the structural stability of a modelled bacteriophage T4 glutaredoxin fold : molecular dynamics simulations of polyglycine 87-mers , 1992 .

[28]  P. Mezey,et al.  Configurational dependence of molecular shape , 1992 .

[29]  G. Arteca Small- and large-scale global shape features in macromolecular backbones , 1993 .

[30]  O. Tapia,et al.  Surface fractality as a guide for studying protein—protein interactions , 1987 .

[31]  Paul G. Mezey,et al.  Potential Energy Hypersurfaces , 1987 .

[32]  A D Factor,et al.  Graphical representation of hydrogen bonding patterns in proteins. , 1991, Protein engineering.

[33]  Kenny B. Lipkowitz,et al.  Benzene is not very rigid , 1993, J. Comput. Chem..

[34]  O. Ptitsyn,et al.  Conformations of Macromolecules , 1966 .

[35]  Gustavo A. Arteca,et al.  The shapes of backbones of chain molecules: Three‐dimensional characterization by spherical shape maps , 1992 .

[36]  I. Kuntz,et al.  A molecular dynamics simulation of polyalanine: An analysis of equilibrium motions and helix–coil transitions , 1991, Biopolymers.

[37]  Gustavo A. Arteca Global measure of molecular flexibility and shape fluctuations about conformational minima , 1993, J. Comput. Chem..

[38]  A H Louie,et al.  Differential geometry of proteins. Helical approximations. , 1983 .

[39]  F. B. Fuller The writhing number of a space curve. , 1971, Proceedings of the National Academy of Sciences of the United States of America.

[40]  M. V. Volkenstein,et al.  The configurational statistics of polymeric chains , 1958 .

[41]  P. Gennes Scaling Concepts in Polymer Physics , 1979 .

[42]  Marvin Johnson,et al.  Concepts and applications of molecular similarity , 1990 .

[43]  V. Jones A polynomial invariant for knots via von Neumann algebras , 1985 .

[44]  D. W. Sumners,et al.  The knot theory of molecules , 1987 .

[45]  G. Arteca Theoretical Approaches to Structural Stability and Shape Stability Along Reaction Paths , 1992 .

[46]  R Elber,et al.  Molecular dynamics study of secondary structure motion in proteins: Application to myohemerythrin , 1990, Proteins.

[47]  Takeshi Kikuchi,et al.  Spatial geometric arrangements of disulfide‐crosslinked loops in proteins , 1986 .

[48]  Wolfgang Heiden,et al.  Topological analysis of complex molecular surfaces , 1992 .

[49]  S. Rackovsky,et al.  Characterization of multiple bends in proteins , 1980, Biopolymers.

[50]  P. Mezey New developments in molecular chirality , 1991 .

[51]  M. Hao,et al.  Modeling DNA supercoils and knots with B‐spline functions , 1989, Biopolymers.