Statistical geometry of packing defects of lattice chain polymer from enumeration and sequential Monte Carlo method

Voids exist in proteins as packing defects and are often associated with protein functions. We study the statistical geometry of voids in two-dimensional lattice chain polymers. We define voids as topological features and develop a simple algorithm for their detection. For short chains, void geometry is examined by enumerating all conformations. For long chains, the space of void geometry is explored using sequential Monte Carlo importance sampling and resampling techniques. We characterize the relationship of geometric properties of voids with chain length, including probability of void formation, expected number of voids, void size, and wall size of voids. We formalize the concept of packing density for lattice polymers, and further study the relationship between packing density and compactness, two parameters frequently used to describe protein packing. We find that both fully extended and maximally compact polymers have the highest packing density, but polymers with intermediate compactness have low packing density. To study the conformational effects of void formation, we characterize the conformational reduction factor of void formation and found that there are strong end-effect. Voids are more likely to form at the chain end. The critical exponent of end-effect is twice as large as that of self-contacting loop formation when existence of voids is not required. We also briefly discuss the sequential Monte Carlo sampling and resampling techniques used in this study. © 2002 American Institute of Physics. @DOI: 10.1063/1.1493772#

[1]  Duplantier Exact contact critical exponents of a self-avoiding polymer chain in two dimensions. , 1987, Physical review. B, Condensed matter.

[2]  S Vishveshwara,et al.  Lattice model for rapidly folding protein-like heteropolymers. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Jun S. Liu,et al.  Monte Carlo strategies in scientific computing , 2001 .

[4]  Hue Sun Chan,et al.  Compact Polymers , 2001 .

[5]  T. Ishinabe,et al.  Exact enumerations of self‐avoiding lattice walks with different nearest‐neighbor contacts , 1986 .

[6]  D Thirumalai,et al.  Kinetics and thermodynamics of folding in model proteins. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[7]  J. Onuchic,et al.  Folding kinetics of proteinlike heteropolymers , 1994, cond-mat/9404001.

[8]  Nando de Freitas,et al.  Sequential Monte Carlo Methods in Practice , 2001, Statistics for Engineering and Information Science.

[9]  J. D. Cloizeaux,et al.  Short range correlation between elements of a long polymer in a good solvent , 1980 .

[10]  Jun S. Liu,et al.  Blind Deconvolution via Sequential Imputations , 1995 .

[11]  Berend Smit,et al.  Understanding molecular simulation: from algorithms to applications , 1996 .

[12]  D. Thirumalai,et al.  Modeling the role of disulfide bonds in protein folding: Entropic barriers and pathways , 1995, Proteins.

[13]  V. Pande,et al.  Enumerations of the Hamiltonian walks on a cubic sublattice , 1994 .

[14]  K. Dill,et al.  A lattice statistical mechanics model of the conformational and sequence spaces of proteins , 1989 .

[15]  Hue Sun Chan,et al.  Energy landscapes and the collapse dynamics of homopolymers , 1993 .

[16]  W. Lim,et al.  Alternative packing arrangements in the hydrophobic core of λrepresser , 1989, Nature.

[17]  Jun S. Liu,et al.  Sequential Imputations and Bayesian Missing Data Problems , 1994 .

[18]  F. Richards The interpretation of protein structures: total volume, group volume distributions and packing density. , 1974, Journal of molecular biology.

[19]  F M Richards,et al.  An analysis of packing in the protein folding problem , 1993, Quarterly Reviews of Biophysics.

[20]  H. Meirovitch A new method for simulation of real chains: scanning future steps , 1982 .

[21]  A. Fersht,et al.  Active barnase variants with completely random hydrophobic cores. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[22]  F M Richards,et al.  Areas, volumes, packing and protein structure. , 1977, Annual review of biophysics and bioengineering.

[23]  K. Dill,et al.  The effects of internal constraints on the configurations of chain molecules , 1990 .

[24]  Ned S. Wingreen,et al.  Designability, thermodynamic stability, and dynamics in protein folding: A lattice model study , 1998, cond-mat/9806197.

[25]  Klimov,et al.  Criterion that determines the foldability of proteins. , 1996, Physical review letters.

[26]  Eugene I. Shakhnovich,et al.  Enumeration of all compact conformations of copolymers with random sequence of links , 1990 .

[27]  Jun S. Liu,et al.  Sequential Monte Carlo methods for dynamic systems , 1997 .

[28]  H. Edelsbrunner,et al.  Anatomy of protein pockets and cavities: Measurement of binding site geometry and implications for ligand design , 1998, Protein science : a publication of the Protein Society.

[29]  M. Swindells,et al.  Protein clefts in molecular recognition and function. , 1996, Protein science : a publication of the Protein Society.

[30]  S. Golomb Polyominoes: Puzzles, Patterns, Problems, and Packings , 1994 .

[31]  K. Dill Theory for the folding and stability of globular proteins. , 1985, Biochemistry.

[32]  M. Karplus,et al.  How does a protein fold? , 1994, Nature.

[33]  K A Dill,et al.  Are proteins well-packed? , 2001, Biophysical journal.

[34]  P. Grassberger Pruned-enriched Rosenbluth method: Simulations of θ polymers of chain length up to 1 000 000 , 1997 .

[35]  A. W. Rosenbluth,et al.  MONTE CARLO CALCULATION OF THE AVERAGE EXTENSION OF MOLECULAR CHAINS , 1955 .

[36]  N. Wingreen,et al.  Emergence of Preferred Structures in a Simple Model of Protein Folding , 1996, Science.

[37]  Jun S. Liu,et al.  Rejection Control and Sequential Importance Sampling , 1998 .

[38]  Richard A. Goldstein,et al.  Searching for foldable protein structures using optimized energy functions , 1995 .

[39]  D. Yee,et al.  Principles of protein folding — A perspective from simple exact models , 1995, Protein science : a publication of the Protein Society.

[40]  I. Orgzall,et al.  Universality and cluster structures in continuum models of percolation with two different radius distributions , 1993 .

[41]  W E Stites,et al.  Contributions of the large hydrophobic amino acids to the stability of staphylococcal nuclease. , 1990, Biochemistry.

[42]  Douglas M. Bates,et al.  Nonlinear Regression Analysis and Its Applications , 1988 .

[43]  K. Dill,et al.  Intrachain loops in polymers: Effects of excluded volume , 1989 .

[44]  R. K. Shyamasundar,et al.  Introduction to algorithms , 1996 .

[45]  K. Dill Dominant forces in protein folding. , 1990, Biochemistry.

[46]  C. Chothia Structural invariants in protein folding , 1975, Nature.