Visible volume: a robust measure for protein structure characterization.

We propose a new characterization of protein structure based on the natural tetrahedral geometry of the beta-carbon and a new geometric measure of structural similarity, called visible volume. In our model, each side-chain is replaced by an ideal tetrahedron whose orientation is fixed with respect to the backbone and corresponds to the preferred rotamer directions. Visible volume is a measure of the non-occluded empty space surrounding each residue position after the side-chains have been removed. It is a robust, parameter-free, locally computed quantity that accounts for many of the spatial constraints relevant to the corresponding position in the native structure. When computing visible volume, we ignore the nature of both the residue observed at each site and the residues surrounding it. We focus instead on the space that, together, these residues could occupy. By doing so, we are able to quantify a new kind of invariance beyond the apparent variations within protein families, namely, the conservation of the physical space available for side-chain packing at structurally equivalent positions. Corresponding positions in native structures are likely to be of interest in protein structure prediction, protein design, and homology modeling. Visible volume is related to the degree of exposure of a residue position and to the actual rotamers in native proteins. Here, we discuss the properties of this new measure, namely, its robustness with respect to both crystallographic uncertainties and naturally occurring variations in atomic coordinates, and the remarkable fact that it is essentially independent of the choice of the parameters used in calculating it. We also show how visible volume can be used to align protein structures, to identify structurally equivalent positions that are conserved in a family of proteins, and to single out positions in a protein that are likely to be of biological interest. These properties qualify visible volume as a powerful tool in a variety of applications, from the detailed analysis of protein structure to homology modeling, protein structural alignment, and the definition of better scoring functions for threading purposes.
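
To make the idea of "non-occluded empty space" around a residue position concrete, the following is a minimal sketch and not the procedure used in this work. It assumes that occluding atoms can be treated as spheres of one fixed radius, that visibility is probed by casting rays from the beta-carbon position out to a cutoff distance, and that the volume is summed over thin cones around each ray. The names `visible_volume`, `occluder_radius`, `max_radius`, and `n_rays` are hypothetical, and the sketch does not model the ideal tetrahedron or the rotamer directions described above.

```python
import math

def fibonacci_sphere(n):
    """Return n roughly evenly spread unit vectors (Fibonacci lattice)."""
    golden_angle = math.pi * (3.0 - math.sqrt(5.0))
    dirs = []
    for i in range(n):
        z = 1.0 - 2.0 * (i + 0.5) / n
        r = math.sqrt(max(0.0, 1.0 - z * z))
        theta = golden_angle * i
        dirs.append((r * math.cos(theta), r * math.sin(theta), z))
    return dirs

def ray_sphere_hit(origin, direction, center, radius):
    """Distance along a unit ray to the first hit on a sphere, or None."""
    oc = tuple(c - o for c, o in zip(center, origin))
    t_mid = sum(a * b for a, b in zip(oc, direction))   # closest approach
    d2 = sum(a * a for a in oc) - t_mid * t_mid         # squared miss distance
    if d2 > radius * radius:
        return None                                     # ray misses the sphere
    half = math.sqrt(radius * radius - d2)
    if t_mid + half < 0.0:
        return None                                     # sphere lies behind the origin
    return max(0.0, t_mid - half)                       # 0.0 if the origin is inside

def visible_volume(origin, occluders, occluder_radius=1.5,
                   max_radius=8.0, n_rays=2000):
    """Estimate the unobstructed volume seen from `origin` (assumed model).

    Each ray owns a solid angle of 4*pi/n_rays; a ray that travels a free
    distance d contributes a cone of volume (4*pi/n_rays) * d**3 / 3.
    """
    total = 0.0
    for direction in fibonacci_sphere(n_rays):
        free = max_radius
        for center in occluders:
            hit = ray_sphere_hit(origin, direction, center, occluder_radius)
            if hit is not None and hit < free:
                free = hit
        total += free ** 3
    return 4.0 * math.pi / (3.0 * n_rays) * total

# Hypothetical coordinates in angstroms: one probe position, one occluder.
v_free = visible_volume((0.0, 0.0, 0.0), [])
v_occluded = visible_volume((0.0, 0.0, 0.0), [(3.0, 0.0, 0.0)])
```

With no occluders every ray reaches the cutoff, so the estimate reduces to the volume of a sphere of radius `max_radius`; adding a nearby atom can only lower it. In this sketch the cutoff and the atom radius are free parameters, so it does not reproduce the parameter independence claimed for the actual measure.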
