In this paper, the variation of the values of dihedral angles in proteins is divided into two categories by analyzing distributions in a database of structures determined at a resolution of 1.8 A or better [Lovell et al. (2003), Proteins Struct. Funct. Genet. 50, 437-450]. The first analysis uses the torsion angle for the Calpha-Cbeta bond (chi1) of all Gln, Glu, Arg and Lys residues ('unbranched set'). Plateaued values at low B values imply a root-mean-square deviation (RMSD) of just 9 degrees for chi1 related to intrinsic structural differences between proteins. Extrapolation to high resolution gives a value of 11 degrees , while over the entire database the RMSD is 13.4 degrees . The assumption that the deviations arise from independent intrinsic and extrinsic sources gives approximately 10 degrees as the RMSD for chi1 of these unbranched side chains arising from all disorder and error over the entire set. It is also found that the decrease in chi1 deviation that is correlated with higher resolution structures is almost entirely a consequence of the higher percentage of low-B-value side chains in those structures and furthermore that the crystal temperature at which diffraction data are collected has a negligible effect on intrinsic deviation. Those intrinsic aspects of the distributions not related to statistical or other errors, data incompleteness or disorder correlate with energies of model compounds computed with high-level quantum mechanics. Mean side-chain torsion angles for specific rotamers correlate well with local energy minima of Ace-Leu-Nme, Ace-Ile-Nme and Ace-Met-Nme. Intrinsic RMSD values in examples with B < or = 20 A2 correlate inversely with calculated values for the relevant rotational energy barriers: from a low of 6.5 degrees for chi1 of some rotamers of Ile to a high of 14 degrees for some Met chi3 for fully tetrahedral angles and much higher for chi angles around bonds that are tetrahedral at one end and planar at the other (e.g. 30 degrees for chi2 of the gauche- rotamer of Phe). For the lower barrier Met chi3 rotations there are relatively more well validated cases near eclipsed values and calculated torques from the rest of the protein structure either confine or force the Cepsilon atom into the strained position. These results can be used to evaluate the variability and accuracy of chi angles in crystal structures and also to decide whether to restrain side-chain angles in refinement as a function of the resolution and atomic B values, depending on whether one aims for a realistic distribution of values or a spread that is statistically suitable to the probable data-set errors.
[1]
J. Ponder,et al.
Tertiary templates for proteins. Use of packing criteria in the enumeration of allowed sequences for different structural classes.
,
1987,
Journal of molecular biology.
[2]
Jan Hermans,et al.
Boltzmann‐type distribution of side‐chain conformation in proteins
,
2003,
Protein science : a publication of the Protein Society.
[3]
J. Thornton,et al.
PROCHECK: a program to check the stereochemical quality of protein structures
,
1993
.
[4]
W. Kabsch,et al.
Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features
,
1983,
Biopolymers.
[5]
Zukang Feng,et al.
Validation of protein structures for protein data bank.
,
2003,
Methods in enzymology.
[6]
良二 上田.
J. Appl. Cryst.の発刊に際して
,
1970
.
[7]
Justin T. Fermann,et al.
The torsional conformations of butane: Definitive energetics from ab initio methods
,
1997
.
[8]
H. Berendsen,et al.
A consistent empirical potential for water–protein interactions
,
1984
.
[9]
Roland L. Dunbrack,et al.
Backbone-dependent rotamer library for proteins. Application to side-chain prediction.
,
1993,
Journal of molecular biology.
[10]
M. Zalis,et al.
Visualizing and quantifying molecular goodness-of-fit: small-probe contact dots with explicit hydrogen atoms.
,
1999,
Journal of molecular biology.
[11]
Tamar Schlick,et al.
Computational Methods for Macromolecules: Challenges and Applications
,
2002
.