A forward-looking suggestion for resolving the stereochemical restraints debate: ideal geometry functions.

# 2008 International Union of Crystallography Printed in Singapore – all rights reserved Jaskolski et al. (2007a) initiated a very important discussion (Jaskolski et al., 2007b; Stec, 2007; Tickle, 2007) about the accuracy of ideal geometry targets and the appropriate stringency with which they should be obeyed at various resolutions. All of the discussants agree that protein structures determined at ultra-high resolution, which should be closer to the truth, tend to have larger r.m.s. deviations from ideality. This shows that real deviations from ideality do occur in true protein structures. Another point of agreement is that the real deviations from ideality are to some extent context dependent, so that the single target values used in current refinement programs are a simplification: ‘The N–C –C valence angle has a wide spread and may have a bimodal distribution correlated with secondary structure’, Jaskolski et al. (2007a); ‘Recent results suggest that protein stereochemistry is context-dependent’, Stec (2007); ‘As they point out, the [deviations from ideality] will include real variations arising from the chemical environment’, Tickle (2007); ‘In such a situation, a single target, as used in the refinement programs, will not agree with any of the truly preferred values. This again suggests that the geometrical parameters of protein models should not be too tightly restrained to some predefined values’, Jaskolski et al. (2007b). We point out here that much of the context dependence of stereochemistry can be transformed from a frustrating reality that limits the accuracy of protein modeling to a feature that can instead enhance modeling accuracy. This transformation is possible because much of the context-dependent variation is not random but varies systematically with conformation. This systematic dependence was well documented in the mid-1990s, especially for the N—C —C bond angle, for which the expected value was seen to vary over a range of 10 (Jiang et al.; Karplus, 1996; Schafer et al., 1995). We are now updating these analyses using ultra-high resolution protein structures. These structures not only confirm the highly systematic variation of backbone bond angles with conformation (as illustrated for the N— C —C bond angle in Fig. 1), but they also reveal that the standard deviations are very low. For the N—C —C bond angle, the standard deviations in individual ’, regions vary from 1.0 to 1.7 (Fig. 1b), much lower than the standard deviation of near 2.2 derived for the N—C —C bond angle in the population as a whole (see Table 3 of Jaskolski et al., 2007a). Based on this result, it is clear that no single ideal target value can be appropriate for all of these conformations. However, it is also true that because each mean (or expected value) can be empirically discovered, this feature of protein structure can be accounted for by altering our refinement protocols to allow for ideal geometry targets that are dependent on context. In other words, we propose that more accurate refinement of protein structures at all resolution ranges will be obtained by moving beyond the ‘single ideal value’ paradigm to an ‘ideal geometry function’ paradigm (Schafer et al., 1986) in which each restraint target value varies depending on the local conformation. We are now working to develop a set of empirical conformation-dependent ‘ideal geometry functions’ for backbone bond angles and lengths that can be incorporated into crystallographic refinement software (Berkholz & Karplus, unpublished work).