Asparagine and glutamine: using hydrogen atom contacts in the choice of side-chain amide orientation.

Small-probe contact dot surface analysis, with all explicit hydrogen atoms added and their van der Waals contacts included, was used to choose between the two possible orientations for each of 1554 asparagine (Asn) and glutamine (Gln) side-chain amide groups in a dataset of 100 unrelated, high-quality protein crystal structures at 0.9 to 1.7 A resolution. For the movable-H groups, each connected, closed set of local H-bonds was optimized for both H-bonds and van der Waals overlaps. In addition to the Asn/Gln "flips", this process included rotation of OH, SH, NH3+, and methionine methyl H atoms, flip and protonation state of histidine rings, interaction with bound ligands, and a simple model of water interactions. However, except for switching N and O identity for amide flips (or N and C identity for His flips), no non-H atoms were shifted. Even in these very high-quality structures, about 20 % of the Asn/Gln side-chains required a 180 degrees flip to optimize H-bonding and/or to avoid NH2 clashes with neighboring atoms (incorporating a conservative score penalty which, for marginal cases, favors the assignment in the original coordinate file). The programs Reduce, Probe, and Mage provide not only a suggested amide orientation, but also a numerical score comparison, a categorization of the marginal cases, and a direct visualization of all relevant interactions in both orientations. Visual examination allowed confirmation of the raw score assignment for about 40 % of those Asn/Gln flips placed within the "marginal" penalty range by the automated algorithm, while uncovering only a small number of cases whose automated assignment was incorrect because of special circumstances not yet handled by the algorithm. It seems that the H-bond and the atomic-clash criteria independently look at the same structural realities: when both criteria gave a clear answer they agreed every time. But consideration of van der Waals clashes settled many additional cases for which H-bonding was either absent or approximately equivalent for the two main alternatives. With this extra information, 86 % of all side-chain amide groups could be oriented quite unambiguously. In the absence of further experimental data, it would probably be inappropriate to assign many more than this. Some of the remaining 14 % are ambiguous because of coordinate error or inadequacy of the theoretical model, but the great majority of ambiguous cases probably occur as a dynamic mix of both flip states in the actual protein molecule. The software and the 100 coordinate files with all H atoms added and optimized and with amide flips corrected are publicly available.

[1]  R. Bader,et al.  Molecular Charge Distributions and Chemical Binding. II. First‐Row Diatomic Hydrides, AH , 1967 .

[2]  M. Volkenstein,et al.  Statistical mechanics of chain molecules , 1969 .

[3]  S. Lifson,et al.  ENERGY FUNCTIONS FOR PEPTIDES AND PROTEINS PART 1, DERIVATION OF A CONSISTENT FORCE FIELD INCLUDING THE HYDROGEN BOND FROM AMIDE CRYSTALS , 1974 .

[4]  R. Huber,et al.  The molecular structure of a dimer composed of the variable portions of the Bence-Jones protein REI refined at 2.0-A resolution. , 1975, Biochemistry.

[5]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1978, Archives of biochemistry and biophysics.

[6]  H. Berendsen,et al.  Interaction Models for Water in Relation to Protein Hydration , 1981 .

[7]  R. Dickerson,et al.  Structure of cytochrome c551 from Pseudomonas aeruginosa refined at 1.6 A resolution and comparison of the two redox forms. , 1982, Journal of molecular biology.

[8]  M. L. Connolly Solvent-accessible surfaces of proteins and nucleic acids. , 1983, Science.

[9]  Djordje Musil,et al.  The high-resolution X-ray crystal structure of the complex formed between subtilisin Carlsberg and eglin c, an elastase inhibitor from the leech Hirudo medicinalis Structural analysis, subtilisin structure and interface geometry , 1987 .

[10]  W. Bode,et al.  The high-resolution X-ray crystal structure of the complex formed between subtilisin Carlsberg and eglin c, an elastase inhibitor from the leech Hirudo medicinalis. Structural analysis, subtilisin structure and interface geometry. , 1987, European journal of biochemistry.

[11]  L. H. Jensen,et al.  X-Ray Structure Determination: A Practical Guide , 1989 .

[12]  J. Zou,et al.  Improved methods for building protein models in electron density maps and the location of errors in these models. , 1991, Acta crystallographica. Section A, Foundations of crystallography.

[13]  R. Ornstein,et al.  A method for determining the positions of polar hydrogens added to a protein structure that maximizes protein hydrogen bonding , 1992, Proteins.

[14]  D C Richardson,et al.  The kinemage: A tool for scientific communication , 1992, Protein science : a publication of the Protein Society.

[15]  D. McRee Practical Protein Crystallography , 1993 .

[16]  K Morikawa,et al.  Crystal structures of ribonuclease F1 of Fusarium moniliforme in its free form and in complex with 2'GMP. , 1993, Journal of molecular biology.

[17]  D C Richardson,et al.  Kinemages--simple macromolecular graphics for interactive teaching and publication. , 1994, Trends in biochemical sciences.

[18]  J. Fontecilla-Camps,et al.  Crystal structure of toxin II from the scorpion Androctonus australis Hector refined at 1.3 A resolution. , 1994, Journal of molecular biology.

[19]  Restored heptad pattern continuity does not alter the folding of a four-alpha-helix bundle. , 1994, Nature structural biology.

[20]  R. Wahl,et al.  1.56 Å structure of mature truncated human fibroblast collagenase , 1994, Proteins.

[21]  Z. Derewenda,et al.  The occurrence of C-H...O hydrogen bonds in proteins. , 1995, Journal of molecular biology.

[22]  J. Thornton,et al.  The application of hydrogen bonding analysis in X-ray crystallography to help orientate asparagine, glutamine and histidine side chains. , 1995, Protein engineering.

[23]  C. Higgins,et al.  The crystal structures of the oligopeptide-binding protein OppA complexed with tripeptide and tetrapeptide ligands. , 1995, Structure.

[24]  Z Dauter,et al.  The Escherichia coli Malonyl-CoA:Acyl Carrier Protein Transacylase at 1.5-Å Resolution. , 1995, The Journal of Biological Chemistry.

[25]  R. Dominguez,et al.  The crystal structure of endoglucanase CelA, a family 8 glycosyl hydrolase from Clostridium thermocellum. , 1996, Structure.

[26]  A. Fisher,et al.  The 1.5-Å Resolution Crystal Structure of Bacterial Luciferase in Low Salt Conditions* , 1996, The Journal of Biological Chemistry.

[27]  C. Sander,et al.  Positioning hydrogen atoms by optimizing hydrogen‐bond networks in protein structures , 1996, Proteins.

[28]  M. Bolognesi,et al.  Three-dimensional structure of Xenopus laevis Cu,Zn superoxide dismutase b determined by X-ray crystallography at 1.5 A resolution. , 1995, Acta crystallographica. Section D, Biological crystallography.

[29]  M. Zalis,et al.  Visualizing and quantifying molecular goodness-of-fit: small-probe contact dots with explicit hydrogen atoms. , 1999, Journal of molecular biology.

[30]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..