Atoms‐in‐molecules study of the genetically encoded amino acids. III. Bond and atomic properties and their correlations with experiment including mutation‐induced changes in protein stability and genetic coding

This article presents a study of the molecular charge distributions of the genetically encoded amino acids (AA), one that builds on the previous determination of their equilibrium geometries and the demonstrated transferability of their common geometrical parameters. The properties of the charge distributions are characterized and given quantitative expression in terms of the bond and atomic properties determined within the quantum theory of atoms‐in‐molecules (QTAIM) that defines atoms and bonds in terms of the observable charge density. The properties so defined are demonstrated to be remarkably transferable, a reflection of the underlying transferability of the charge distributions of the main chain and other groups common to the AA. The use of the atomic properties in obtaining an understanding of the biological functions of the AA, whether free or bound in a polypeptide, is demonstrated by the excellent statistical correlations they yield with experimental physicochemical properties. A property of the AA side chains of particular importance is the charge separation index (CSI), a quantity previously defined as the sum of the magnitudes of the atomic charges and which measures the degree of separation of positive and negative charges in the side chain of interest. The CSI values provide a correlation with the measured free energies of transfer of capped side chain analogues, from the vapor phase to aqueous solution, yielding a linear regression equation with r2 = 0.94. The atomic volume is defined by the van der Waals isodensity surface and it, together with the CSI, which accounts for the electrostriction of the solvent, yield a linear regression (r2 = 0.98) with the measured partial molar volumes of the AAs. The changes in free energies of transfer from octanol to water upon interchanging 153 pairs of AAs and from cyclohexane to water upon interchanging 190 pairs of AAs, were modeled using only three calculated parameters (representing electrostatic and volume contributions) yielding linear regressions with r2 values of 0.78 and 0.89, respectively. These results are a prelude to the single‐site mutation‐induced changes in the stabilities of two typical proteins: ubiquitin and staphylococcal nuclease. Strong quadratic correlations (r2 ∼ 0.9) were obtained between ΔCSI upon mutation and each of the two terms ΔΔH and TΔΔS taken from recent and accurate differential scanning calorimetry experiments on ubiquitin. When the two terms are summed to yield ΔΔG, the quadratic terms nearly cancel, and the result is a simple linear fit between ΔΔG and ΔCSI with r2 = 0.88. As another example, the change in the stability of staphylococcal nuclease upon mutation has been fitted linearly (r2 = 0.83) to the sum of a ΔCSI term and a term representing the change in the van der Waals volume of the side chains upon mutation. The suggested correlation of the polarity of the side chain with the second letter of the AA triplet genetic codon is given concrete expression in a classification of the side chains in terms of their CSI values and their group dipole moments. For example, all amino acids with a pyrimidine base as their second letter in mRNA possess side‐chain CSI ≤ 2.8 (with the exception of Cys), whereas all those with CSI > 2.8 possess an purine base. The article concludes with two proposals for measuring and predicting molecular complementarity: van der Waals complementarity expressed in terms of the van der Waals isodensity surface and Lewis complementarity expressed in terms of the local charge concentrations and depletions defined by the topology of the Laplacian of the electron density. A display of the experimentally accessible Laplacian distribution for a folded protein would offer a clear picture of the operation of the “stereochemical code” proposed as the determinant in the folding process. Proteins 2003;52:360–399. © 2003 Wiley‐Liss, Inc.

[1]  Laurent Joubert,et al.  Convergence of the electrostatic interaction based on topological atoms , 2001 .

[2]  B. Dittrich,et al.  Intra- and intermolecular topological properties of amino acids: a comparative study of experimental and theoretical results. , 2002, Journal of the American Chemical Society.

[3]  Paul L. A. Popelier,et al.  Quantum molecular similarity. 1. BCP space , 1999 .

[4]  R. Bader,et al.  Atoms‐in‐molecules study of the genetically encoded amino acids. II. Computational study of molecular geometries , 2002, Proteins.

[5]  L. Serrano,et al.  Predicting changes in the stability of proteins and protein complexes: a study of more than 1000 mutations. , 2002, Journal of molecular biology.

[6]  Richard F. W. Bader,et al.  Bonded and nonbonded charge concentrations and their relation to molecular geometry and reactivity , 1984 .

[7]  B Honig,et al.  Extracting hydrophobic free energies from experimental data: relationship to protein folding and theoretical models. , 1991, Biochemistry.

[8]  P. Luger,et al.  TOPOLOGICAL ANALYSIS OF THE EXPERIMENTAL ELECTRON DENSITIES OF AMINO ACIDS. 1. D, L-ASPARTIC ACID AT 20 K , 1998 .

[9]  C. Alff-Steinberger,et al.  The genetic code and error transmission. , 1969, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Kenneth R. Adam,et al.  New density functional and atoms in molecules method of computing relative pKa values in solution , 2002 .

[11]  George I Makhatadze,et al.  Thermodynamic consequences of burial of polar and non-polar amino acid residues in the protein interior. , 2002, Journal of molecular biology.

[12]  G. Rose,et al.  Protein folding--what's the question? , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Paul L. A. Popelier,et al.  Theoretical Definition of a Functional Group and the Molecular Orbital Paradigm , 1994 .

[14]  R. Bader,et al.  Bonded and nonbonded charge concentrations and their relation to molecular geometry and reactivity , 1984 .

[15]  R. Bader,et al.  Similarity and complementarity in chemistry , 1992 .

[16]  R. Bader Atomic force microscope as an open system and the Ehrenfest force , 2000 .

[17]  Jinbo Bi,et al.  Prediction of Protein Retention Times in Anion-Exchange Chromatography Systems Using Support Vector Regression , 2002, J. Chem. Inf. Comput. Sci..

[18]  C. F. Matta Theoretical Reconstruction of the Electron Density of Large Molecules from Fragments Determined as Proper Open Quantum Systems: The Properties of the Oripavine PEO, Enkephalins, and Morphine , 2001 .

[19]  C. Landis,et al.  Curvature-induced polarization in carbon nanoshells , 2002 .

[20]  H. Krane,et al.  Fast Experiments for Charge-Density Determination: Topological Analysis and Electrostatic Potential of the Amino Acids L-Asn, dl-Glu, dl-Ser, and L-Thr. , 1999, Angewandte Chemie.

[21]  Iupaciubcommissiononbiochemic IUPAC-IUB Commission on Biochemical Nomenclature. Abbreviations and symbols for the description of the conformation of polypeptide chains. , 1971, Journal of molecular biology.

[22]  Paul L. A. Popelier,et al.  Quantum molecular similarity. Part 2: The relation between properties in BCP space and bond length , 1999 .

[23]  Paul L. A. Popelier,et al.  Quantum Molecular Similarity. 3. QTMS Descriptors , 2001, J. Chem. Inf. Comput. Sci..

[24]  R. Bader,et al.  An Atoms‐In‐Molecules study of the genetically‐encoded amino acids: I. Effects of conformation and of tautomerization on geometric, atomic, and bond properties , 2000, Proteins.

[25]  S. Pestka,et al.  On the Coding of Genetic Information , 1963 .

[26]  J. Platts Theoretical prediction of hydrogen bond basicity , 2000 .

[27]  W. Dunn,et al.  Amino acid side chain descriptors for quantitative structure-activity relationship studies of peptide analogues. , 1995, Journal of medicinal chemistry.

[28]  Chérif F. Matta,et al.  Application of the quantum theory of atoms in molecules to selected physico‐chemical and biophysical problems: Focus on correlation with experiment , 2003, J. Comput. Chem..

[29]  Peter P. Mager Multidimensional Pharmacochemistry: Design of Safer Drugs , 1985 .

[30]  Paul L. A. Popelier,et al.  Convergence of the multipole expansion for electrostatic potentials of finite topological atoms , 2000 .

[31]  Chérif F. Matta Applications of the Quantum Theory of Atoms in Molecules to Chemical and Biochemical Problems , 2002 .

[32]  R. Bader,et al.  Properties of atoms in crystals: Dielectric polarization , 2001 .

[33]  Friedrich Biegler-König,et al.  Calculation of the average properties of atoms in molecules. II , 1982 .

[34]  Topological analysis of DL-arginine monohydrate at 100 K , 2002 .

[35]  A. Wagner,et al.  Charge density and topological analysis of l-glutamine , 2001 .

[36]  R. Bader,et al.  The mapping of the conditional pair density onto the electron density , 1999 .

[37]  H. Kessler,et al.  Reproducability and transferability of topological properties; experimental charge density of the hexapeptide cyclo-(D,L-Pro)2-(L-Ala)4 monohydrate. , 2002, Acta crystallographica. Section B, Structural science.

[38]  R. Bader,et al.  Theoretical construction of a polypeptide , 1992 .

[39]  Fernando J. Martin Theoretical synthesis of macromolecules from transferable functional groups , 2001 .

[40]  Curt M. Breneman,et al.  QSPR analysis of HPLC column capacity factors for a set of high-energy materials using electronic van der waals surface property descriptors computed by transferable atom equivalent method , 1997, J. Comput. Chem..

[41]  P Coppens,et al.  Chemical applications of X-ray charge-density analysis. , 2001, Chemical reviews.

[42]  J. Platts Theoretical prediction of hydrogen bond donor capacity , 2000 .

[43]  Charles L. Brooks,et al.  Theoretical study of blocked glycine and alanine peptide analogs , 1991 .

[44]  P. Luger,et al.  Electronic Insight into an Antithrombotic Agent by High-Resolution X-Ray Crystallography. , 2001, Angewandte Chemie.

[45]  Preston J. MacDougall,et al.  Identification of molecular reactive sites with an interactive volume rendering tool , 2001 .

[46]  Elfi Kraka,et al.  Chemical Bonds without Bonding Electron Density — Does the Difference Electron‐Density Analysis Suffice for a Description of the Chemical Bond? , 1984 .

[47]  R. Bader,et al.  Prediction of the structures of hydrogen-bonded complexes using the laplacian of the charge density , 1988 .

[48]  W E Stites,et al.  Contributions of the large hydrophobic amino acids to the stability of staphylococcal nuclease. , 1990, Biochemistry.

[49]  A. R. Fresht Structure and Mechanism in Protein Science: A Guide to Enzyme Catalysis and Protein Folding , 1999 .

[50]  Richard Wolfenden,et al.  Comparing the polarities of the amino acids: side-chain distribution coefficients between the vapor phase, cyclohexane, 1-octanol, and neutral aqueous solution , 1988 .

[51]  P M Cullis,et al.  Affinities of amino acid side chains for solvent water. , 1981, Biochemistry.

[52]  Paul L. A. Popelier,et al.  Atomic partitioning of molecular electrostatic potentials , 2000 .

[53]  Frank J. Millero,et al.  The apparent molal volumes and adiabatic compressibilities of aqueous amino acids at 25.degree.C , 1978 .

[54]  Todd A. Keith,et al.  Properties of atoms in molecules: additivity and transferability of group polarizabilities , 1992 .

[55]  R. Bader,et al.  Calculation of the average properties of atoms in molecules. II , 1981 .

[56]  C. Lecomte,et al.  Experimental charge density and electrostatic potential of glycyl-L-threonine dihydrate. , 2000, Acta crystallographica. Section B, Structural science.

[57]  R. Bader Atoms in molecules : a quantum theory , 1990 .

[58]  V. Hruby Chemistry and Biochemistry of the Amino Acids. , 1986 .

[59]  E. Lattman,et al.  The crystal structure of the ternary complex of staphylococcal nuclease, Ca2+ and the inhibitor pdTp, refined at 1.65 Å , 1989, Proteins.

[60]  T. H. Lilley Physical Properties of Amino Acid Solutions , 1985 .

[61]  Cheng Chang,et al.  Properties of atoms in molecules: atomic volumes , 1987 .

[62]  H. Hinz,et al.  Thermodynamic Data for Biochemistry and Biotechnology , 1986 .