A multivariate study of the relationship between the genetic code and the physical-chemical properties of amino acids

SummaryThe 20 naturally occurring amino acids are characterized by 20 variables: pKNH2, pKCOOH, pI, molecular weight, substituent van der Waals volume, seven1H and13C nuclear magnetic resonance shift variables, and eight hydrophobicity-hydrophilicity scales. The 20-dimensional data set is reduced to a few new dimensions by principal components analysis. The three first principal components reveal relationships between the properties of the amino acids and the genetic code. Thus the amino acids coded for by adenosine (A), uracil (U), or cytosine (C) in their second codon position (corresponding to U, A, or G in the second anticodon position) are grouped in these components. No grouping was detected for the amino acids coded for by guanine (G) in the second codon position (corresponding to C in the second anticodon position). The results show that a relationship exists between the physical-chemical properties of the amino acids and which of the A (U), U (A), or C (G) nucleotide is used in the second codon (anticodon) position. The amino acids coded for by G (C) in the second codon (anticodon) position do not participate in this relationship.

[1]  M. Shimizu Molecular basis for the genetic code , 2005, Journal of Molecular Evolution.

[2]  J. Janin,et al.  Surface and inside volumes in globular proteins , 1979, Nature.

[3]  Kurt Varmuza,et al.  Pattern recognition in chemistry , 1980 .

[4]  W. Horsley,et al.  Carbon-13 magnetic resonance studies of amino acids and peptides. , 1968, Journal of the American Chemical Society.

[5]  Cluster analysis of genes in codon space , 2005, Journal of Molecular Evolution.

[6]  Manfred Schmidt,et al.  Partition coefficients of amino acids and hydrophobic parameters π of their side-chains as measured by thin-layer chromatography☆ , 1981 .

[7]  Akintola A. Aboderin,et al.  An empirical hydrophobicity scale for α-amino-acids and some of its applications , 1971 .

[8]  J. Fendler,et al.  13C Nmr Spectroscopy in Macromolecular Systems of Biochemical Interest , 1976 .

[9]  E. Möller Chemische Struktur und biologische Aktivität von Wirkstoffen. Methoden der Quantitativen Struktur‐Wirkung‐Analyse. Von J. K. Seydel und K.‐J. Schaper. Verlag Chemie, Weinheim 1979. XI, 359 S., geb. DM 118.00 , 1980 .

[10]  Svante Wold,et al.  Pattern recognition by means of disjoint principal components models , 1976, Pattern Recognit..

[11]  S. Wold,et al.  Multivariate Data Analysis in Chemistry , 1984 .

[12]  S Wold,et al.  Structure-activity analyzed by pattern recognition: the asymmetric case. , 1980, Journal of medicinal chemistry.

[13]  P. Sneath Relations between chemical structure and biological activity in peptides. , 1966, Journal of theoretical biology.

[14]  F. Crick Origin of the Genetic Code , 1967, Nature.

[15]  J. Ninio L'explosion des squences: Les annes folles 19801990 , 1983 .

[16]  J. Reuben,et al.  Nucleotide-amino acid interactions and their relation to the genetic code , 1980, Journal of Molecular Evolution.

[17]  Svante Wold,et al.  PLS DISCRIMINANT PLOTS , 1986 .

[18]  C R Woese,et al.  The molecular basis for the genetic code. , 1966, Proceedings of the National Academy of Sciences of the United States of America.

[19]  P M Cullis,et al.  Affinities of amino acid side chains for solvent water. , 1981, Biochemistry.

[20]  S. Wold,et al.  SIMCA: A Method for Analyzing Chemical Data in Terms of Similarity and Analogy , 1977 .

[21]  S. Wold Cross-Validatory Estimation of the Number of Components in Factor and Principal Components Models , 1978 .

[22]  Svante Wold,et al.  Partial least-squares method for spectrofluorimetric analysis of mixtures of humic acid and lignin sulfonate , 1983 .

[23]  John R. Jungck,et al.  The genetic code as a periodic table , 1978, Journal of Molecular Evolution.

[24]  T. Jukes,et al.  The amino acid code. , 1978, Advances in enzymology and related areas of molecular biology.

[25]  A. L. Weber,et al.  Genetic code correlations: Amino acids and their anticodon nucleotides , 1978, Journal of Molecular Evolution.

[26]  J. Fendler,et al.  Origin of the genetic code: A physical-chemical model of primitive codon assignments , 1974, Origins of life.

[27]  C. Tanford,et al.  The solubility of amino acids and two glycine peptides in aqueous ethanol and dioxane solutions. Establishment of a hydrophobicity scale. , 1971, The Journal of biological chemistry.

[28]  James C. Lacey,et al.  Experimental studies related to the origin of the genetic code and the process of protein synthesis — A review , 1983, Origins of life.

[29]  Edmund R. Malinowski,et al.  Factor Analysis in Chemistry , 1980 .

[30]  E. Kaiser,et al.  Amphiphilic secondary structure: design of peptide hormones. , 1984, Science.

[31]  O. Jardetzky,et al.  Nuclear magnetic resonance spectroscopy of amino acids, peptides, and proteins. , 1970, Advances in protein chemistry.

[32]  F. H. C. CRICK,et al.  Origin of the Genetic Code , 1967, Nature.

[33]  Bruce R. Kowalski,et al.  A comparison of five pattern recognition methods based on the classification results from six real data bases , 1979 .

[34]  R. Swanson A unifying concept for the amino acid code. , 1984, Bulletin of mathematical biology.

[35]  S. Wold,et al.  The prediction of bradykinin potentiating potency of pentapeptides. An example of a peptide quantitative structure-activity relationship. , 1986, Acta chemica Scandinavica. Series B: Organic chemistry and biochemistry.