Early Selection of the Amino Acid Alphabet Was Adaptively Shaped by Biophysical Constraints of Foldability

Whereas modern proteins rely on a quasi-universal repertoire of 20 canonical amino acids (AAs), numerous lines of evidence suggest that ancient proteins relied on a limited alphabet of 10 ‘early’ AAs, and that the 10 ‘late’ AAs were products of biosynthetic pathways. However, many non-proteinogenic AAs were also prebiotically available, which begs two fundamental questions: Why do we have the current modern amino acid alphabet, and Would proteins be able to fold into globular structures as well if different amino acids comprised the genetic code? Here, we experimentally evaluated the solubility and secondary structure propensities of several prebiotically relevant amino acids in the context of synthetic combinatorial 25-mer peptide libraries. The most prebiotically abundant linear aliphatic and basic residues were incorporated along with or in place of other early amino acids to explore these alternative sequence spaces. We show that foldability was a critical factor in the selection of the canonical alphabet. Unbranched aliphatic and short-chain basic amino acids were purged from the proteinogenic alphabet despite their high prebiotic abundance because they generate polypeptides that are over-solubilized and have low packing efficiency. Surprisingly, we find that the inclusion of a short-chain basic amino acid also decreases polypeptides’ secondary structure potential. Our results support the view that despite lacking basic residues, the early canonical alphabet was remarkably adaptive at supporting protein folding and explain why basic residues were only incorporated at a later stage of the alphabet evolution.

[1]  S. Freeland,et al.  What Would an Alien Amino Acid Alphabet Look Like and Why? , 2023, Astrobiology.

[2]  J. Vymětal,et al.  Modern and prebiotic amino acids support distinct structural profiles in proteins , 2022, Open Biology.

[3]  K. Fujishima,et al.  In Vitro Evolution Reveals Noncationic Protein–RNA Interaction Mediated by Metal Ions , 2022, Molecular biology and evolution.

[4]  S. Freeland Undefining life's biochemistry: implications for abiogenesis , 2022, Journal of the Royal Society Interface.

[5]  K. Fujishima,et al.  Peptides before and during the nucleotide world: an origins story emphasizing cooperation between proteins and nucleic acids , 2022, Journal of the Royal Society Interface.

[6]  S. Freeland,et al.  Evolution as a Guide to Designing xeno Amino Acid Alphabets , 2021, International journal of molecular sciences.

[7]  Dan S. Tawfik,et al.  Polyamines Mediate Folding of Primordial Hyperacidic Helical Proteins , 2020, Biochemistry.

[8]  Dan S. Tawfik,et al.  Primordial emergence of a nucleic acid-binding protein via phase separation and statistical ornithine-to-arginine conversion , 2020, Proceedings of the National Academy of Sciences.

[9]  Moran Frenkel-Pinter,et al.  Prebiotic Peptides: Molecular Hubs in the Origin of Life. , 2020, Chemical reviews.

[10]  Markus Meringer,et al.  Adaptive Properties of the Genetically Encoded Amino Acid Alphabet Are Inherited from Its Subsets , 2019, Scientific Reports.

[11]  A. Solis Reduced alphabet of prebiotic amino acids optimally encodes the conformational space of diverse extant protein folds , 2019, BMC Evolutionary Biology.

[12]  A. Solis,et al.  Reduced alphabet of prebiotic amino acids optimally encodes the conformational space of diverse extant protein folds , 2019, BMC Evolutionary Biology.

[13]  Jirí Vondrásek,et al.  Sequence Versus Composition: What Prescribes IDP Biophysical Properties? , 2019, Entropy.

[14]  Dana J Morrone,et al.  Genetic Code Evolution Investigated through the Synthesis and Characterisation of Proteins from Reduced‐Alphabet Libraries , 2019, Chembiochem : a European journal of chemical biology.

[15]  B. Ménez,et al.  Abiotic synthesis of amino acids in the recesses of the oceanic lithosphere , 2018, Nature.

[16]  A. Lazcano,et al.  Early Life: Embracing the RNA World , 2018, Current Biology.

[17]  J. Konvalinka,et al.  Random protein sequences can form defined secondary structures and are well-tolerated in vivo , 2017, Scientific Reports.

[18]  A. Dömling,et al.  A Systematic Protein Refolding Screen Method using the DGR Approach Reveals that Time and Secondary TSA are Essential Variables , 2017, Scientific Reports.

[19]  S. Maruyama,et al.  Origins of building blocks of life: A review , 2017, Geoscience Frontiers.

[20]  Lukasz P. Kozlowski,et al.  Proteome-pI: proteome isoelectric point database , 2016, Nucleic Acids Res..

[21]  J. Bada,et al.  On the lack of evolutionary continuity between prebiotic peptides and extant enzymes. , 2016, Physical chemistry chemical physics : PCCP.

[22]  Kumardeep Chaudhary,et al.  PEPstrMOD: structure prediction of peptides containing natural, non-natural and modified residues , 2015, Biology Direct.

[23]  Claudia Percivalle,et al.  Common origins of RNA, protein and lipid precursors in a cyanosulfidic protometabolism. , 2015, Nature Chemistry.

[24]  S. Freeland,et al.  Extraordinarily Adaptive Properties of the Genetically Encoded Amino Acids , 2015, Scientific Reports.

[25]  A. Lazcano,et al.  Norvaline and Norleucine May Have Been More Abundant Protein Components during Early Stages of Cell Evolution , 2013, Origins of Life and Evolution of Biospheres.

[26]  William M. McGee,et al.  The ornithine effect in peptide cation dissociation. , 2013, Journal of mass spectrometry : JMS.

[27]  L. Longo,et al.  Protein design at the interface of the pre-biotic and biotic worlds. , 2012, Archives of biochemistry and biophysics.

[28]  E. Milner-White,et al.  Functional Capabilities of the Earliest Peptides and the Emergence of Life , 2011, Genes.

[29]  S. Freeland “Terrestrial” Amino Acids and their Evolution , 2010 .

[30]  H. Cleaves,et al.  The origin of the biologically coded amino acids. , 2010, Journal of theoretical biology.

[31]  Hiroshi Yanagawa,et al.  Comparative characterization of random‐sequence proteins consisting of 5, 12, and 20 kinds of amino acids , 2010, Protein science : a publication of the Protein Society.

[32]  A. B. Hughes,et al.  Amino acids, peptides and proteins in organic chemistry , 2009 .

[33]  P. Higgs,et al.  A thermodynamic basis for prebiotic amino acid synthesis and the nature of the first genetic code. , 2009, Astrobiology.

[34]  H. Santana,et al.  Which Amino Acids Should Be Used in Prebiotic Chemistry Studies? , 2008, Origins of Life and Evolution of Biospheres.

[35]  W. Martin,et al.  On the origin of biochemistry at an alkaline hydrothermal vent , 2007, Philosophical Transactions of the Royal Society B: Biological Sciences.

[36]  N. C. Price,et al.  How to study proteins by circular dichroism. , 2005, Biochimica et biophysica acta.

[37]  S. Decatur,et al.  Spectroscopic evidence for backbone desolvation of helical peptides by 2,2,2-trifluoroethanol: an isotope-edited FTIR study. , 2005, Biochemistry.

[38]  Edward N. Trifonov,et al.  The Triplet Code From First Principles , 2004, Journal of biomolecular structure & dynamics.

[39]  Jan Hendrik Bredehöft,et al.  Identification of diamino acids in the Murchison meteorite. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[40]  R. L. Baldwin In Search of the Energetic Role of Peptide Hydrogen Bonds , 2003, The Journal of Biological Chemistry.

[41]  E N Trifonov,et al.  Consensus temporal order of amino acids and evolution of the triplet code. , 2000, Gene.

[42]  M. Buck,et al.  Trifluoroethanol and colleagues: cosolvents come of age. Recent studies with peptides and proteins , 1998, Quarterly Reviews of Biophysics.

[43]  I. Apostol,et al.  Incorporation of Norvaline at Leucine Positions in Recombinant Human Hemoglobin Expressed in Escherichia coli* , 1997, The Journal of Biological Chemistry.

[44]  P. Schultz,et al.  An experimental approach to evaluating the role of backbone interactions in proteins using unnatural amino acid mutagenesis. , 1997, Biochemistry.

[45]  Kit S. Lam,et al.  The “One-Bead-One-Compound” Combinatorial Library Method , 1997 .

[46]  R. L. Baldwin,et al.  Helix propensities of basic amino acids increase with the length of the side-chain. , 1996, Journal of molecular biology.

[47]  R A Houghten,et al.  Peptide libraries: Determination of relative reaction rates of protected amino acids in competitive couplings , 1994, Biopolymers.

[48]  J. Kasting,et al.  New Constraints on Precambrian Ocean Composition , 1993, The Journal of Geology.

[49]  David A. Case,et al.  Unfolding of an α‐helix in water , 1991 .

[50]  D. King,et al.  A cleavage method which minimizes side reactions following Fmoc solid phase peptide synthesis. , 1990, International journal of peptide and protein research.

[51]  K. Dill Dominant forces in protein folding. , 1990, Biochemistry.

[52]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[53]  Stanley L. Miller,et al.  Reasons for the occurrence of the twenty coded protein amino acids , 1981, Journal of Molecular Evolution.

[54]  C. Tanford,et al.  The hydrophobic effect and the organization of living matter. , 1978, Science.

[55]  C. Tanford,et al.  Empirical correlation between hydrophobic free energy and aqueous cavity surface area. , 1974, Proceedings of the National Academy of Sciences of the United States of America.

[56]  Sidney Udenfriend,et al.  Fluorescamine: A Reagent for Assay of Amino Acids, Peptides, Proteins, and Primary Amines in the Picomole Range , 1972, Science.

[57]  S. Miller,et al.  Nonprotein amino acids from spark discharges and their comparison with the murchison meteorite amino acids. , 1972, Proceedings of the National Academy of Sciences of the United States of America.

[58]  I. M. Klotz,et al.  Hydrogen Bonds between Model Peptide Groups in Solution , 1962 .

[59]  OUP accepted manuscript , 2022, Molecular Biology And Evolution.

[60]  C. Willis,et al.  Amino Acids, Peptides and Proteins in Organic Chemistry , 2009 .

[61]  D. Santi,et al.  Preparation of equimolar mixtures of peptides by adjustment of activated amino acid concentrations. , 1996, Methods in enzymology.

[62]  R. Hay,et al.  Proton ionisation constants and kinetics of base hydrolysis of some α-amino-acid esters in aqueous solution. Part III. Hydrolysis and intramolecular aminolysis of αω-diamino-acid methyl esters , 1972 .

[63]  K. Poduska,et al.  Amino acids and peptides. LII. Intramolecular aminolysis of amide bonds in derivatives of α,γ-diaminobutyric acid, α,β-diaminopropionic acid, and ornithine , 1965 .