An Ancient Fingerprint Indicates the Common Ancestry of Rossmann-Fold Enzymes Utilizing Different Ribose-Based Cofactors

Nucleoside-based cofactors are presumed to have preceded proteins. The Rossmann fold is one of the most ancient and functionally diverse protein folds, and most Rossmann enzymes utilize nucleoside-based cofactors. We analyzed an omnipresent Rossmann ribose-binding interaction: a carboxylate side chain at the tip of the second β-strand (β2-Asp/Glu). We identified a canonical motif, defined by the β2-topology and unique geometry. The latter relates to the interaction being bidentate (both ribose hydroxyls interacting with the carboxylate oxygens), to the angle between the carboxylate and the ribose, and to the ribose’s ring configuration. We found that this canonical motif exhibits hallmarks of divergence rather than convergence. It is uniquely found in Rossmann enzymes that use different cofactors, primarily SAM (S-adenosyl methionine), NAD (nicotinamide adenine dinucleotide), and FAD (flavin adenine dinucleotide). Ribose-carboxylate bidentate interactions in other folds are not only rare but also have a different topology and geometry. We further show that the canonical geometry is not dictated by a physical constraint—geometries found in noncanonical interactions have similar calculated bond energies. Overall, these data indicate the divergence of several major Rossmann-fold enzyme classes, with different cofactors and catalytic chemistries, from a common pre-LUCA (last universal common ancestor) ancestor that possessed the β2-Asp/Glu motif.

[1]  N. Grishin Fold change in evolution of protein structures. , 2001, Journal of structural biology.

[2]  R. Blumenthal,et al.  Many paths to methyltransfer: a chronicle of convergence. , 2003, Trends in biochemical sciences.

[3]  A. Dean,et al.  Protein engineering reveals ancient adaptive replacements in isocitrate dehydrogenase. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Ying Gao,et al.  Bioinformatics Applications Note Sequence Analysis Cd-hit Suite: a Web Server for Clustering and Comparing Biological Sequences , 2022 .

[5]  Michael G. Rossmann,et al.  Chemical and biological evolution of a nucleotide-binding protein , 1974, Nature.

[6]  Mark N. Wass,et al.  Convergent evolution of enzyme active sites is not a rare phenomenon. , 2007, Journal of molecular biology.

[7]  Margarita Osadchy,et al.  Maps of protein structure space reveal a fundamental relationship between protein structure and function , 2011, Proceedings of the National Academy of Sciences.

[8]  M G Rossmann,et al.  Comparison of super-secondary structures in proteins. , 1973, Journal of molecular biology.

[9]  David A. Lee,et al.  CATH: comprehensive structural and functional annotations for genome sequences , 2014, Nucleic Acids Res..

[10]  P. R. Sibbald,et al.  The P-loop--a common motif in ATP- and GTP-binding proteins. , 1990, Trends in biochemical sciences.

[11]  E. Koonin,et al.  Trends in protein evolution inferred from sequence and structure analysis. , 2002, Current opinion in structural biology.

[12]  Andrei N. Lupas,et al.  β-Propeller Blades as Ancestral Peptides in Protein Evolution , 2013, PloS one.

[13]  Gustavo Caetano-Anollés,et al.  The proteomic complexity and rise of the primordial ancestor of diversified life , 2011, BMC Evolutionary Biology.

[14]  C. Orengo,et al.  Plasticity of enzyme active sites. , 2002, Trends in biochemical sciences.

[15]  C. Ponting,et al.  On the evolution of protein folds: are similar motifs in different protein folds the result of convergence, insertion, or relics of an ancient peptide world? , 2001, Journal of structural biology.

[16]  D. Truhlar,et al.  The M06 suite of density functionals for main group thermochemistry, thermochemical kinetics, noncovalent interactions, excited states, and transition elements: two new functionals and systematic testing of four M06-class functionals and 12 other functionals , 2008 .

[17]  E. Trifonov,et al.  Protein Modules Conserved Since LUCA , 2006, Journal of Molecular Evolution.

[18]  Dan S. Tawfik,et al.  The robustness and innovability of protein folds. , 2014, Current opinion in structural biology.

[19]  M. Strauss,et al.  Book ReviewChance and Necessity: An essay on the natural philosophy of modern biology. , 1972 .

[20]  Mark A. Ratner,et al.  6‐31G* basis set for third‐row atoms , 2001, J. Comput. Chem..

[21]  P. Terpstra,et al.  Prediction of the Occurrence of the ADP-binding βαβ-fold in Proteins, Using an Amino Acid Sequence Fingerprint , 1986 .

[22]  E. Koonin The Logic of Chance: The Nature and Origin of Biological Evolution , 2011 .

[23]  W. Gilbert Origin of life: The RNA world , 1986, Nature.

[24]  A. Mushegian,et al.  Natural history of S-adenosylmethionine-binding proteins , 2005, BMC Structural Biology.

[25]  Gustavo Caetano-Anollés,et al.  The origin of modern metabolic networks inferred from phylogenomic analysis of protein architecture , 2007, Proceedings of the National Academy of Sciences.

[26]  A. Jeltsch,et al.  Biotin-Avidin Microplate Assay for the Quantitative Analysis of Enzymatic Methylation of DNA by DNA Methyltransferases , 2000, Biological chemistry.

[27]  P. Hanson,et al.  AAA+ proteins: have engine, will work , 2005, Nature Reviews Molecular Cell Biology.

[28]  F. H. C. CRICK,et al.  Origin of the Genetic Code , 1967, Nature.

[29]  S. Lacks,et al.  Crystal structure of the DpnM DNA adenine methyltransferase from the DpnII restriction system of streptococcus pneumoniae bound to S-adenosylmethionine. , 1998, Structure.

[30]  E V Koonin,et al.  A common set of conserved motifs in a vast variety of putative nucleic acid-dependent ATPases including MCM proteins involved in the initiation of eukaryotic DNA replication. , 1993, Nucleic acids research.

[31]  I. Berezovsky,et al.  Protein function from its emergence to diversity in contemporary proteins , 2015, Physical biology.

[32]  J. Hurley,et al.  Determinants of cofactor specificity in isocitrate dehydrogenase: structure of an engineered NADP+ --> NAD+ specificity-reversal mutant. , 1996, Biochemistry.

[33]  P. Terpstra,et al.  Prediction of the occurrence of the ADP-binding beta alpha beta-fold in proteins, using an amino acid sequence fingerprint. , 1986, Journal of molecular biology.

[34]  F. Arnold,et al.  General approach to reversing ketol-acid reductoisomerase cofactor dependence from NADPH to NADH , 2013, Proceedings of the National Academy of Sciences.

[35]  C. Cramer,et al.  Universal solvation model based on solute electron density and on a continuum model of the solvent defined by the bulk dielectric constant and atomic surface tensions. , 2009, The journal of physical chemistry. B.

[36]  Detlef D. Leipe,et al.  Evolution and classification of P-loop kinases and related proteins. , 2003, Journal of molecular biology.

[37]  A. Efimov Structural trees for protein superfamilies , 1997, Proteins.

[38]  D. Baker,et al.  Control over overall shape and size in de novo designed proteins , 2015, Proceedings of the National Academy of Sciences.

[39]  Orly Dym,et al.  De Novo Evolutionary Emergence of a Symmetrical Protein Is Shaped by Folding Constraints , 2016, Cell.

[40]  Lei Xie,et al.  Detecting evolutionary relationships across existing fold space, using sequence order-independent profile–profile alignments , 2008, Proceedings of the National Academy of Sciences.

[41]  D. Baker,et al.  Principles for designing ideal protein structures , 2012, Nature.

[42]  E. Fauman Structure and evolution of AdoMet-dependent methyltransferase. , 1999 .

[43]  Rotem Sorek,et al.  Correlated Occurrence and Bypass of Frame-Shifting Insertion-Deletions (InDels) to Give Functional Proteins , 2013, PLoS genetics.

[44]  Donald G. Truhlar,et al.  The M06 suite of density functionals for main group thermochemistry, thermochemical kinetics, noncovalent interactions, excited states, and transition elements: two new functionals and systematic testing of four M06 functionals and 12 other functionals , 2008 .

[45]  Yuxing Liao,et al.  ECOD: An Evolutionary Classification of Protein Domains , 2014, PLoS Comput. Biol..

[46]  Charlotte M. Deane,et al.  Exploring Fold Space Preferences of New-born and Ancient Protein Superfamilies , 2013, PLoS Comput. Biol..

[47]  Dan S. Tawfik,et al.  Evolutionary transitions to new DNA methyltransferases through target site expansion and shrinkage , 2012, Nucleic acids research.

[48]  B. Henrissat,et al.  Structures and mechanisms of glycosyl hydrolases. , 1995, Structure.

[49]  José Arcadio Farías-Rico,et al.  Evolutionary relationship of two ancient protein superfolds. , 2014, Nature chemical biology.

[50]  Janusz M. Bujnicki,et al.  Comparison of protein structures reveals monophyletic origin of AdoMet-dependent methyltransferase family and mechanistic convergence rather than recent differentiation of N4-cytosine and N6-adenine DNA methylation , 1999, Silico Biol..

[51]  Cathy H. Wu,et al.  Structural and functional studies of S-adenosyl-L-methionine binding proteins: a ligand-centric approach , 2013, BMC Structural Biology.

[52]  T. Imanaka,et al.  Analysis of interaction between the Arthrobacter sarcosine oxidase and the coenzyme flavin adenine dinucleotide by site-directed mutagenesis , 1996, Applied and environmental microbiology.

[53]  G. F. Joyce The antiquity of RNA-based evolution , 2002, Nature.

[54]  Bin Wu,et al.  H-bonding activation in highly regioselective acetylation of diols. , 2013, The Journal of organic chemistry.

[55]  Huaiyu Mi,et al.  The InterPro protein families database: the classification resource after 15 years , 2014, Nucleic Acids Res..

[56]  Detlef D. Leipe,et al.  Classification and evolution of P-loop GTPases and related ATPases. , 2002, Journal of molecular biology.

[57]  Dan S. Tawfik,et al.  Divergence and Convergence in Enzyme Evolution: Parallel Evolution of Paraoxonases from Quorum-quenching Lactonases* , 2011, The Journal of Biological Chemistry.

[58]  R. Russell,et al.  Modular architecture of nucleotide-binding pockets , 2010, Nucleic acids research.

[59]  John B. O. Mitchell,et al.  The Natural History of Biocatalytic Mechanisms , 2014, PLoS Comput. Biol..

[60]  D. Moras,et al.  D-glyceraldehyde-3-phosphate dehydrogenase: three-dimensional structure and evolutionary significance. , 1973, Proceedings of the National Academy of Sciences of the United States of America.

[61]  Lvek,et al.  Evolution of protein structures and functions , 2022 .

[62]  A M Lesk,et al.  NAD-binding domains of dehydrogenases. , 1995, Current opinion in structural biology.

[63]  M G Rossmann,et al.  The evolution of dehydrogenases and kinases. , 1975, CRC critical reviews in biochemistry.

[64]  Igor N. Berezovsky,et al.  Prototypes of elementary functional loops unravel evolutionary connections between protein functions , 2010, Bioinform..

[65]  C. Orengo,et al.  Protein Superfamily Evolution and the Last Universal Common Ancestor (LUCA) , 2006, Journal of Molecular Evolution.

[66]  Thomas Madej,et al.  Analysis of protein homology by assessing the (dis)similarity in protein loop regions , 2004, Proteins.

[67]  Gustavo Caetano-Anollés,et al.  Origin and Evolution of Protein Fold Designs Inferred from Phylogenomic Analysis of CATH Domain Structures in Proteomes , 2013, PLoS Comput. Biol..

[68]  David Eisenberg,et al.  GXXXG and GXXXA motifs stabilize FAD and NAD(P)-binding Rossmann folds through C(alpha)-H... O hydrogen bonds and van der waals interactions. , 2002, Journal of molecular biology.

[69]  E. Koonin,et al.  SAP - a putative DNA-binding motif involved in chromosomal organization. , 2000, Trends in biochemical sciences.

[70]  Y. Mély,et al.  Determinants of coenzyme specificity in glyceraldehyde-3-phosphate dehydrogenase: role of the acidic residue in the fingerprint region of the nucleotide binding fold. , 1993, Biochemistry.

[71]  Lisa N Kinch,et al.  Evolution of protein structures and functions. , 2002, Current opinion in structural biology.

[72]  W R Taylor,et al.  Recognition of super-secondary structure in proteins. , 1984, Journal of molecular biology.

[73]  S. Gould The Structure of Evolutionary Theory , 2002 .

[74]  Michael Y. Galperin,et al.  Divergence and Convergence in Enzyme Evolution , 2011, The Journal of Biological Chemistry.

[75]  Donald G. Truhlar,et al.  Correlation and solvation effects on heterocyclic equilibria in aqueous solution , 1993 .

[76]  Ann M Stock,et al.  Crystal structure of the chemotaxis receptor methyltransferase CheR suggests a conserved structural motif for binding S-adenosylmethionine. , 1997, Structure.

[77]  O. Dym,et al.  Sequence‐structure analysis of FAD‐containing proteins , 2001, Protein science : a publication of the Protein Society.

[78]  Hong-yu Zhang,et al.  Characters of very ancient proteins. , 2008, Biochemical and biophysical research communications.

[79]  A. Edison Linus Pauling and the planar peptide bond , 2001, Nature Structural Biology.