Interplay of physics and evolution in the likely origin of protein biochemical function

The intrinsic ability of protein structures to exhibit the geometric and sequence properties required for ligand binding without evolutionary selection is shown by the coincidence of the properties of pockets in native, single domain proteins with those in computationally generated, compact homopolypeptide, artificial (ART) structures. The library of native pockets is covered by a remarkably small number of representative pockets (∼400), with virtually every native pocket having a statistically significant match in the ART library, suggesting that the library is complete. When sequences are selected for ART structures based on fold stability, pocket sequence conservation is coincident to native. The fact that structurally and sequentially similar pockets occur across fold classes combined with the small number of representative pockets in native proteins implies that promiscuous interactions are inherent to proteins. Based on comparison of PDB (real, single domain protein structures found in the Protein Data Bank) and ART structures and pockets, the widespread assumption that the co-occurrence of global structure, pocket similarity, and amino acid conservation demands an evolutionary relationship between proteins is shown to significantly underestimate the random background probability. Indeed, many features of biochemical function arise from the physical properties of proteins that evolution likely fine-tunes to achieve specificity. Finally, our study suggests that a repertoire of thermodynamically (marginally) stable proteins could engage in many of the biochemical reactions needed for living systems without selection for function, a conclusion with significant implications for the origin of life.

[1]  Ronald J. Quinn,et al.  Structural Insights into the Molecular Basis of the Ligand Promiscuity , 2012, J. Chem. Inf. Model..

[2]  Jaime Prilusky,et al.  Automated analysis of interatomic contacts in proteins , 1999, Bioinform..

[3]  Dan S. Tawfik,et al.  Enzyme promiscuity: evolutionary and mechanistic aspects. , 2006, Current opinion in chemical biology.

[4]  A. Kolinski,et al.  Derivation of protein‐specific pair potentials based on weak sequence fragment similarity , 2000, Proteins.

[5]  J. Skolnick,et al.  Erratum: Scoring function for automated assessment of protein structure template quality (Proteins: Structure, Function and Genetics (2004) 57, (702-710)) , 2007 .

[6]  David Baker,et al.  Bridging the gaps in design methodologies by evolutionary optimization of the stability and proficiency of designed Kemp eliminase KE59 , 2012, Proceedings of the National Academy of Sciences.

[7]  Johannes Söding,et al.  Fast and accurate automatic structure prediction with HHpred , 2009, Proteins.

[8]  David S. Wishart,et al.  T3DB: a comprehensively annotated database of common toxins and their targets , 2009, Nucleic Acids Res..

[9]  Yang Zhang,et al.  How significant is a protein structure similarity with TM-score = 0.5? , 2010, Bioinform..

[10]  Dan S. Tawfik,et al.  Enzyme promiscuity: a mechanistic and evolutionary perspective. , 2010, Annual review of biochemistry.

[11]  Xin Wen,et al.  BindingDB: a web-accessible database of experimentally determined protein–ligand binding affinities , 2006, Nucleic Acids Res..

[12]  J. Skolnick,et al.  Automated structure prediction of weakly homologous proteins on a genomic scale. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Jeffrey Skolnick,et al.  Protein structure prediction by pro-Sp3-TASSER. , 2009, Biophysical journal.

[14]  Fabrizio Grandoni,et al.  Combinatorial bounds via measure and conquer: Bounding minimal dominating sets and applications , 2008, TALG.

[15]  Ziding Zhang,et al.  Similarity networks of protein binding sites , 2005, Proteins.

[16]  M. Wilmanns,et al.  Directed evolution of a ( ba ) 8-barrel enzyme to catalyze related reactions in two different metabolic pathways , 2000 .

[17]  J. Skolnick,et al.  Further Evidence for the Likely Completeness of the Library of Solved Single Domain Protein Structures , 2011 .

[18]  J. Skolnick,et al.  Further evidence for the likely completeness of the library of solved single domain protein structures. , 2012, The journal of physical chemistry. B.

[19]  Haruki Nakamura,et al.  The worldwide Protein Data Bank (wwPDB): ensuring a single, uniform archive of PDB data , 2006, Nucleic Acids Res..

[20]  Yang Zhang,et al.  Scoring function for automated assessment of protein structure template quality , 2004, Proteins.

[21]  Jeffrey Skolnick,et al.  Fr-TM-align: a new protein structural alignment method based on fragment alignments and the TM-score , 2008, BMC Bioinformatics.

[22]  David Baker,et al.  Optimization of the In-silico-designed Kemp Eliminase Ke70 by Computational Design and Directed Evolution Journal of Molecular Biology , 2022 .

[23]  Jyoti Pande,et al.  Phage display: concept, innovations, applications and future. , 2010, Biotechnology advances.

[24]  David Baker,et al.  Rational Design of Intercellular Adhesion Molecule-1 (ICAM-1) Variants for Antagonizing Integrin Lymphocyte Function-associated Antigen-1-dependent Adhesion* , 2006, Journal of Biological Chemistry.

[25]  Michael H Hecht,et al.  Cofactor binding and enzymatic activity in an unevolved superfamily of de novo designed 4‐helix bundle proteins , 2009, Protein science : a publication of the Protein Society.

[26]  M Wilmanns,et al.  Directed evolution of a (beta alpha)8-barrel enzyme to catalyze related reactions in two different metabolic pathways. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[27]  Michal Brylinski,et al.  The continuity of protein structure space is an intrinsic property of proteins , 2009, Proceedings of the National Academy of Sciences.

[28]  Philip E. Bourne,et al.  PROMISCUOUS: a database for network-based drug-repositioning , 2010, Nucleic Acids Res..

[29]  A. Emons,et al.  Boekbespreking: Molecular biology of the cell, B. Alberts, D. Bray, J. Lewis, M. Raff, K. Robers, D.J. Watson. Garland Publ., New York. 1989. , 1990 .

[30]  Markus Fischer,et al.  Structural relationships among proteins with different global topologies and their implications for function annotation strategies , 2009, Proceedings of the National Academy of Sciences.

[31]  Jeffrey Skolnick,et al.  APoc: large-scale identification of similar protein pockets , 2013, Bioinform..

[32]  Dan S. Tawfik,et al.  Catalytic versatility and backups in enzyme active sites: the case of serum paraoxonase 1. , 2012, Journal of molecular biology.

[33]  Dan S. Tawfik Messy biology and the origins of evolutionary innovations. , 2010, Nature chemical biology.

[34]  M. Schroeder,et al.  LIGSITEcsc: predicting ligand binding sites using the Connolly surface and degree of conservation , 2006, BMC Structural Biology.

[35]  J. Davies,et al.  Molecular Biology of the Cell , 1983, Bristol Medico-Chirurgical Journal.

[36]  J. Skolnick,et al.  Why not consider a spherical protein? Implications of backbone hydrogen bonding for protein structure and function. , 2011, Physical chemistry chemical physics : PCCP.

[37]  Dan S. Tawfik,et al.  Role of chemistry versus substrate binding in recruiting promiscuous enzyme functions. , 2011, Biochemistry.

[38]  Dan S. Tawfik,et al.  The moderately efficient enzyme: evolutionary and physicochemical trends shaping enzyme parameters. , 2011, Biochemistry.

[39]  Ulrich Derigs,et al.  The shortest augmenting path method for solving assignment problems — Motivation and computational experience , 1985 .

[40]  John P. Overington,et al.  ChEMBL: a large-scale bioactivity database for drug discovery , 2011, Nucleic Acids Res..

[41]  Ryo Takeuchi,et al.  Computational redesign of a mononuclear zinc metalloenzyme for organophosphate hydrolysis. , 2012, Nature chemical biology.

[42]  R. Jensen Enzyme recruitment in evolution of new function. , 1976, Annual review of microbiology.

[43]  W. Kabsch A solution for the best rotation to relate two sets of vectors , 1976 .

[44]  Michal Brylinski,et al.  Comparison of structure‐based and threading‐based approaches to protein functional annotation , 2010, Proteins.

[45]  Jeffrey Skolnick,et al.  The distribution of ligand-binding pockets around protein-protein interfaces suggests a general mechanism for pocket formation , 2012, Proceedings of the National Academy of Sciences.

[46]  Jeffrey Skolnick,et al.  Fast procedure for reconstruction of full‐atom protein models from reduced representations , 2008, J. Comput. Chem..