Directed Chemical Evolution with an Outsized Genetic Code

The first demonstration that macromolecules could be evolved in a test tube was reported twenty-five years ago. That breakthrough meant that billions of years of chance discovery and refinement could be compressed into a few weeks, and provided a powerful tool that now dominates all aspects of protein engineering. A challenge has been to extend this scientific advance into synthetic chemical space: to enable the directed evolution of abiotic molecules. The problem has been tackled in many ways. These include expanding the natural genetic code to include unnatural amino acids, engineering polyketide and polypeptide synthases to produce novel products, and tagging combinatorial chemistry libraries with DNA. Importantly, there is still no small-molecule analog of directed protein evolution, i.e. a substantiated approach for optimizing complex (≥ 10^9 diversity) populations of synthetic small molecules over successive generations. We present a key advance towards this goal: a tool for genetically-programmed synthesis of small-molecule libraries from large chemical alphabets. The approach accommodates alphabets that are one to two orders of magnitude larger than any in Nature, and facilitates evolution within the chemical spaces they create. This is critical for small molecules, which are built up from numerous and highly varied chemical fragments. We report a proof-of-concept chemical evolution experiment utilizing an outsized genetic code, and demonstrate that fitness traits can be passed from an initial small-molecule population through to the great-grandchildren of that population. The results establish the practical feasibility of engineering synthetic small molecules through accelerated evolution.

[1]  L. Orzechowski,et al.  DNA Compatible Multistep Synthesis and Applications to DNA Encoded Libraries. , 2015, Bioconjugate chemistry.

[2]  L. Gold,et al.  Systematic evolution of ligands by exponential enrichment: RNA ligands to bacteriophage T4 DNA polymerase. , 1990, Science.

[3]  Yingming Zhao,et al.  Selective Enrichment of Thiophosphorylated Polypeptides as a Tool for the Analysis of Protein Phosphorylation* , 2003, Molecular & Cellular Proteomics.

[4]  Sachdev S Sidhu,et al.  Molecular recognition by a binary code. , 2005, Journal of molecular biology.

[5]  W. Stemmer Rapid evolution of a protein in vitro by DNA shuffling , 1994, Nature.

[6]  Sindy K. Y. Tang,et al.  Prospective identification of parasitic sequences in phage display screens , 2013, Nucleic acids research.

[7]  P. Harbury,et al.  Synthetic ligands discovered by in vitro selection. , 2007, Journal of the American Chemical Society.

[8]  Hongfeng Deng,et al.  Discovery of highly potent and selective small molecule ADAMTS-5 inhibitors that inhibit human cartilage degradation via encoded library technology (ELT). , 2012, Journal of medicinal chemistry.

[9]  E. Krebs,et al.  Role of multiple basic residues in determining the substrate specificity of cyclic AMP-dependent protein kinase. , 1977, The Journal of biological chemistry.

[10]  Dan S. Tawfik,et al.  Protein engineers turned evolutionists , 2007, Nature Methods.

[11]  Frederic A. Fellouse,et al.  High-throughput generation of synthetic antibodies from highly functional minimalist phage-displayed libraries. , 2007, Journal of molecular biology.

[12]  O. Chaloin,et al.  Selection of a synthetic glycan oligomer from a library of DNA-templated fragments against DC-SIGN and inhibition of HIV gp120 binding to dendritic cells. , 2011, Chemical communications.

[13]  David M. Wilson,et al.  Inhibition of PAD4 activity is sufficient to disrupt mouse and human NET formation , 2015, Nature chemical biology.

[14]  J. Shabb Physiological substrates of cAMP-dependent protein kinase. , 2001, Chemical reviews.

[15]  Sachdev S Sidhu,et al.  The intrinsic contributions of tyrosine, serine, glycine and arginine to the affinity and specificity of antibodies. , 2008, Journal of molecular biology.

[16]  T. Kortemme,et al.  Ionization-reactivity relationships for cysteine thiols in polypeptides. , 1998, Biochemistry.

[17]  M. K. Pflum,et al.  Exploring Kinase Cosubstrate Promiscuity: Monitoring Kinase Activity through Dansylation , 2009, Chembiochem : a European journal of chemical biology.

[18]  Zhengrong Zhu,et al.  Application of encoded library technology (ELT) to a protein-protein interaction target: discovery of a potent class of integrin lymphocyte function-associated antigen 1 (LFA-1) antagonists. , 2014, Bioorganic & medicinal chemistry.

[19]  David R. Liu,et al.  Methods for the directed evolution of proteins , 2015, Nature Reviews Genetics.

[20]  F. Hofmann,et al.  Determination of cyclic nucleotide-dependent protein kinase substrate specificity by the use of peptide libraries on cellulose paper. , 1995, Biochemistry.

[21]  Zhou Songyang,et al.  Use of an oriented peptide library to determine the optimal substrates of protein kinases , 1994, Current Biology.

[22]  E. Tate,et al.  Generation and screening of an oligonucleotide-encoded synthetic peptide library. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[23]  Zhengrong Zhu,et al.  Encoded Library Technology Screening of Hepatitis C Virus NS4B Yields a Small-Molecule Compound Series with In Vitro Replicon Activity , 2015, Antimicrobial Agents and Chemotherapy.

[24]  P. Harbury,et al.  Highly Parallel Translation of DNA Sequences into Small Molecules , 2012, PloS one.

[25]  G H Snyder,et al.  Electrostatic influence of local cysteine environments on disulfide exchange kinetics. , 1981, Biochemistry.

[26]  V. Allfrey,et al.  Affinity purification of newly phosphorylated protein molecules. Thiophosphorylation and recovery of histones H1, H2B, and H3 and the high mobility group protein HMG-1 using adenosine 5'-O-(3-thiotriphosphate) and cyclic AMP-dependent protein kinase. , 1980, The Journal of biological chemistry.

[27]  Michael W Deem,et al.  Amino acid alphabet size in protein evolution experiments: better to search a small library thoroughly or a large library sparsely? , 2008, Protein engineering, design & selection : PEDS.

[28]  Sydney Brenner,et al.  Synthetic methods for the implementation of encoded combinatorial chemistry , 1993 .

[29]  Sachdev S Sidhu,et al.  Synthetic antibodies from a four-amino-acid code: a dominant role for tyrosine in antigen recognition. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[30]  T. Chambers,et al.  Phosphorylation by protein kinase C and cyclic AMP-dependent protein kinase of synthetic peptides derived from the linker region of human P-glycoprotein. , 1994, The Biochemical journal.

[31]  Sheri K. Wilcox,et al.  Aptamers and the RNA world, past and present. , 2012, Cold Spring Harbor perspectives in biology.

[32]  Wei Jiang,et al.  Engineering antibodies for cancer therapy. , 2011, Annual review of chemical and biomolecular engineering.

[33]  Lars Kolster Petersen,et al.  A yoctoliter-scale DNA reactor for small-molecule evolution. , 2009, Journal of the American Chemical Society.

[34]  G. F. Joyce,et al.  Amplification, mutation and selection of catalytic RNA. , 1989, Gene.

[35]  J. Keasling,et al.  High-throughput metabolic engineering: advances in small-molecule screening and selection. , 2010, Annual review of biochemistry.

[36]  P. Harbury,et al.  DNA Display II. Genetic Manipulation of Combinatorial Chemistry Libraries for Small-Molecule Evolution , 2004, PLoS biology.

[37]  Robert J. Marinelli,et al.  Mesofluidic Devices for DNA-Programmed Combinatorial Chemistry , 2012, PloS one.

[38]  J. Szostak,et al.  In vitro selection of RNA molecules that bind specific ligands , 1990, Nature.

[39]  Ashwani Kumar,et al.  Directed evolution: tailoring biocatalysts for industrial applications , 2013, Critical reviews in biotechnology.

[40]  Donald Hilvert,et al.  Directed Evolution of a Model Primordial Enzyme Provides Insights into the Development of the Genetic Code , 2013, PLoS genetics.

[41]  P. Harbury,et al.  DNA Display III. Solid-Phase Organic Synthesis on Unprotected DNA , 2004, PLoS biology.

[42]  Christoph E. Dumelin,et al.  High-throughput sequencing allows the identification of binding molecules isolated from DNA-encoded chemical libraries , 2008, Proceedings of the National Academy of Sciences.

[43]  P. Schultz,et al.  Profiling protein function with small molecule microarrays , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[44]  Baoguang Zhao,et al.  Design, synthesis and selection of DNA-encoded small-molecule libraries. , 2009, Nature chemical biology.

[45]  G. P. Smith,et al.  Filamentous fusion phage: novel expression vectors that display cloned antigens on the virion surface. , 1985, Science.

[46]  David R. Liu,et al.  DNA-Templated Organic Synthesis and Selection of a Library of Macrocycles , 2004, Science.