GeneORator: An Effective Strategy for Navigating Protein Sequence Space More Efficiently through Boolean OR-Type DNA Libraries

Directed evolution requires the creation of genetic diversity and subsequent screening or selection for improved variants. For DNA mutagenesis, conventional site-directed methods implicitly utilize the Boolean AND operator (creating all mutations simultaneously), producing a combinatorial explosion in the number of genetic variants as the number of mutations increases. We introduce GeneORator, a novel strategy for creating DNA libraries based on the Boolean logical OR operator. Here, a single library is divided into many subsets, each containing different combinations of the desired mutations. Consequently, the effect of adding more mutations on the number of genetic combinations is additive (Boolean OR logic) and not exponential (AND logic). We demonstrate this strategy with large-scale mutagenesis studies, using monoamine oxidase-N (Aspergillus niger) as the exemplar target. First, we mutated every residue in the secondary structure-containing regions (276 out of a total 495 amino acids) to screen for improvements in kcat. Second, combinatorial OR-type libraries permitted screening of diverse mutation combinations in the enzyme active site to detect activity toward novel substrates. In both examples, OR-type libraries effectively reduced the number of variants searched up to 1010-fold, dramatically reducing the screening effort required to discover variants with improved and/or novel activity. Importantly, this approach enables the screening of a greater diversity of mutation combinations, accessing a larger area of a protein’s sequence space. OR-type libraries can be applied to any biological engineering objective requiring DNA mutagenesis, and the approach has wide ranging applications in, for example, enzyme engineering, antibody engineering, and synthetic biology.

[1]  Elena R. Lozovsky,et al.  Biophysical principles predict fitness landscapes of drug resistance , 2016, Proceedings of the National Academy of Sciences.

[2]  Timothy K Lu,et al.  Synthetic circuits integrating logic and memory in living cells , 2013, Nature Biotechnology.

[3]  Michael H Hecht,et al.  De novo proteins from binary-patterned combinatorial libraries. , 2006, Methods in molecular biology.

[4]  Paul A Dalby,et al.  Strategy and success for the directed evolution of enzymes. , 2011, Current opinion in structural biology.

[5]  R. Kazlauskas,et al.  Improving enzyme properties: when are closer mutations better? , 2005, Trends in biotechnology.

[6]  Philip A. Romero,et al.  Exploring protein fitness landscapes by directed evolution , 2009, Nature Reviews Molecular Cell Biology.

[7]  Dmitry Chudakov,et al.  Local fitness landscape of the green fluorescent protein , 2016, Nature.

[8]  Leilei Zhu,et al.  Directed evolution 2.0: improving and deciphering enzyme properties. , 2015, Chemical communications.

[9]  Hao Lin,et al.  MDC-Analyzer-facilitated combinatorial strategy for improving the activity and stability of halohydrin dehalogenase from Agrobacterium radiobacter AD1. , 2015, Journal of biotechnology.

[10]  D. Kell,et al.  Array-based evolution of DNA aptamers allows modelling of an explicit sequence-fitness landscape , 2008, Nucleic acids research.

[11]  Frances H Arnold,et al.  Fancy footwork in the sequence space shuffle , 2006, Nature Biotechnology.

[12]  Jianzhi Zhang,et al.  The fitness landscape of a tRNA gene , 2016, Science.

[13]  Nicholas J. Turner,et al.  Deracemization of α‐Methylbenzylamine Using an Enzyme Obtained by In Vitro Evolution , 2002 .

[14]  Manfred T Reetz,et al.  Addressing the Numbers Problem in Directed Evolution , 2008, Chembiochem : a European journal of chemical biology.

[15]  Xia Li,et al.  SynBioLGDB: a resource for experimentally validated logic gates in synthetic biology , 2015, Scientific Reports.

[16]  E. D. Weinberger,et al.  The NK model of rugged fitness landscapes and its application to maturation of the immune response. , 1989, Journal of theoretical biology.

[17]  F. Arnold,et al.  Strategies for the in vitro evolution of protein function: enzyme evolution by random recombination of improved sequences. , 1997, Journal of molecular biology.

[18]  L. H. Bradley,et al.  Protein design by binary patterning of polar and nonpolar amino acids. , 1993, Methods in molecular biology.

[19]  Stanley Fields,et al.  Deep Mutational Scanning: A Highly Parallel Method to Measure the Effects of Mutation on Protein Function. , 2015, Cold Spring Harbor protocols.

[20]  T. Yomo,et al.  Experimental Rugged Fitness Landscape in Protein Sequence Space , 2006, PloS one.

[21]  Neil Swainston,et al.  GeneGenie: optimized oligomer design for directed evolution , 2014, Nucleic Acids Res..

[22]  K N Houk,et al.  The Role of Distant Mutations and Allosteric Regulation on LovD Active Site Dynamics , 2014, Nature chemical biology.

[23]  Manfred T. Reetz,et al.  Biocatalysis in Organic Chemistry and Biotechnology: Past, Present, and Future , 2013 .

[24]  Neil Swainston,et al.  CodonGenie: optimised ambiguous codon design tools , 2017, PeerJ Prepr..

[25]  Andrew Currin,et al.  Synthetic biology for the directed evolution of protein biocatalysts: navigating sequence space intelligently , 2014, Chemical Society reviews.

[26]  Nicholas J. Turner,et al.  Directed Evolution of the Enzyme Monoamine Oxidase (MAO‐N): Highly Efficient Chemo‐enzymatic Deracemisation of the Alkaloid (±)‐Crispine A , 2012 .

[27]  Frances H. Arnold,et al.  Computational method to reduce the search space for directed protein evolution , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[28]  A. Green,et al.  Monoamine Oxidase (MAO-N) Catalyzed Deracemization of Tetrahydro-β-carbolines: Substrate Dependent Switch in Enantioselectivity , 2013 .

[29]  Nicholas J Turner,et al.  A template-based mnemonic for monoamine oxidase (MAO-N) catalyzed reactions and its application to the chemo-enzymatic deracemisation of the alkaloid (+/-)-crispine A. , 2007, Chemical communications.

[30]  Catherine L. Worth,et al.  Structural and functional constraints in the evolution of protein families , 2009, Nature Reviews Molecular Cell Biology.

[31]  Andreas Krause,et al.  Navigating the protein fitness landscape with Gaussian processes , 2012, Proceedings of the National Academy of Sciences.

[32]  Joshua D. Knowles Closed-loop evolutionary multiobjective optimization , 2009, IEEE Computational Intelligence Magazine.

[33]  Manfred T Reetz,et al.  Directed evolution of enantioselective enzymes: iterative cycles of CASTing for probing protein-sequence space. , 2006, Angewandte Chemie.

[34]  S. Kauffman,et al.  Coevolution to the edge of chaos: coupled fitness landscapes, poised states, and coevolutionary avalanches. , 1991, Journal of theoretical biology.

[35]  John C Whitman,et al.  Improving catalytic function by ProSAR-driven enzyme evolution , 2007, Nature Biotechnology.

[36]  Nicholas J Turner,et al.  Mapping the substrate scope of monoamine oxidase (MAO-N) as a synthetic tool for the enantioselective synthesis of chiral amines. , 2017, Bioorganic & medicinal chemistry.

[37]  Uwe T Bornscheuer,et al.  A Retrosynthesis Approach for Biocatalysis in Organic Synthesis. , 2017, Chemistry.

[38]  Nicholas J Turner,et al.  Biocatalysis enters a new era. , 2013, Current opinion in chemical biology.

[39]  Nicholas J Turner,et al.  The structure of monoamine oxidase from Aspergillus niger provides a molecular context for improvements in activity obtained by directed evolution. , 2008, Journal of molecular biology.

[40]  Jaroslav Bendl,et al.  Computational tools for designing smart libraries. , 2014, Methods in molecular biology.

[41]  Nicholas J Turner,et al.  Directed evolution drives the next generation of biocatalysts. , 2009, Nature chemical biology.

[42]  D. Hilvert,et al.  Protein design by directed evolution. , 2008, Annual review of biophysics.

[43]  Andrew Currin,et al.  SpeedyGenes: an improved gene synthesis method for the efficient production of error-corrected, synthetic protein libraries for directed evolution , 2014, Protein engineering, design & selection : PEDS.

[44]  Gjalt W Huisman,et al.  Enzyme optimization: moving from blind evolution to statistical exploration of sequence-function space. , 2008, Trends in biotechnology.

[45]  Nicholas J Turner,et al.  A Regio- and Stereoselective ω-Transaminase/Monoamine Oxidase Cascade for the Synthesis of Chiral 2,5-Disubstituted Pyrrolidines , 2014, Angewandte Chemie.

[46]  Takafumi Miyamoto,et al.  Synthesizing biomolecule-based Boolean logic gates. , 2013, ACS synthetic biology.

[47]  Stanley Fields,et al.  Measuring the activity of protein variants on a large scale using deep mutational scanning , 2014, Nature Protocols.

[48]  Zujun Yang,et al.  DC-Analyzer-facilitated combinatorial strategy for rapid directed evolution of functional enzymes with multiple mutagenesis sites. , 2014, Journal of biotechnology.

[49]  Marc Garcia-Borràs,et al.  Computational tools for the evaluation of laboratory-engineered biocatalysts , 2016, Chemical communications.

[50]  Huimin Zhao,et al.  Directed evolution: an evolving and enabling synthetic biology tool. , 2012, Current opinion in chemical biology.

[51]  Fyodor A Kondrashov,et al.  Topological features of rugged fitness landscapes in sequence space. , 2015, Trends in genetics : TIG.

[52]  Douglas B Kell,et al.  Scientific discovery as a combinatorial optimisation problem: How best to navigate the landscape of possible experiments? , 2012, BioEssays : news and reviews in molecular, cellular and developmental biology.

[53]  N. Turner,et al.  A chemo-enzymatic route to enantiomerically pure cyclic tertiary amines. , 2006, Journal of the American Chemical Society.

[54]  Huimin Zhao,et al.  Directed Evolution: Past, Present and Future. , 2013, AIChE journal. American Institute of Chemical Engineers.

[55]  Neil Swainston,et al.  Fast and Flexible Synthesis of Combinatorial Libraries for Directed Evolution. , 2018, Methods in enzymology.

[56]  Ferran Feixas,et al.  Hidden Conformations in Aspergillus niger Monoamine Oxidase are Key for Catalytic Efficiency. , 2019, Angewandte Chemie.

[57]  Manfred T. Reetz,et al.  Enzymatic site-selectivity enabled by structure-guided directed evolution. , 2017, Chemical communications.

[58]  Andreas Vogel,et al.  Expanding the substrate scope of enzymes: combining mutations obtained by CASTing. , 2006, Chemistry.

[59]  M. Hecht,et al.  Binary patterning of polar and nonpolar amino acids in the sequences and structures of native proteins , 1995, Protein science : a publication of the Protein Society.

[60]  Neil Swainston,et al.  SpeedyGenes: Exploiting an Improved Gene Synthesis Method for the Efficient Production of Synthetic Protein Libraries for Directed Evolution. , 2017, Methods in molecular biology.

[61]  C. Voigt,et al.  Rational evolutionary design: the theory of in vitro protein evolution. , 2000, Advances in protein chemistry.

[62]  Stuart A. Kauffman,et al.  The origins of order , 1993 .

[63]  Xiong Wang,et al.  Construction of "small-intelligent" focused mutagenesis libraries using well-designed combinatorial degenerate primers. , 2012, BioTechniques.

[64]  Nicholas J Turner,et al.  Directed evolution of enzymes: new biocatalysts for asymmetric synthesis. , 2003, Organic & biomolecular chemistry.

[65]  G. Huisman,et al.  Engineering the third wave of biocatalysis , 2012, Nature.

[66]  Magali Remaud-Siméon,et al.  A web-based tool for rational screening of mutants libraries using ProSAR. , 2014, Protein engineering, design & selection : PEDS.