Enzyme optimization: moving from blind evolution to statistical exploration of sequence-function space.

Directed evolution is a powerful tool for the creation of commercially useful enzymes, particularly those approaches that are based on in vitro recombination methods, such as DNA shuffling. Although these types of search algorithms are extraordinarily efficient compared with purely random methods, they do not explicitly represent or interrogate the genotype-phenotype relationship and are essentially blind in nature. Recently, however, researchers have begun to apply multivariate statistical techniques to model protein sequence-function relationships and guide the evolutionary process by rapidly identifying beneficial diversity for recombination. In conjunction with state-of-the-art library generation methods, the statistical approach to sequence optimization is now being used routinely to create enzymes efficiently for industrial applications.

[1]  Frances H. Arnold,et al.  Computational method to reduce the search space for directed protein evolution , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Richard Fox,et al.  Directed molecular evolution by machine learning and the influence of nonlinear interactions. , 2005, Journal of theoretical biology.

[3]  Claes Gustafsson,et al.  Semi-synthetic DNA shuffling of aveC leads to improved industrial scale production of doramectin by Streptomyces avermitilis. , 2005, Metabolic engineering.

[4]  F. Arnold,et al.  Strategies for the in vitro evolution of protein function: enzyme evolution by random recombination of improved sequences. , 1997, Journal of molecular biology.

[5]  Biological chemistry: Enzymes in focus , 2005, Nature.

[6]  Ramesh N. Patel Biocatalysis in the pharmaceutical and biotechnology industries , 2006 .

[7]  S. Anderson,et al.  Predicting the reactivity of proteins from their sequence alone: Kazal family of protein inhibitors of serine proteinases. , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[8]  A W Edwards,et al.  The genetical theory of natural selection. , 2000, Genetics.

[9]  E. Weinberger NP Completeness of Kauffman's N-k Model, A Tuneable Rugged Fitness Landscape , 1996 .

[10]  John C Whitman,et al.  Improving catalytic function by ProSAR-driven enzyme evolution , 2007, Nature Biotechnology.

[11]  R. Dawkins Climbing Mount Improbable , 1996 .

[12]  Roberto A Chica,et al.  Semi-rational approaches to engineering enzyme activity: combining the benefits of directed evolution and rational design. , 2005, Current opinion in biotechnology.

[13]  T. Yomo,et al.  Experimental Rugged Fitness Landscape in Protein Sequence Space , 2006, PloS one.

[14]  Wayne M Patrick,et al.  Novel methods for directed evolution of enzymes: quality, not quantity. , 2004, Current opinion in biotechnology.

[15]  Stephen L Mayo,et al.  Computationally designed libraries of fluorescent proteins evaluated by preservation and diversity of function , 2007, Proceedings of the National Academy of Sciences.

[16]  Stuart A. Kauffman,et al.  ORIGINS OF ORDER , 2019, Origins of Order.

[17]  Sachdev S Sidhu,et al.  Comprehensive functional maps of the antigen-binding site of an anti-ErbB2 antibody obtained with shotgun scanning mutagenesis. , 2002, Journal of molecular biology.

[18]  Iosif I. Vaisman,et al.  Accurate prediction of enzyme mutant activity based on a multibody statistical potential , 2007, Bioinform..

[19]  W. Scott,et al.  Biochemistry: Designer enzymes , 2007, Nature.

[20]  G. Stormo,et al.  Additivity in protein-DNA interactions: how good an approximation is it? , 2002, Nucleic acids research.

[21]  Thomas E. Ferrin,et al.  Designed divergent evolution of enzyme function , 2006, Nature.

[22]  S. Govindarajan,et al.  Putting engineering back into protein engineering: bioinformatic approaches to catalyst design. , 2003, Current opinion in biotechnology.

[23]  D. Youvan Searching Sequence Space , 1995, Bio/Technology.

[24]  A. Arkin,et al.  An algorithm for protein engineering: simulations of recursive ensemble mutagenesis. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[25]  R. Kazlauskas,et al.  Improving enzyme properties: when are closer mutations better? , 2005, Trends in biotechnology.

[26]  Gerald H Lushington,et al.  Whither combine? New opportunities for receptor-based QSAR. , 2007, Current medicinal chemistry.

[27]  S A Kauffman,et al.  Prolegomenon to a General Biology , 2001, Annals of the New York Academy of Sciences.

[28]  H. Kubinyi QSAR and 3D QSAR in drug design Part 1: methodology , 1997 .

[29]  Ann Thayer,et al.  Enzymes at Work , 2006, Science.

[30]  W. Stemmer DNA shuffling by random fragmentation and reassembly: in vitro recombination for molecular evolution. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[31]  Ling Yuan,et al.  Laboratory-Directed Protein Evolution , 2005, Microbiology and Molecular Biology Reviews.

[32]  Yasuhiko Shibanaka,et al.  Surveying a local fitness landscape of a protein with epistatic sites for the study of directed evolution. , 2002, Biopolymers.

[33]  Robert J. Keenan,et al.  The Molecular Basis of Glyphosate Resistance by an Optimized Microbial Acetyltransferase* , 2007, Journal of Biological Chemistry.

[34]  H. Schoemaker,et al.  Dispelling the Myths--Biocatalysis in Industrial Synthesis , 2003, Science.

[35]  F. Arnold,et al.  A diverse family of thermostable cytochrome P450s created by recombination of stabilizing fragments , 2007, Nature Biotechnology.

[36]  Mark P. Styczynski,et al.  The intelligent design of evolution , 2006, Molecular systems biology.

[37]  Mark A. Wall,et al.  Biocatalytic Conversion of Avermectin to 4″-Oxo-Avermectin: Improvement of Cytochrome P450 Monooxygenase Specificity by Directed Evolution , 2007, Applied and Environmental Microbiology.

[38]  F. Arnold,et al.  Combinatorial protein design by in vitro recombination. , 1998, Current opinion in chemical biology.

[39]  Karen M Polizzi,et al.  Better library design: data‐driven protein engineering , 2007, Biotechnology journal.

[40]  Claes Gustafsson,et al.  Optimizing the search algorithm for protein engineering by directed evolution. , 2003, Protein engineering.

[41]  J. Wells,et al.  Additivity of mutational effects in proteins. , 1990, Biochemistry.

[42]  Dan S. Tawfik Loop Grafting and the Origins of Enzyme Species , 2006, Science.

[43]  Loren L Looger,et al.  Computational Design of a Biologically Active Enzyme , 2004, Science.

[44]  Manfred T Reetz,et al.  Directed evolution of enantioselective enzymes: iterative cycles of CASTing for probing protein-sequence space. , 2006, Angewandte Chemie.

[45]  W. Stemmer,et al.  DNA shuffling of a family of genes from diverse species accelerates directed evolution , 1998, Nature.

[46]  Huimin Zhao,et al.  Recent advances in biocatalysis by directed enzyme evolution. , 2006, Combinatorial chemistry & high throughput screening.

[47]  Linda A. Castle,et al.  Discovery and Directed Evolution of a Glyphosate Tolerance Gene , 2004, Science.

[48]  Kalyanmoy Deb,et al.  Multi-objective Genetic Algorithms: Problem Difficulties and Construction of Test Problems , 1999, Evolutionary Computation.

[49]  John M Woodley,et al.  Biocatalysis for pharmaceutical intermediates: the future is now. , 2007, Trends in biotechnology.

[50]  S. D. Jong SIMPLS: an alternative approach to partial least squares regression , 1993 .

[51]  T C Terwilliger,et al.  Engineering multiple properties of a protein by combinatorial mutagenesis. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[52]  Matthew Prior,et al.  Letter from “J” , 1863, The Dental register.

[53]  Manfred K. Warmuth,et al.  Engineering proteinase K using machine learning and synthetic genes , 2007, BMC biotechnology.

[54]  Sung-Hun Nam,et al.  Design and Evolution of New Catalytic Activity with an Existing Protein Scaffold , 2006, Science.

[55]  Y Husimi,et al.  A cross-section of the fitness landscape of dihydrofolate reductase. , 2001, Protein engineering.

[56]  Tuck Seng Wong,et al.  Steering directed protein evolution: strategies to manage combinatorial complexity of mutant libraries. , 2007, Environmental microbiology.

[57]  W. Stemmer Rapid evolution of a protein in vitro by DNA shuffling , 1994, Nature.