To the Final Goal: Can We Predict and Suggest Mutations for Protein to Develop Desired Phenotype?

Directed evolution of proteins is a good approach to develop desired phenotypes from existing proteins. Fully experimental protein evolution usually utilizes randomization of a given protein sequence by error-prone PCR or gene shuffling followed by high-throughput selection or timeconsuming screening method. However, these random methods create mutant library full of deleterious mutations. In addition, they need high-throughput screening or selection method to search for positive clones from an enormous size of mutant library. Construction of a mutant library while retaining the original function is important for efficient protein evolution because it greatly reduces time and effort for the identification of positive mutants. Therefore, researchers have tried to reduce the size of mutant library by minimizing the occurrence of deleterious mutants. Such efforts have led to the creation of a concept of ‘small but smart library’. For this goal, neutral drift theory has been applied. Although smart library greatly reduces the library size, it is still the beyond the capacity of low-throughput assay. In parallel, computational analysis of protein structure and efforts to discriminate mutatable residues from all residues of a given protein have been consistently pursued. Accumulated knowledge of protein evolution through random mutation and selection has improved our understanding of functions of amino acids in protein structure. Protein evolution by rational design is being developed based on such understanding. In this review, we describe how the use of semi-rationally designed library rather than completely random one has impacted the overall procedure of directed evolution. We also describe efforts made to evaluate the effect of single mutation. Such efforts will bring lazy boys to the final goal - computational mutation suggestion system.

[1]  Uwe T Bornscheuer,et al.  Natural Diversity to Guide Focused Directed Evolution , 2010, Chembiochem : a European journal of chemical biology.

[2]  Antonín Pavelka,et al.  HotSpot Wizard: a web server for identification of hot spots in protein engineering , 2009, Nucleic Acids Res..

[3]  Byung-Gee Kim,et al.  Solubilization and Iterative Saturation Mutagenesis of α1,3‐fucosyltransferase from Helicobacter pylori to enhance its catalytic efficiency , 2016, Biotechnology and bioengineering.

[4]  Frances H Arnold,et al.  Neutral genetic drift can aid functional protein evolution , 2007, 0705.0201.

[5]  S. Vishveshwara,et al.  A network representation of protein structures: implications for protein stability. , 2005, Biophysical journal.

[6]  Gil Amitai,et al.  Network analysis of protein structures identifies functional residues. , 2004, Journal of molecular biology.

[7]  Frances H. Arnold,et al.  Molecular evolution by staggered extension process (StEP) in vitro recombination , 1998, Nature Biotechnology.

[8]  Costas D Maranas,et al.  Optimal protein library design using recombination or point mutations based on sequence-based scoring functions. , 2007, Protein engineering, design & selection : PEDS.

[9]  Nicholas J. Turner,et al.  Directed Evolution of Enzymes: New Biocatalysts for Asymmetric Synthesis , 2004 .

[10]  Maximilian Ccjc Ebert,et al.  Computational tools for enzyme improvement: why everyone can - and should - use them. , 2017, Current opinion in chemical biology.

[11]  D. Goeddel,et al.  A method for random mutagenesis of a defined DNA segment using a modified polymerase chain reaction , 1989 .

[12]  Frances H Arnold,et al.  Innovation by homologous recombination. , 2013, Current opinion in chemical biology.

[13]  Jooyoung Lee,et al.  Necessary and sufficient conditions for the asymmetric synthesis of chiral amines using ω‐aminotransferases , 2011, Biotechnology and bioengineering.

[14]  G. F. Joyce,et al.  Randomization of genes by PCR mutagenesis. , 1992, PCR methods and applications.

[15]  C D Maranas,et al.  Creating multiple-crossover DNA libraries independent of sequence identity , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[16]  W. Stemmer,et al.  DNA shuffling of a family of genes from diverse species accelerates directed evolution , 1998, Nature.

[17]  Y. Vorobjev,et al.  Advances in implicit models of water solvent to compute conformational free energy and molecular dynamics of proteins at constant pH. , 2011, Advances in protein chemistry and structural biology.

[18]  Dan S. Tawfik,et al.  Directed enzyme evolution via small and effective neutral drift libraries , 2008, Nature Methods.

[19]  D. Baker,et al.  Improved recognition of native‐like protein structures using a combination of sequence‐dependent and sequence‐independent features of proteins , 1999, Proteins.

[20]  Donald Hilvert,et al.  Relative tolerance of mesostable and thermostable protein homologs to extensive mutation , 2006, Proteins.

[21]  M. DePristo,et al.  Missense meanderings in sequence space: a biophysical view of protein evolution , 2005, Nature Reviews Genetics.

[22]  Stephen J Benkovic,et al.  FamClash: A method for ranking the activity of engineered enzymes , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[23]  Roberto A Chica,et al.  Semi-rational approaches to engineering enzyme activity: combining the benefits of directed evolution and rational design. , 2005, Current opinion in biotechnology.

[24]  Sebastian E. Ahnert,et al.  Residue Geometry Networks: A Rigidity-Based Approach to the Amino Acid Network and Evolutionary Rate Analysis , 2016, Scientific Reports.

[25]  T. S. Wong,et al.  The diversity challenge in directed protein evolution. , 2006, Combinatorial chemistry & high throughput screening.

[26]  L. Serrano,et al.  Predicting changes in the stability of proteins and protein complexes: a study of more than 1000 mutations. , 2002, Journal of molecular biology.

[27]  Ayelet Fishman,et al.  Protein Engineering by Random Mutagenesis and Structure-Guided Consensus of Geobacillus stearothermophilus Lipase T6 for Enhanced Stability in Methanol , 2013, Applied and Environmental Microbiology.

[28]  Hening Lin,et al.  Directed evolution of a glycosynthase via chemical complementation. , 2004, Journal of the American Chemical Society.

[29]  Joshua A Bittker,et al.  Directed evolution of protein enzymes using nonhomologous random recombination. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[30]  Volker Sieber,et al.  Libraries of hybrid proteins from distantly related sequences , 2001, Nature Biotechnology.

[31]  R. Sauer,et al.  Combinatorial cassette mutagenesis as a probe of the informational content of protein sequences. , 1988, Science.

[32]  Philip T. Pienkos,et al.  DNA shuffling method for generating highly recombined genes and evolved enzymes , 2001, Nature Biotechnology.

[33]  John B. O. Mitchell,et al.  A review of methods for the calculation of solution free energies and the modelling of systems in solution. , 2015, Physical chemistry chemical physics : PCCP.

[34]  F. Arnold,et al.  Protein stability promotes evolvability. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[35]  Jeremy Minshull,et al.  Protein engineering of improved prolyl endopeptidases for celiac sprue therapy. , 2008, Protein engineering, design & selection : PEDS.

[36]  W. P. Russ,et al.  Evolutionary information for specifying a protein fold , 2005, Nature.

[37]  Tal Pupko,et al.  ConSurf 2010: calculating evolutionary conservation in sequence and structure of proteins and nucleic acids , 2010, Nucleic Acids Res..

[38]  Nita Parekh,et al.  NAPS: Network Analysis of Protein Structures , 2016, Nucleic Acids Res..

[39]  Guangyu Yang,et al.  Enhanced Enzyme Kinetic Stability by Increasing Rigidity within the Active Site* , 2014, The Journal of Biological Chemistry.

[40]  C Kooperberg,et al.  Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. , 1997, Journal of molecular biology.

[41]  G. Huisman,et al.  Engineering the third wave of biocatalysis , 2012, Nature.

[42]  Frances H Arnold,et al.  Library analysis of SCHEMA‐guided protein recombination , 2003, Protein science : a publication of the Protein Society.

[43]  Uwe T Bornscheuer,et al.  Thermostabilization of an esterase by alignment-guided focussed directed evolution. , 2010, Protein engineering, design & selection : PEDS.

[44]  M. Kimura Evolutionary Rate at the Molecular Level , 1968, Nature.

[45]  F. Arnold,et al.  A diverse family of thermostable cytochrome P450s created by recombination of stabilizing fragments , 2007, Nature Biotechnology.

[46]  A D Ellington,et al.  In vitro evolution of beta-glucuronidase into a beta-galactosidase proceeds through non-specific intermediates. , 2001, Journal of molecular biology.

[47]  H. Leemhuis,et al.  Directed evolution of enzymes: Library screening strategies , 2009, IUBMB life.

[48]  Uwe T Bornscheuer,et al.  Use of ‘small but smart’ libraries to enhance the enantioselectivity of an esterase from Bacillus stearothermophilus towards tetrahydrofuran‐3‐yl acetate , 2013, The FEBS journal.

[49]  Hiroshi Murakami,et al.  Random insertion and deletion of arbitrary number of bases for codon-based random mutation of DNAs , 2002, Nature Biotechnology.

[50]  Christoph Adami,et al.  Thermodynamic prediction of protein neutrality. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[51]  J. Sohng,et al.  Expanding substrate specificity of GT‐B fold glycosyltransferase via domain swapping and high‐throughput screening , 2009, Biotechnology and bioengineering.

[52]  F. Arnold,et al.  Functional and nonfunctional mutations distinguished by random recombination of homologous genes. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[53]  Marc Ostermeier,et al.  A combinatorial approach to hybrid enzymes independent of DNA homology , 1999, Nature Biotechnology.

[54]  Stefan Lutz,et al.  Beyond directed evolution--semi-rational protein engineering and design. , 2010, Current opinion in biotechnology.

[55]  François Stricher,et al.  The FoldX web server: an online force field , 2005, Nucleic Acids Res..

[56]  Dan S. Tawfik,et al.  Directed evolution of proteins for heterologous expression and stability. , 2005, Current opinion in structural biology.

[57]  Frances H Arnold,et al.  To whom correspondence should be addressed. , 2022 .

[58]  W. Quax,et al.  Selection strategies for improved biocatalysts , 2007, The FEBS journal.

[59]  W. Stemmer Rapid evolution of a protein in vitro by DNA shuffling , 1994, Nature.

[60]  Cameron Neylon,et al.  Chemical and biochemical strategies for the randomization of protein encoding DNA sequences: library construction methods for directed evolution. , 2004, Nucleic acids research.

[61]  Randall J. Weselake,et al.  Site saturation mutagenesis: Methods and applications in protein engineering , 2012 .

[62]  Silvio C. E. Tosatto,et al.  The RING 2.0 web server for high quality residue interaction networks , 2016, Nucleic Acids Res..

[63]  J. L. Bose Chemical and UV Mutagenesis. , 2016, Methods in molecular biology.

[64]  M Karplus,et al.  Small-world view of the amino acids that play a key role in protein folding. , 2002, Physical review. E, Statistical, nonlinear, and soft matter physics.

[65]  Marc Ostermeier,et al.  Finding Cinderella's slipper—proteins that fit , 1999, Nature Biotechnology.

[66]  Jennifer R. Hale,et al.  Applying neutral drift to the directed molecular evolution of a β-glucuronidase into a β-galactosidase: Two different evolutionary pathways lead to the same variant , 2011, BMC Research Notes.

[67]  Gert Vriend,et al.  3DM: Systematic analysis of heterogeneous superfamily data to discover protein functionalities , 2010, Proteins.

[68]  David R. Liu,et al.  Methods for the directed evolution of proteins , 2015, Nature Reviews Genetics.

[69]  Christophe Ampe,et al.  Reducing mutational bias in random protein libraries. , 2005, Analytical biochemistry.

[70]  U. Bornscheuer,et al.  Engineering the Amine Transaminase from Vibrio fluvialis towards Branched‐Chain Substrates , 2016 .

[71]  Juhan Kim,et al.  Pro-antibiotic Substrates for the Identification of Enantioselective Hydrolases , 2006, Biotechnology Letters.

[72]  Paul N. Devine,et al.  Biocatalytic Asymmetric Synthesis of Chiral Amines from Ketones Applied to Sitagliptin Manufacture , 2010, Science.

[73]  C. Wilke,et al.  Thermodynamics of Neutral Protein Evolution , 2006, Genetics.

[74]  Manfred T Reetz,et al.  Iterative saturation mutagenesis (ISM) for rapid directed evolution of functional enzymes , 2007, Nature Protocols.