Computational Protein Engineering Approaches for Effective Design of New Molecules

Protein engineering involves synthesis of new proteins, or amendment in the existing protein sequence/structure to achieve desired functions. Novel protein design algorithms, advancement in structural bioinformatics, molecular force fields, and availability of immense information regarding 3D protein structure have made it possible to use computational approaches for protein engineering. Here, successful stories of proteins that have been engineered using in silico approaches have been discussed, with a focus on three important aspects: protein specificity , stability , and novel functionality . Though computational protein engineering methods have evolved to be imperative tools in the tedious and complex process of engineering proteins, some associated challenges that leave scope for further innovation and improvement have also been discussed.

[1]  Thomas Szyperski,et al.  Computational de novo design of a four‐helix bundle protein—DND_4HB , 2015, Protein science : a publication of the Protein Society.

[2]  J. Marvin,et al.  Conversion of a maltose receptor into a zinc biosensor by computational design , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Marianne Rooman,et al.  PoPMuSiC, rationally designing point mutations in protein structures , 2002, Bioinform..

[4]  F. Beaudoin,et al.  Overview and analysis of the polyprotein cleavage sites in the family Potyviridae. , 2005, Molecular plant pathology.

[5]  E. Lander,et al.  Development and Applications of CRISPR-Cas9 for Genome Engineering , 2014, Cell.

[6]  Dan Liang,et al.  Tandem amino acid repeats in the green anole (Anolis carolinensis) and other squamates may have a role in increasing genetic variability , 2016, BMC Genomics.

[7]  David A. Scott,et al.  Rationally engineered Cas9 nucleases with improved specificity , 2015, Science.

[8]  F. Richards,et al.  Construction of new ligand binding sites in proteins of known structure. I. Computer-aided modeling of sites with pre-defined geometry. , 1991, Journal of molecular biology.

[9]  Jiangning Song,et al.  Computational enzyme design approaches with significant biological outcomes: progress and challenges , 2012, Computational and structural biotechnology journal.

[10]  F. Quiocho,et al.  Atomic structure and specificity of bacterial periplasmic receptors for active transport and chemotaxis: variation of common themes , 1996, Molecular microbiology.

[11]  Hwangseo Park,et al.  Free energy perturbation approach for the rational engineering of the antibody for human hepatitis B virus. , 2011, Journal of molecular graphics & modelling.

[12]  P. S. Kim,et al.  High-resolution protein design with backbone freedom. , 1998, Science.

[13]  Loren L Looger,et al.  Computational Design of a Biologically Active Enzyme , 2004, Science.

[14]  Kouhei Tsumoto,et al.  Affinity Improvement of a Therapeutic Antibody by Structure-Based Computational Design: Generation of Electrostatic Interactions in the Transition State Stabilizes the Antibody-Antigen Complex , 2014, PloS one.

[15]  H. Hellinga,et al.  Periplasmic binding proteins: a versatile superfamily for protein engineering. , 2004, Current opinion in structural biology.

[16]  Durai Sundar,et al.  An ensemble micro neural network approach for elucidating interactions between zinc finger proteins and their target DNA , 2016, BMC Genomics.

[17]  Mona Singh,et al.  Predicting DNA recognition by Cys2His2 zinc finger proteins , 2009, Bioinform..

[18]  M. Gerstein How representative are the known structures of the proteins in a complete genome? A comprehensive structural census. , 1998, Folding & design.

[19]  P S Kim,et al.  Repacking protein cores with backbone freedom: structure prediction for coiled coils. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[20]  J. Keith Joung,et al.  Improving CRISPR-Cas nuclease specificity using truncated guide RNAs , 2014, Nature Biotechnology.

[21]  Majid Masso,et al.  AUTO-MUTE: web-based tools for predicting stability changes in proteins due to single amino acid replacements. , 2010, Protein engineering, design & selection : PEDS.

[22]  J. Doudna,et al.  The new frontier of genome engineering with CRISPR-Cas9 , 2014, Science.

[23]  François Stricher,et al.  The FoldX web server: an online force field , 2005, Nucleic Acids Res..

[24]  C. Pabo,et al.  Design and selection of novel Cys2His2 zinc finger proteins. , 2001, Annual review of biochemistry.

[25]  Jinquan Luo,et al.  Design of novel FN3 domains with high stability by a consensus sequence approach. , 2012, Protein engineering, design & selection : PEDS.

[26]  Eric A. Althoff,et al.  Kemp elimination catalysts by computational enzyme design , 2008, Nature.

[27]  Peter Timmerman,et al.  Affinity maturation of antibodies assisted by in silico modeling , 2008, Proceedings of the National Academy of Sciences.

[28]  Eytan Ruppin,et al.  MuD: an interactive web server for the prediction of non-neutral substitutions using protein structural data , 2010, Nucleic Acids Research.

[29]  B. Höcker,et al.  PocketOptimizer and the Design of Ligand Binding Sites. , 2016, Methods in molecular biology.

[30]  C. Pabo,et al.  DNA recognition by Cys2His2 zinc finger proteins. , 2000, Annual review of biophysics and biomolecular structure.

[31]  David A. Scott,et al.  Genome engineering using the CRISPR-Cas9 system , 2013, Nature Protocols.

[32]  Shantanu Roy,et al.  Escherchia coli ribose binding protein based bioreporters revisited , 2014, Scientific Reports.

[33]  C. Pabo,et al.  High-resolution structures of variant Zif268-DNA complexes: implications for understanding zinc finger-DNA recognition. , 1998, Structure.

[34]  Gary Walsh,et al.  Protein engineering: Case studies of commercialized engineered products , 2007, Biochemistry and molecular biology education : a bimonthly publication of the International Union of Biochemistry and Molecular Biology.

[35]  Jin-Soo Kim,et al.  Cas-OFFinder: a fast and versatile algorithm that searches for potential off-target sites of Cas9 RNA-guided endonucleases , 2014, Bioinform..

[36]  Using lobster noses to inspire robot sensor design. , 2002, Trends in biotechnology.

[37]  J. Dye,et al.  Design and characterization of ebolavirus GP prehairpin intermediate mimics as drug targets , 2015, Protein science : a publication of the Protein Society.

[38]  Amy L. Robertson,et al.  Enhancing the stability and solubility of TEV protease using in silico design , 2007, Protein science : a publication of the Protein Society.

[39]  Z. P. Çakar,et al.  Protein Engineering Methods and Applications , 2012 .

[40]  Michael McDuffie,et al.  FlexPred: a web-server for predicting residue positions involved in conformational switches in proteins , 2008, Bioinformation.

[41]  David A. Scott,et al.  Double Nicking by RNA-Guided CRISPR Cas9 for Enhanced Genome Editing Specificity , 2013, Cell.

[42]  Zhicheng Zuo,et al.  Cas9-catalyzed DNA Cleavage Generates Staggered Ends: Evidence from Molecular Dynamics Simulations , 2016, Scientific Reports.

[43]  David R. Liu,et al.  Supercharging proteins can impart unusual resilience. , 2007, Journal of the American Chemical Society.

[44]  Martin J. Aryee,et al.  GUIDE-Seq enables genome-wide profiling of off-target cleavage by CRISPR-Cas nucleases , 2014, Nature Biotechnology.

[45]  J. Andrew McCammon,et al.  Striking Plasticity of CRISPR-Cas9 and Key Role of Non-target DNA, as Revealed by Molecular Simulations , 2016, ACS central science.

[46]  Frances H Arnold,et al.  Library analysis of SCHEMA‐guided protein recombination , 2003, Protein science : a publication of the Protein Society.

[47]  M. Boutros,et al.  E-CRISP: fast CRISPR target site identification , 2014, Nature Methods.

[48]  J. Marvin,et al.  Protein engineering and the development of generic biosensors. , 1998, Trends in biotechnology.

[49]  S. L. Mayo,et al.  Protein design automation , 1996, Protein science : a publication of the Protein Society.

[50]  Jeffrey C. Miller,et al.  Highly efficient endogenous human gene correction using designed zinc-finger nucleases , 2005, Nature.

[51]  Rajni Verma,et al.  Computer-Aided Protein Directed Evolution: a Review of Web Servers, Databases and other Computational Tools for Protein Engineering , 2012, Computational and structural biotechnology journal.

[52]  R. Johnston,et al.  The nucleotide sequence of the coding region of tobacco etch virus genomic RNA: evidence for the synthesis of a single polyprotein. , 1986, Virology.

[53]  N. D. Clarke,et al.  Metal search: A computer program that helps design tetrahedral metal‐binding sites , 1995, Proteins.

[54]  K N Houk,et al.  SABER: A computational method for identifying active sites for new reactions , 2012, Protein science : a publication of the Protein Society.

[55]  Yang Zhang,et al.  EvoDesign: de novo protein design based on structural and evolutionary profiles , 2013, Nucleic Acids Res..

[56]  Xingxu Huang,et al.  sgRNAcas9: A Software Package for Designing CRISPR sgRNA and Evaluating Potential Off-Target Cleavage Sites , 2014, PloS one.

[57]  Christodoulos A. Floudas,et al.  Advances in protein structure prediction and de novo protein design : A review , 2006 .

[58]  Tobias M. Fischer,et al.  Monitoring regulated protein-protein interactions using split TEV , 2006, Nature Methods.

[59]  Spandan Madan,et al.  Exploiting the recognition code for elucidating the mechanism of zinc finger protein-DNA interactions , 2016, BMC Genomics.

[60]  Chun-Chi Chou,et al.  An effective approach for generating a three-Cys2His2 zinc-finger-DNA complex model by docking , 2010, BMC Bioinformatics.

[61]  A. Schug,et al.  Structure based design of protein linkers for zinc finger nuclease , 2013, FEBS letters.

[62]  Jared Adolf-Bryfogle,et al.  The PyRosetta Toolkit: A Graphical User Interface for the Rosetta Software Suite , 2013, PloS one.

[63]  Shu Wang,et al.  Comparative transcriptome analysis of wheat embryo and endosperm responses to ABA and H2O2 stresses during seed germination , 2016, BMC Genomics.

[64]  Jeffrey G. Mandell,et al.  Zinc Finger Tools: custom DNA-binding domains for transcription factors and nucleases , 2006, Nucleic Acids Res..

[65]  L. Grinberg,et al.  Stabilization of the third fibronectin type III domain of human tenascin-C through minimal mutation and rational design. , 2014, Protein engineering, design & selection : PEDS.

[66]  C. Maranas,et al.  IPRO: an iterative computational protein library redesign and optimization procedure. , 2006, Biophysical journal.

[67]  Sergey Lyskov,et al.  Alternative Computational Protocols for Supercharging Protein Surfaces for Reversible Unfolding and Retention of Stability , 2013, PloS one.

[68]  J. L. Mateo,et al.  CCTop: An Intuitive, Flexible and Reliable CRISPR/Cas9 Target Prediction Tool , 2015, PloS one.

[69]  Liang-Tsung Huang,et al.  iPTREE-STAB: interpretable decision tree based method for predicting protein stability changes upon mutations , 2007, Bioinform..

[70]  Sheldon Park,et al.  Advances in computational protein design. , 2004, Current opinion in structural biology.

[71]  Kornel Labun,et al.  CHOPCHOP v2: a web tool for the next generation of CRISPR genome engineering , 2016, Nucleic Acids Res..

[72]  Antonín Pavelka,et al.  CAVER 3.0: A Tool for the Analysis of Transport Pathways in Dynamic Protein Structures , 2012, PLoS Comput. Biol..

[73]  V. Iyer,et al.  Efficient genome modification by CRISPR-Cas9 nickase with minimal off-target effects , 2014, Nature Methods.

[74]  S. L. Mayo,et al.  De novo protein design: fully automated sequence selection. , 1997, Science.

[75]  Mark A Hallen,et al.  Dead‐end elimination with perturbations (DEEPer): A provable protein design algorithm with continuous sidechain and backbone flexibility , 2013, Proteins.

[76]  John C Whitman,et al.  Improving catalytic function by ProSAR-driven enzyme evolution , 2007, Nature Biotechnology.

[77]  R. Beerli,et al.  Engineering polydactyl zinc-finger transcription factors , 2002, Nature Biotechnology.

[78]  Brian Kuhlman,et al.  Future Challenges Of Computational Protein Design , 2009 .

[79]  D Gilis,et al.  Stability changes upon mutation of solvent-accessible residues in proteins evaluated by database-derived potentials. , 1996, Journal of molecular biology.

[80]  Abhinav Grover,et al.  Re-programming DNA-binding specificity in zinc finger proteins for targeting unique address in a genome , 2010, Systems and Synthetic Biology.

[81]  David R. Liu,et al.  Fusion of catalytically inactive Cas9 to FokI nuclease improves the specificity of genome modification , 2014, Nature Biotechnology.

[82]  Michael A. Johnston,et al.  Integrated prediction of the effect of mutations on multiple protein characteristics , 2011, Proteins.

[83]  Timothy L. Bailey,et al.  GT-Scan: identifying unique genomic targets , 2014, Bioinform..

[84]  Stephen L Mayo,et al.  Evaluating and optimizing computational protein design force fields using fixed composition-based negative design , 2008, Proceedings of the National Academy of Sciences.

[85]  L. Looger,et al.  Computational design of receptor and sensor proteins with novel functions , 2003, Nature.

[86]  Jérôme Golebiowski,et al.  Sweetness prediction of natural compounds. , 2017, Food chemistry.

[87]  George M. Church,et al.  CHOPCHOP: a CRISPR/Cas9 and TALEN web tool for genome editing , 2014, Nucleic Acids Res..

[88]  Stephen L Mayo,et al.  Evaluation of the energetic contribution of an ionic network to beta‐sheet stability , 2002, Protein science : a publication of the Protein Society.

[89]  Sonu Kumar,et al.  ZiF-Predict: A Web Tool for Predicting DNA-Binding Specificity in C2H2 Zinc Finger Proteins , 2010, Genom. Proteom. Bioinform..