Massively parallel de novo protein design for targeted therapeutics

De novo protein design holds promise for creating small stable proteins with shapes customized to bind therapeutic targets. We describe a massively parallel approach for designing, manufacturing and screening mini-protein binders, integrating large-scale computational design, oligonucleotide synthesis, yeast display screening and next-generation sequencing. We designed and tested 22,660 mini-proteins of 37–43 residues that target influenza haemagglutinin and botulinum neurotoxin B, along with 6,286 control sequences to probe contributions to folding and binding, and identified 2,618 high-affinity binders. Comparison of the binding and non-binding design sets, which are two orders of magnitude larger than any previously investigated, enabled the evaluation and improvement of the computational model. Biophysical characterization of a subset of the binder designs showed that they are extremely stable and, unlike antibodies, do not lose activity after exposure to high temperatures. The designs elicit little or no immune response and provide potent prophylactic and therapeutic protection against influenza, even after extensive repeated dosing.

[1]  W. L. Jorgensen,et al.  Comparison of simple potential functions for simulating liquid water , 1983 .

[2]  Timothy A. Whitehead,et al.  Computational Design of Proteins Targeting the Conserved Stem Region of Influenza Hemagglutinin , 2011, Science.

[3]  Vincent B. Chen,et al.  Correspondence e-mail: , 2000 .

[4]  L. Benatuil,et al.  An improved yeast transformation method for the generation of very large human antibody libraries. , 2010, Protein engineering, design & selection : PEDS.

[5]  Tri Giang Phan,et al.  Challenges and opportunities for non-antibody scaffold drugs. , 2015, Drug discovery today.

[6]  Jiajie Zhang,et al.  PEAR: a fast and accurate Illumina Paired-End reAd mergeR , 2013, Bioinform..

[7]  S. Nosé,et al.  Constant pressure molecular dynamics for molecular systems , 1983 .

[8]  Andreas Plückthun,et al.  Efficient tumor targeting with high-affinity designed ankyrin repeat proteins: effects of affinity and molecular size. , 2010, Cancer research.

[9]  Alan Bye,et al.  Pharmacokinetics of Zanamivir After Intravenous, Oral, Inhaled or Intranasal Administration to Healthy Volunteers , 1999, Clinical pharmacokinetics.

[10]  G. Watson,et al.  Computer simulation , 1988 .

[11]  Nathaniel Echols,et al.  The Phenix software for automated determination of macromolecular structures. , 2011, Methods.

[12]  David Baker,et al.  A Computationally Designed Inhibitor of an Epstein-Barr Viral Bcl-2 Protein Induces Apoptosis in Infected Cells , 2014, Cell.

[13]  A. Brünger Free R value: a novel statistical quantity for assessing the accuracy of crystal structures , 1992, Nature.

[14]  L. Murr,et al.  Computer Simulation in Materials Science and Engineering , 2015 .

[15]  Gerrit Groenhof,et al.  GROMACS: Fast, flexible, and free , 2005, J. Comput. Chem..

[16]  Min Dong,et al.  Structure of Botulinum neurotoxin B binding domain in complex with both synaptotagmin II and GD1a , 2013 .

[17]  Timothy A. Whitehead,et al.  Optimization of affinity, specificity and function of designed influenza inhibitors using deep sequencing , 2012, Nature Biotechnology.

[18]  D. Hoover,et al.  DNAWorks: an automated method for designing oligonucleotides for PCR-based gene synthesis. , 2002, Nucleic acids research.

[19]  Jennifer R Cochran,et al.  Engineered knottin peptides as diagnostics, therapeutics, and drug delivery vehicles. , 2016, Current opinion in chemical biology.

[20]  Michaela Gebauer,et al.  Engineered protein scaffolds as next-generation antibody therapeutics. , 2009, Current opinion in chemical biology.

[21]  D. Baker,et al.  RosettaRemodel: A Generalized Framework for Flexible Backbone Protein Design , 2011, PloS one.

[22]  N. Koga,et al.  Principles for Designing Ideal Protein Structures , 2013 .

[23]  Michael J MacCoss,et al.  Computationally designed high specificity inhibitors delineate the roles of BCL2 family proteins in cancer , 2016, eLife.

[24]  Randy J. Read,et al.  Phaser crystallographic software , 2007, Journal of applied crystallography.

[25]  D. Baker,et al.  Global analysis of protein folding using massively parallel design, synthesis, and testing , 2017, Science.

[26]  Axel T. Brunger,et al.  Botulinum neurotoxin B recognizes its protein receptor with high affinity and specificity , 2006, Nature.

[27]  David Baker,et al.  Removing T-cell epitopes with computational protein design , 2014, Proceedings of the National Academy of Sciences.

[28]  D. Baker,et al.  Control over overall shape and size in de novo designed proteins , 2015, Proceedings of the National Academy of Sciences.

[29]  D. J. Stevens,et al.  The Structure and Receptor Binding Properties of the 1918 Influenza Hemagglutinin , 2004, Science.

[30]  M. Parrinello,et al.  Canonical sampling through velocity rescaling. , 2007, The Journal of chemical physics.

[31]  Daniel-Adriano Silva,et al.  Motif-Driven Design of Protein-Protein Interfaces. , 2016, Methods in molecular biology.

[32]  J. Skehel,et al.  A Neutralizing Antibody Selected from Plasma Cells That Binds to Group 1 and Group 2 Influenza A Hemagglutinins , 2011, Science.

[33]  Maria Zambon,et al.  Antiviral resistance during the 2009 influenza A H1N1 pandemic: public health, laboratory, and clinical perspectives. , 2012, The Lancet. Infectious diseases.

[34]  Kevin Cowtan,et al.  research papers Acta Crystallographica Section D Biological , 2005 .

[35]  Z. Otwinowski,et al.  [20] Processing of X-ray diffraction data collected in oscillation mode. , 1997, Methods in enzymology.

[36]  Berk Hess,et al.  LINCS: A linear constraint solver for molecular simulations , 1997, J. Comput. Chem..

[37]  Jens Meiler,et al.  RosettaScripts: A Scripting Language Interface to the Rosetta Macromolecular Modeling Suite , 2011, PloS one.

[38]  D. Baker,et al.  Principles for designing ideal protein structures , 2012, Nature.

[39]  David Baker,et al.  A Computationally Designed Hemagglutinin Stem-Binding Protein Provides In Vivo Protection from Influenza Independent of a Host Immune Response , 2016, PLoS pathogens.

[40]  A. Blitzer,et al.  Spasmodic dysphonia and botulinum toxin: experience from the largest treatment series , 2010, European journal of neurology.

[41]  R. Dror,et al.  Improved side-chain torsion potentials for the Amber ff99SB protein force field , 2010, Proteins.

[42]  Z. Otwinowski,et al.  Processing of X-ray diffraction data collected in oscillation mode. , 1997, Methods in enzymology.

[43]  T. Darden,et al.  A smooth particle mesh Ewald method , 1995 .

[44]  Patrick J. Paddison,et al.  Production of complex nucleic acid libraries using highly parallel in situ oligonucleotide synthesis , 2004, Nature Methods.

[45]  Philip M. Kim,et al.  Protein engineering by highly parallel screening of computationally designed variants , 2016, Science Advances.

[46]  K Dane Wittrup,et al.  Isolating and engineering human antibodies using yeast surface display , 2006, Nature Protocols.

[47]  David Baker,et al.  Accurate de novo design of hyperstable constrained peptides , 2016, Nature.

[48]  B. Kuhlman,et al.  SwiftLib: rapid degenerate-codon-library optimization through dynamic programming , 2014, Nucleic acids research.

[49]  Jaap Heringa,et al.  PRALINE: a versatile multiple sequence alignment toolkit. , 2014, Methods in molecular biology.