Sampling and energy evaluation challenges in ligand binding protein design

The steroid hormone 17α‐hydroxylprogesterone (17‐OHP) is a biomarker for congenital adrenal hyperplasia and hence there is considerable interest in development of sensors for this compound. We used computational protein design to generate protein models with binding sites for 17‐OHP containing an extended, nonpolar, shape‐complementary binding pocket for the four‐ring core of the compound, and hydrogen bonding residues at the base of the pocket to interact with carbonyl and hydroxyl groups at the more polar end of the ligand. Eight of 16 designed proteins experimentally tested bind 17‐OHP with micromolar affinity. A co‐crystal structure of one of the designs revealed that 17‐OHP is rotated 180° around a pseudo‐two‐fold axis in the compound and displays multiple binding modes within the pocket, while still interacting with all of the designed residues in the engineered site. Subsequent rounds of mutagenesis and binding selection improved the ligand affinity to nanomolar range, while appearing to constrain the ligand to a single bound conformation that maintains the same “flipped” orientation relative to the original design. We trace the discrepancy in the design calculations to two sources: first, a failure to model subtle backbone changes which alter the distribution of sidechain rotameric states and second, an underestimation of the energetic cost of desolvating the carbonyl and hydroxyl groups of the ligand. The difference between design model and crystal structure thus arises from both sampling limitations and energy function inaccuracies that are exacerbated by the near two‐fold symmetry of the molecule.

[1]  David Baker,et al.  De Novo Enzyme Design Using Rosetta3 , 2011, PloS one.

[2]  Z. Otwinowski,et al.  Processing of X-ray diffraction data collected in oscillation mode. , 1997, Methods in enzymology.

[3]  D. G. Gibson,et al.  Enzymatic assembly of DNA molecules up to several hundred kilobases , 2009, Nature Methods.

[4]  Colin W. Taylor,et al.  Analysis of protein-ligand interactions by fluorescence polarization , 2011, Nature Protocols.

[5]  D. Baker,et al.  Relaxation of backbone bond geometry improves protein energy landscape modeling , 2014, Protein science : a publication of the Protein Society.

[6]  Samuel L. DeLuca,et al.  Small-molecule ligand docking into comparative models with Rosetta , 2013, Nature Protocols.

[7]  L. Benatuil,et al.  An improved yeast transformation method for the generation of very large human antibody libraries. , 2010, Protein engineering, design & selection : PEDS.

[8]  J. Skolnick,et al.  TM-align: a protein structure alignment algorithm based on the TM-score , 2005, Nucleic acids research.

[9]  P. Emsley,et al.  Features and development of Coot , 2010, Acta crystallographica. Section D, Biological crystallography.

[10]  M. New,et al.  Congenital adrenal hyperplasia. , 1988, Biochemical Society transactions.

[11]  D. Baker De novo Design of Protein Homo‐Oligomers with Modular Hydrogen‐Bond Network‐Mediated Specificity. , 2016 .

[12]  D. Baker,et al.  Restricted sidechain plasticity in the structures of native proteins and complexes , 2011, Protein science : a publication of the Protein Society.

[13]  D. Baker,et al.  Computational design of a protein-based enzyme inhibitor. , 2013, Journal of molecular biology.

[14]  M. Stewart,et al.  The 1.6 angstroms resolution crystal structure of nuclear transport factor 2 (NTF2). , 1997, Journal of molecular biology.

[15]  David Baker,et al.  A Pareto-Optimal Refinement Method for Protein Design Scaffolds , 2013, PloS one.

[16]  Helen M. Kent,et al.  The 1.6 Å Resolution Crystal Structure of Nuclear Transport Factor 2 (NTF2) , 1996 .

[17]  F. J. Poelwijk,et al.  The spatial architecture of protein function and adaptation , 2012, Nature.

[18]  Timothy A. Whitehead,et al.  Computational Design of Proteins Targeting the Conserved Stem Region of Influenza Hemagglutinin , 2011, Science.

[19]  Arthur J. Olson,et al.  AutoDock Vina: Improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading , 2009, J. Comput. Chem..

[20]  Jens Meiler,et al.  New algorithms and an in silico benchmark for computational enzyme design , 2006, Protein science : a publication of the Protein Society.

[21]  Randy J. Read,et al.  Acta Crystallographica Section D Biological , 2003 .

[22]  Vahid Mirjalili,et al.  Protein Structure Refinement through Structure Selection and Averaging from Molecular Dynamics Ensembles. , 2013, Journal of chemical theory and computation.

[23]  I. Silva,et al.  Neonatal screening for congenital adrenal hyperplasia. , 2012, Revista da Associacao Medica Brasileira.

[24]  David Baker,et al.  Bioluminescent sensor proteins for point-of-care therapeutic drug monitoring. , 2014, Nature chemical biology.

[25]  Marcus D. Hanwell,et al.  Avogadro: an advanced semantic chemical editor, visualization, and analysis platform , 2012, Journal of Cheminformatics.

[26]  Matthew P. Repasky,et al.  Extra precision glide: docking and scoring incorporating a model of hydrophobic enclosure for protein-ligand complexes. , 2006, Journal of medicinal chemistry.

[27]  Jens Meiler,et al.  RosettaScripts: A Scripting Language Interface to the Rosetta Macromolecular Modeling Suite , 2011, PloS one.

[28]  M. James,et al.  Crystal structure of Mycobacterium tuberculosis Rv0760c at 1.50 A resolution, a structural homolog of Delta(5)-3-ketosteroid isomerase. , 2008, Biochimica et biophysica acta.

[29]  T. Maurer,et al.  CD57 Expression and Cytokine Production by T Cells in Lesional and Unaffected Skin from Patients with Psoriasis , 2013, PloS one.

[30]  Ruth Nussinov,et al.  PatchDock and SymmDock: servers for rigid and symmetric docking , 2005, Nucleic Acids Res..

[31]  David Baker,et al.  A general strategy to construct small molecule biosensors , 2017 .

[32]  Douglas M. Fowler,et al.  Enrich: software for analysis of protein function by enrichment and depletion of variants , 2011, Bioinform..

[33]  K Dane Wittrup,et al.  Isolating and engineering human antibodies using yeast surface display , 2006, Nature Protocols.

[34]  J. Gasteiger,et al.  ITERATIVE PARTIAL EQUALIZATION OF ORBITAL ELECTRONEGATIVITY – A RAPID ACCESS TO ATOMIC CHARGES , 1980 .

[35]  Randy J. Read,et al.  Phaser crystallographic software , 2007, Journal of applied crystallography.

[36]  David Baker,et al.  Computational design of ligand-binding proteins with high affinity and selectivity , 2013, Nature.

[37]  Benjamin A. Ellingson,et al.  Conformer Generation with OMEGA: Algorithm and Validation Using High Quality Structures from the Protein Databank and Cambridge Structural Database , 2010, J. Chem. Inf. Model..

[38]  Jiajie Zhang,et al.  PEAR: a fast and accurate Illumina Paired-End reAd mergeR , 2013, Bioinform..

[39]  Jens Meiler,et al.  ROSETTA3: an object-oriented software suite for the simulation and design of macromolecules. , 2011, Methods in enzymology.