A combined NMR and computational approach to investigate peptide binding to a designed Armadillo repeat protein.

The specific recognition of peptide sequences by proteins plays an important role both in biology and in diagnostic applications. Here we characterize the relatively weak binding of the peptide neurotensin (NT) to the previously developed Armadillo repeat protein VG_328 by a multidisciplinary approach based on solution NMR spectroscopy, mutational studies, and molecular dynamics (MD) simulations, totaling 20μs for all MD runs. We describe assignment challenges arising from the repetitive nature of the protein sequence, and we present novel approaches to address them. Partial assignments obtained for VG_328 in combination with chemical shift perturbations allowed us to identify the repeats not involved in binding. Their subsequent elimination resulted in a reduced-size binder with very similar affinity for NT, for which near-complete backbone assignments were achieved. A binding mode suggested by automatic docking and further validated by explicit solvent MD simulations is consistent with paramagnetic relaxation enhancement data collected using spin-labeled NT. Favorable intermolecular interactions are observed in the MD simulations for the residues that were previously shown to contribute to binding in an Ala scan of NT. We further characterized the role of residues within the N-cap for protein stability and peptide binding. Our multidisciplinary approach demonstrates that an initial low-resolution picture for a low-micromolar-peptide binder can be refined through the combination of NMR, protein design, docking, and MD simulations to establish its binding mode, even in the absence of crystallographic data, thereby providing valuable information for further design.

[1]  Temple F. Smith,et al.  The WD repeat: a common architecture for diverse functions. , 1999, Trends in biochemical sciences.

[2]  G Vriend,et al.  WHAT IF: a molecular modeling and drug design program. , 1990, Journal of molecular graphics.

[3]  J. C. Almagro,et al.  Identification of differences in the specificity‐determining residues of antibodies that recognize antigens of different size: implications for the rational design of antibody repertoires , 2004, Journal of molecular recognition : JMR.

[4]  K. Sharp,et al.  Accurate Calculation of Hydration Free Energies Using Macroscopic Solvent Models , 1994 .

[5]  R. Keller,et al.  The Computer Aided Resonance Assignment Tutorial , 2004 .

[6]  H. Hoogenboom,et al.  Selecting and screening recombinant antibody libraries , 2005, Nature Biotechnology.

[7]  M. Parrinello,et al.  Canonical sampling through velocity rescaling. , 2007, The Journal of chemical physics.

[8]  J. Cavanagh Protein NMR Spectroscopy: Principles and Practice , 1995 .

[9]  G. Blatch,et al.  The tetratricopeptide repeat: a structural motif mediating protein-protein interactions. , 1999, BioEssays : news and reviews in molecular, cellular and developmental biology.

[10]  Andreas Plückthun,et al.  DARPins and other repeat protein scaffolds: advances in engineering and applications. , 2011, Current opinion in biotechnology.

[11]  Peer Bork,et al.  HEAT repeats in the Huntington's disease protein , 1995, Nature Genetics.

[12]  G. Blobel,et al.  Crystallographic Analysis of the Recognition of a Nuclear Localization Signal by the Nuclear Import Factor Karyopherin α , 1998, Cell.

[13]  Alexander D. MacKerell,et al.  Optimization of the additive CHARMM all-atom protein force field targeting improved sampling of the backbone φ, ψ and side-chain χ(1) and χ(2) dihedral angles. , 2012, Journal of chemical theory and computation.

[14]  W J Nelson,et al.  Three-dimensional structure of the armadillo repeat region of beta-catenin. , 1997, Cell.

[15]  William I. Weis,et al.  Three-Dimensional Structure of the Armadillo Repeat Region of β-Catenin , 1997, Cell.

[16]  Andreas Plückthun,et al.  Structure‐based optimization of designed Armadillo‐repeat proteins , 2012, Protein science : a publication of the Protein Society.

[17]  D Cowburn,et al.  Modular peptide recognition domains in eukaryotic signaling. , 1997, Annual review of biophysics and biomolecular structure.

[18]  M. Hatzfeld The armadillo family of structural proteins. , 1999, International review of cytology.

[19]  Andreas Plückthun,et al.  Optimization of designed armadillo repeat proteins by molecular dynamics simulations and NMR spectroscopy , 2012, Protein science : a publication of the Protein Society.

[20]  W. Jahnke,et al.  Measurement of fast proton exchange rates in isotopically labeled compounds , 1993 .

[21]  M. Zweckstetter,et al.  Mars - robust automatic backbone assignment of proteins , 2004, Journal of biomolecular NMR.

[22]  R. Ishima,et al.  Structural Basis of the Allosteric Inhibitor Interaction on the HIV‐1 Reverse Transcriptase RNase H Domain , 2012, Chemical biology & drug design.

[23]  D. Gerloff,et al.  Highly Sensitive Detection of Individual HEAT and ARM Repeats with HHpred and COACH , 2009, PloS one.

[24]  A. Plückthun,et al.  Designed Armadillo repeat proteins: library generation, characterization and selection of peptide binders with high specificity. , 2012, Journal of molecular biology.

[25]  D. Schnur,et al.  Development of an internal searching algorithm for parameterization of the MM2/MM3 force fields , 1991 .

[26]  Andreas Plückthun,et al.  Modular peptide binding: from a comparison of natural binders to designed armadillo repeat proteins. , 2014, Journal of structural biology.

[27]  W. Delano The PyMOL Molecular Graphics System , 2002 .

[28]  P. Hajduk,et al.  NMR-based screening in drug discovery , 1999, Quarterly Reviews of Biophysics.

[29]  Arthur J. Olson,et al.  AutoDock Vina: Improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading , 2009, J. Comput. Chem..

[30]  G. Wagner,et al.  Efficient side-chain and backbone assignment in large proteins: Application to tGCN5 , 1999, Journal of biomolecular NMR.

[31]  V. Bennett,et al.  Identification and partial purification of ankyrin, the high affinity membrane attachment site for human erythrocyte spectrin. , 1979, The Journal of biological chemistry.

[32]  Bostjan Kobe,et al.  Biophysical Characterization of Interactions Involving Importin-α during Nuclear Import* , 2001, The Journal of Biological Chemistry.

[33]  William I. Weis,et al.  The Structure of the β-Catenin/E-Cadherin Complex and the Molecular Basis of Diverse Ligand Recognition by β-Catenin , 2001, Cell.

[34]  Andreas Plückthun,et al.  Spontaneous self-assembly of engineered armadillo repeat protein fragments into a folded structure. , 2014, Structure.

[35]  A. Plückthun,et al.  Engineering novel binding proteins from nonimmunoglobulin domains , 2005, Nature Biotechnology.

[36]  J. Coates,et al.  Armadillo repeat proteins: beyond the animal kingdom. , 2003, Trends in cell biology.

[37]  H. Berendsen,et al.  Molecular dynamics with coupling to an external bath , 1984 .

[38]  F. Bermejo,et al.  Assignment and conformation of neurotensin in aqueous solution by 1H NMR. , 2009, International journal of peptide and protein research.

[39]  E. Wieschaus,et al.  Molecular analysis of the armadillo locus: uniformly distributed transcripts and a protein with novel internal repeats are associated with a Drosophila segment polarity gene. , 1989, Genes & development.

[40]  Andreas Plückthun,et al.  Designed armadillo repeat proteins as general peptide-binding scaffolds: consensus design and computational optimization of the hydrophobic core. , 2008, Journal of molecular biology.

[41]  J Kuriyan,et al.  Crystallographic analysis of the specific yet versatile recognition of distinct nuclear localization signals by karyopherin alpha. , 2000, Structure.

[42]  Christian Griesinger,et al.  Heteronuclear multidimensional NMR experiments for the structure determination of proteins in solution employing pulsed field gradients , 1999 .

[43]  A. Reynolds,et al.  The p120 catenin family: complex roles in adhesion, signaling and cancer. , 2000, Journal of cell science.

[44]  E. Nice,et al.  Biophysical characterization of interactions involving importin-alpha during nuclear import. , 2001, The Journal of biological chemistry.

[45]  M F Sanner,et al.  Python: a programming language for software integration and development. , 1999, Journal of molecular graphics & modelling.

[46]  B. Power,et al.  A new generation of protein display scaffolds for molecular recognition , 2006, Protein science : a publication of the Protein Society.

[47]  Richard Bonneau,et al.  Ab initio protein structure prediction of CASP III targets using ROSETTA , 1999, Proteins.

[48]  S. Ståhl,et al.  Non-immunoglobulin based protein scaffolds. , 2011, Current opinion in biotechnology.

[49]  Alexey Lugovskoy,et al.  Design of next-generation protein therapeutics. , 2010, Current opinion in chemical biology.

[50]  Berk Hess,et al.  LINCS: A linear constraint solver for molecular simulations , 1997, J. Comput. Chem..

[51]  W. Weis,et al.  ICAT inhibits beta-catenin binding to Tcf/Lef-family transcription factors and the general coactivator p300 using independent structural modules. , 2002, Molecular cell.

[52]  Lee Fielding,et al.  NMR methods for the determination of protein-ligand dissociation constants. , 2003, Current topics in medicinal chemistry.

[53]  W. L. Jorgensen,et al.  Comparison of simple potential functions for simulating liquid water , 1983 .

[54]  Jason D. Perlmutter,et al.  The Methionine-aromatic Motif Plays a Unique Role in Stabilizing Protein Structure* , 2012, The Journal of Biological Chemistry.

[55]  J. Sambrook,et al.  Molecular Cloning: A Laboratory Manual , 2001 .

[56]  Neal J. Zondlo Aromatic-proline interactions: electronically tunable CH/π interactions. , 2013, Accounts of chemical research.

[57]  T. Darden,et al.  Particle mesh Ewald: An N⋅log(N) method for Ewald sums in large systems , 1993 .

[58]  A. Plückthun,et al.  Residue-resolved stability of full-consensus ankyrin repeat proteins probed by NMR. , 2010, Journal of molecular biology.

[59]  Jack Snoeyink,et al.  Nucleic Acids Research Advance Access published April 22, 2007 MolProbity: all-atom contacts and structure validation for proteins and nucleic acids , 2007 .