Computational design of a leucine-rich repeat protein with a predefined geometry

Significance Repeat proteins are used in nature to bind to proteins and peptides. The shape of their binding surfaces can vary substantially, even for proteins within the same family. This variability likely arose because they evolved to match the proteins they interact with geometrically. Repeat proteins are often engineered to develop binders specific to new target proteins. It would be highly beneficial to design repeat proteins with predefined geometrical shapes because such a method would enable development of engineered repeat proteins that are shape-optimized to their targets. Here, we demonstrate that repeat proteins with a predefined shape can be designed using a computational design method. The approach is exemplified by the design of a protein that forms a ring structure not seen in nature. Structure-based protein design offers a possibility of optimizing the overall shape of engineered binding scaffolds to match their targets better. We developed a computational approach for the structure-based design of repeat proteins that allows for adjustment of geometrical features like length, curvature, and helical twist. By combining sequence optimization of existing repeats and de novo design of capping structures, we designed leucine-rich repeats (LRRs) from the ribonuclease inhibitor (RI) family that assemble into structures with a predefined geometry. The repeat proteins were built from self-compatible LRRs that are designed to interact to form highly curved and planar assemblies. We validated the geometrical design approach by engineering a ring structure constructed from 10 self-compatible repeats. Protein design can also be used to increase our structural understanding of repeat proteins. We use our design constructs to demonstrate that buried Cys play a central role for stability and folding cooperativity in RI-type LRR proteins. The computational procedure presented here may be used to develop repeat proteins with various geometrical shapes for applications where greater control of the interface geometry is desired.

[1]  Simon W. Ginzinger,et al.  SHIFTX2: significantly improved protein chemical shift prediction , 2011, Journal of biomolecular NMR.

[2]  E. Komives,et al.  Biophysical characterization of the free IκBα ankyrin repeat domain in solution , 2004 .

[3]  D. Baker,et al.  RosettaHoles: Rapid assessment of protein core packing for structure prediction, refinement, design, and validation , 2008, Protein science : a publication of the Protein Society.

[4]  J. Kong,et al.  Fourier transform infrared spectroscopic analysis of protein secondary structures. , 2007, Acta biochimica et biophysica Sinica.

[5]  A. Kajava Structural diversity of leucine-rich repeat proteins. , 1998, Journal of molecular biology.

[6]  Patrik Lundström,et al.  Fractional 13C enrichment of isolated carbons using [1-13C]- or [2-13C]-glucose facilitates the accurate measurement of dynamics at backbone Cα and side-chain methyl positions in proteins , 2007, Journal of biomolecular NMR.

[7]  P. Cossart,et al.  Crystal structure and standardized geometric analysis of InlJ, a listerial virulence factor and leucine-rich repeat protein with a novel cysteine ladder. , 2008, Journal of molecular biology.

[8]  David Baker,et al.  Macromolecular modeling with rosetta. , 2008, Annual review of biochemistry.

[9]  Randy J. Read,et al.  Overview of the CCP4 suite and current developments , 2011, Acta crystallographica. Section D, Biological crystallography.

[10]  Jens Meiler,et al.  RosettaScripts: A Scripting Language Interface to the Rosetta Macromolecular Modeling Suite , 2011, PloS one.

[11]  Conformational exchange of aromatic side chains characterized by L-optimized TROSY-selected 13C CPMG relaxation dispersion , 2012, Journal of biomolecular NMR.

[12]  D. Scott,et al.  Analytical Ultracentrifugation: Techniques and Methods , 2007 .

[13]  R. Mariuzza,et al.  Sedimentation equilibrium analysis of protein interactions with global implicit mass conservation constraints and systematic noise decomposition. , 2004, Analytical biochemistry.

[14]  Doug Barrick,et al.  The contribution of entropy, enthalpy, and hydrophobic desolvation to cooperativity in repeat-protein folding. , 2011, Structure.

[15]  Elizabeth A. Komives,et al.  Regions of IκBα that are critical for its inhibition of NF-κB·DNA interaction fold upon binding to NF-κB , 2006, Proceedings of the National Academy of Sciences.

[16]  D. Baker,et al.  RosettaRemodel: A Generalized Framework for Flexible Backbone Protein Design , 2011, PloS one.

[17]  A. Kalia,et al.  Functional diversity of ankyrin repeats in microbial proteins. , 2010, Trends in microbiology.

[18]  S. Grzesiek,et al.  NMRPipe: A multidimensional spectral processing system based on UNIX pipes , 1995, Journal of biomolecular NMR.

[19]  N. Gay,et al.  Structural and functional diversity in the leucine-rich repeat family of proteins. , 1996, Progress in biophysics and molecular biology.

[20]  Bostjan Kobe,et al.  Crystal structure of porcine ribonuclease inhibitor, a protein with leucine-rich repeats , 1993, Nature.

[21]  Andreas Plückthun,et al.  Designing repeat proteins: well-expressed, soluble and stable proteins from combinatorial libraries of consensus ankyrin repeat proteins. , 2003, Journal of molecular biology.

[22]  David Baker,et al.  Modeling Symmetric Macromolecular Structures in Rosetta3 , 2011, PloS one.

[23]  J Deisenhofer,et al.  Mechanism of ribonuclease inhibition by ribonuclease inhibitor protein based on the crystal structure of its complex with ribonuclease A. , 1996, Journal of molecular biology.

[24]  N. Greenfield Using circular dichroism collected as a function of temperature to determine the thermodynamics of protein unfolding and binding interactions , 2006, Nature Protocols.

[25]  Andreas Plückthun,et al.  DARPins and other repeat protein scaffolds: advances in engineering and applications. , 2011, Current opinion in biotechnology.

[26]  D. Barrick,et al.  The leucine-rich repeat domain of Internalin B folds along a polarized N-terminal pathway. , 2008, Structure.

[27]  E. Main,et al.  Repeat protein engineering: creating functional nanostructures/biomaterials from modular building blocks. , 2013, Biochemical Society transactions.

[28]  B. Kobe,et al.  The leucine-rich repeat as a protein recognition motif. , 2001, Current opinion in structural biology.

[29]  Andreas Plückthun,et al.  Designing repeat proteins: modular leucine-rich repeat protein libraries based on the mammalian ribonuclease inhibitor family. , 2003, Journal of molecular biology.

[30]  Elizabeth A Komives,et al.  Regions of IkappaBalpha that are critical for its inhibition of NF-kappaB.DNA interaction fold upon binding to NF-kappaB. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[31]  J. García de la Torre,et al.  Prediction of hydrodynamic and other solution properties of rigid proteins from atomic- and residue-level models. , 2011, Biophysical journal.

[32]  Andrey V Kajava,et al.  Tandem repeats in proteins: from sequence to structure. , 2012, Journal of structural biology.

[33]  David Baker,et al.  Prediction of the structure of symmetrical protein assemblies , 2007, Proceedings of the National Academy of Sciences.

[34]  L. Kay,et al.  Slow internal dynamics in proteins: application of NMR relaxation dispersion spectroscopy to methyl groups in a cavity mutant of T4 lysozyme. , 2002, Journal of the American Chemical Society.

[35]  Frans A A Mulder,et al.  Sequence-specific random coil chemical shifts of intrinsically disordered proteins. , 2010, Journal of the American Chemical Society.

[36]  M. Tsai,et al.  Ankyrin repeat: a unique motif mediating protein-protein interactions. , 2006, Biochemistry.

[37]  A. Plückthun,et al.  Designed to be stable: Crystal structure of a consensus ankyrin repeat protein , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[38]  A. Plückthun,et al.  A novel strategy to design binding molecules harnessing the modular nature of repeat proteins , 2003, FEBS letters.

[39]  Doug Barrick,et al.  Repeat-protein folding: new insights into origins of cooperativity, stability, and topology. , 2008, Archives of biochemistry and biophysics.

[40]  Dominique Durand,et al.  Design, production and molecular structure of a new family of artificial alpha-helicoidal repeat proteins (αRep) based on thermostable HEAT-like repeats. , 2010, Journal of molecular biology.

[41]  J Deisenhofer,et al.  Proteins with leucine-rich repeats. , 1995, Current opinion in structural biology.

[42]  E. Komives,et al.  Biophysical characterization of the free IkappaBalpha ankyrin repeat domain in solution. , 2004, Protein science : a publication of the Protein Society.

[43]  T. Grove,et al.  Consensus design of a NOD receptor leucine rich repeat domain with binding affinity for a muramyl dipeptide, a bacterial cell wall fragment , 2014, Protein science : a publication of the Protein Society.

[44]  Masakatsu Kamiya,et al.  Structural principles of leucine‐rich repeat (LRR) proteins , 2003, Proteins.

[45]  Yong Xiong,et al.  Design of stable alpha-helical arrays from an idealized TPR motif. , 2003, Structure.

[46]  D. Barrick,et al.  Capping motifs stabilize the leucine‐rich repeat protein PP32 and rigidify adjacent repeats , 2014, Protein science : a publication of the Protein Society.

[47]  Bruce A Johnson,et al.  Using NMRView to visualize and analyze the NMR spectra of macromolecules. , 2004, Methods in molecular biology.

[48]  M. Lawrence,et al.  Shape complementarity at protein/protein interfaces. , 1993, Journal of molecular biology.

[49]  Seungpyo Hong,et al.  Design of a binding scaffold based on variable lymphocyte receptors of jawless vertebrates by module engineering , 2012, Proceedings of the National Academy of Sciences.

[50]  Lynne Regan,et al.  TPR proteins: the versatile helix. , 2003, Trends in biochemical sciences.