Solution structure of a de novo protein from a designed combinatorial library

Combinatorial libraries of de novo amino acid sequences can provide a rich source of diversity for the discovery of novel proteins. Randomly generated sequences, however, rarely fold into well ordered protein-like structures. To enhance the quality of a library, diversity must be focused into those regions of sequence space most likely to yield well folded structures. We have constructed focused libraries of de novo sequences by designing the binary pattern of polar and nonpolar amino acids to favor structures that contain abundant secondary structure, while simultaneously burying hydrophobic side chains in the protein interior and exposing hydrophilic side chains to solvent. Because binary patterning specifies only the polar/nonpolar periodicity, but not the identities of the side chains, detailed structural features, including packing interactions, cannot be designed a priori. Can binary patterned libraries nonetheless encode well folded proteins? An unambiguous answer to this question requires determination of a 3D structure. We used NMR spectroscopy to determine the structure of S-824, a novel protein from a recently constructed library of 102-residue sequences. This library is “naïve” in that it has not been subjected to high-throughput screens or directed evolution. The experimentally determined structure of S-824 is a four-helix bundle, as specified by the design. As dictated by the binary-code strategy, nonpolar side chains are buried in the protein interior, and polar side chains are exposed to solvent. The polypeptide backbone and buried side chains are well ordered, demonstrating that S-824 is not a molten globule and forms a unique structure. These results show that amino acid sequences that have neither been selected by evolution, nor designed by computer, nor isolated by high-throughput screening, can form native-like structures. These findings validate the binary-code strategy as an effective method for producing vast collections of well folded de novo proteins.

[1]  F. Crick,et al.  The packing of α‐helices: simple coiled‐coils , 1953 .

[2]  F. Young Biochemistry , 1955, The Indian Medical Gazette.

[3]  F. Lederer,et al.  Improvement of the 2.5 A resolution model of cytochrome b562 by redetermining the primary structure and using molecular graphics. , 1979, Journal of molecular biology.

[4]  G. P. Smith,et al.  Filamentous fusion phage: novel expression vectors that display cloned antigens on the virion surface. , 1985, Science.

[5]  K. Wüthrich NMR of proteins and nucleic acids , 1988 .

[6]  G. Fasman Prediction of Protein Structure and the Principles of Protein Conformation , 2012, Springer US.

[7]  W Mandecki,et al.  A method for construction of long randomized open reading frames and polypeptides. , 1990, Protein engineering.

[8]  Ponzy Lu,et al.  Prediction of protein structure and principles of protein conformation Edited by G. D. Fasman. New York: Plenum Press. (1989). 798 pp. $95.00 , 1990, Cell.

[9]  J. Thornton,et al.  PROCHECK: a program to check the stereochemical quality of protein structures , 1993 .

[10]  Ad Bax,et al.  Quantitative J correlation: a new approach for measuring homonuclear three-bond J(HNH.alpha.) coupling constants in 15N-enriched proteins , 1993 .

[11]  L. H. Bradley,et al.  Protein design by binary patterning of polar and nonpolar amino acids. , 1993, Methods in molecular biology.

[12]  Robert T. Sauer,et al.  Cooperatively folded proteins in random sequence libraries , 1995, Nature Structural Biology.

[13]  L. Mueller,et al.  Characterization of a new four‐chain coiled‐coil: Influence of chain length on stability , 1995, Protein science : a publication of the Protein Society.

[14]  Eric Oldfield,et al.  1H, 13C and 15N chemical shift referencing in biomolecular NMR , 1995, Journal of biomolecular NMR.

[15]  A. Plückthun,et al.  In vitro selection and evolution of functional proteins by using ribosome display. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[16]  S. L. Mayo,et al.  De novo protein design: fully automated sequence selection. , 1997, Science.

[17]  Michael H. Hecht,et al.  A PROTEIN DESIGNED BY BINARY PATTERNING OF POLAR AND NONPOLAR AMINO ACIDS DISPLAYS NATIVE-LIKE PROPERTIES , 1997 .

[18]  A T Brünger,et al.  Torsion-angle molecular dynamics as a new efficient tool for NMR structure calculation. , 1997, Journal of magnetic resonance.

[19]  R J Read,et al.  Crystallography & NMR system: A new software suite for macromolecular structure determination. , 1998, Acta crystallographica. Section D, Biological crystallography.

[20]  P. Güntert Structure calculation of biological macromolecules from NMR data , 1998, Quarterly Reviews of Biophysics.

[21]  M. Hecht,et al.  De novo amyloid proteins from designed combinatorial libraries. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[22]  M. Hecht,et al.  Screening combinatorial libraries of de Novo proteins by hydrogen- deuterium exchange and electrospray mass spectrometry , 1999 .

[23]  Christopher M. Summa,et al.  De novo design and structural characterization of proteins and metalloproteins. , 1999, Annual review of biochemistry.

[24]  Michael H. Hecht,et al.  Peroxidase Activity in Heme Proteins Derived from a Designed Combinatorial Library , 2000 .

[25]  M. Hecht,et al.  Cooperative thermal denaturation of proteins designed by binary patterning of polar and nonpolar amino acids. , 2000, Biochemistry.

[26]  R. Hodges,et al.  Designing heterodimeric two-stranded alpha-helical coiled-coils: the effect of chain length on protein folding, stability and specificity. , 2001, The journal of peptide research : official journal of the American Peptide Society.

[27]  Anthony D. Keefe,et al.  Functional proteins from a random-sequence library , 2001, Nature.

[28]  M. Hecht,et al.  Rationally designed mutations convert de novo amyloid-like fibrils into monomeric β-sheet proteins , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[29]  D. Saville,et al.  Template-directed assembly of a de novo designed protein. , 2002, Journal of the American Chemical Society.

[30]  Yinan Wei,et al.  Stably folded de novo proteins from a designed combinatorial library , 2003, Protein science : a publication of the Protein Society.

[31]  Letter to the Editor: 1H, 13C and 15N resonance assignments of S-824, a de novo four-helix bundle from a designed combinatorial library , 2003, Journal of biomolecular NMR.

[32]  Ian W. Davis,et al.  Structure validation by Cα geometry: ϕ,ψ and Cβ deviation , 2003, Proteins.

[33]  L. H. Bradley,et al.  De novo proteins from designed combinatorial libraries , 2004, Protein science : a publication of the Protein Society.