Review: protein design--where we were, where we are, where we're going.

Protein design has become a powerful approach for understanding the relationship between amino acid sequence and 3-dimensional structure. In the past 5 years, there have been many breakthroughs in the development of computational methods that allow the selection of novel sequences given the structure of a protein backbone. Successful design of protein scaffolds has now paved the way for new endeavors to design function. The ability to design sequences compatible with a fold may also be useful in structural and functional genomics by expanding the range of proteins used for fold recognition and for the identification of functionally important domains from multiple sequence alignments.

[1]  B. Tidor,et al.  Do salt bridges stabilize proteins? A continuum electrostatic analysis , 1994, Protein science : a publication of the Protein Society.

[2]  B. Matthews,et al.  Response of a protein structure to cavity-creating mutations and its relation to the hydrophobic effect. , 1992, Science.

[3]  W. L. Jorgensen,et al.  Development and Testing of the OPLS All-Atom Force Field on Conformational Energetics and Properties of Organic Liquids , 1996 .

[4]  J R Desjarlais,et al.  Computer search algorithms in protein modification and design. , 1998, Current opinion in structural biology.

[5]  S J Wodak,et al.  Automatic protein design with all atom force-fields by exact and heuristic optimization. , 2000, Journal of molecular biology.

[6]  D. Eisenberg,et al.  Atomic solvation parameters applied to molecular dynamics of proteins in solution , 1992, Protein science : a publication of the Protein Society.

[7]  B. Tidor,et al.  Rational modification of protein stability by the mutation of charged surface residues. , 2000, Biochemistry.

[8]  N. D. Clarke,et al.  Metal search: A computer program that helps design tetrahedral metal‐binding sites , 1995, Proteins.

[9]  R. Lavery,et al.  A new approach to the rapid determination of protein side chain conformations. , 1991, Journal of biomolecular structure & dynamics.

[10]  R. L. Baldwin,et al.  N‐ and C‐capping preferences for all 20 amino acids in α‐helical peptides , 1995, Protein science : a publication of the Protein Society.

[11]  F. Richards,et al.  Construction of new ligand binding sites in proteins of known structure. I. Computer-aided modeling of sites with pre-defined geometry. , 1991, Journal of molecular biology.

[12]  P. S. Kim,et al.  High-resolution protein design with backbone freedom. , 1998, Science.

[13]  S. A. Marshall,et al.  Energy functions for protein design. , 1999, Current opinion in structural biology.

[14]  J. Moult,et al.  An algorithm for determining the conformation of polypeptide segments in proteins by systematic search , 1986, Proteins.

[15]  M. Gilson,et al.  Prediction of pH-dependent properties of proteins. , 1994, Journal of molecular biology.

[16]  W. C. Still,et al.  The GB/SA Continuum Model for Solvation. A Fast Analytical Method for the Calculation of Approximate Born Radii , 1997 .

[17]  C. Tanford,et al.  Theory of Protein Titration Curves. I. General Equations for Impenetrable Spheres , 1957 .

[18]  R. Abagyan,et al.  Protein engineering with monomeric triosephosphate isomerase (monoTIM): the modelling and structure verification of a seven-residue loop. , 1997, Protein engineering.

[19]  L. Kier,et al.  Amino acid side chain parameters for correlation studies in biology and pharmacology. , 2009, International journal of peptide and protein research.

[20]  Johan Desmet,et al.  The dead-end elimination theorem and its use in protein side-chain positioning , 1992, Nature.

[21]  R. Lerner,et al.  Control of the exo and endo pathways of the Diels-Alder reaction by antibody catalysis. , 1993, Science.

[22]  G. Makhatadze,et al.  Engineering a thermostable protein via optimization of charge-charge interactions on the protein surface. , 1999, Biochemistry.

[23]  A. Lesk,et al.  Principles determining the structure of beta-sheet barrels in proteins. II. The observed structures. , 1994, Journal of molecular biology.

[24]  P. S. Kim,et al.  Context-dependent secondary structure formation of a designed protein sequence , 1996, Nature.

[25]  J. Mendes,et al.  Improved modeling of side‐chains in proteins with rotamer‐based methods: A flexible rotamer model , 1999, Proteins.

[26]  J A Wozniak,et al.  Replacements of Pro86 in phage T4 lysozyme extend an alpha-helix but do not alter protein stability. , 1990, Science.

[27]  E. Mehler Self-Consistent, Free Energy Based Approximation To Calculate pH Dependent Electrostatic Effects in Proteins , 1996 .

[28]  Lynne Regan,et al.  The de novo design of a rubredoxin‐like fe site , 1998, Protein science : a publication of the Protein Society.

[29]  R. Srinivasan,et al.  A physical basis for protein secondary structure. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[30]  S. L. Mayo,et al.  Probing the role of packing specificity in protein design. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[31]  R. L. Baldwin,et al.  Determination of free energies of N-capping in alpha-helices by modification of the Lifson-Roig helix-coil therapy to include N- and C-capping. , 1994, Biochemistry.

[32]  Gregory D. Hawkins,et al.  Pairwise solute descreening of solute charges from a dielectric medium , 1995 .

[33]  S L Mayo,et al.  Structure of a protein G helix variant suggests the importance of helix propensity and helix dipole interactions in protein design , 2000, Protein science : a publication of the Protein Society.

[34]  S. L. Mayo,et al.  Automated design of the surface positions of protein helices , 1997, Protein science : a publication of the Protein Society.

[35]  S L Mayo,et al.  Pairwise calculation of protein solvent-accessible surface areas. , 1998, Folding & design.

[36]  Bruce Tidor,et al.  Electrostatic interactions in the GCN4 leucine zipper: Substantial contributions arise from intramolecular interactions enhanced on binding , 1999, Protein science : a publication of the Protein Society.

[37]  W. Jencks Catalysis in chemistry and enzymology , 1969 .

[38]  A. D. McLachlan,et al.  Solvation energy in protein folding and binding , 1986, Nature.

[39]  S. L. Mayo,et al.  DREIDING: A generic force field for molecular simulations , 1990 .

[40]  P. S. Kim,et al.  Context is a major determinant of β-sheet propensity , 1994, Nature.

[41]  P. S. Kim,et al.  Mechanism of specificity in the Fos-Jun oncoprotein heterodimer , 1992, Cell.

[42]  B Tidor,et al.  Protein stabilization by removal of unsatisfied polar groups: computational approaches and experimental tests. , 1996, Biochemistry.

[43]  J. Janin,et al.  Analytical approximation to the accessible surface area of proteins. , 1980, Proceedings of the National Academy of Sciences of the United States of America.

[44]  K. Sharp,et al.  Macroscopic models of aqueous solutions : biological and chemical applications , 1993 .

[45]  B K Shoichet,et al.  A relationship between protein stability and protein function. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[46]  L. Pauling Chemical achievement and hope for the future. , 1948, American scientist.

[47]  D. Hilvert Critical analysis of antibody catalysis. , 2000, Annual review of biochemistry.

[48]  W. DeGrado,et al.  Solution structure and dynamics of a de novo designed three-helix bundle protein. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[49]  Andrew M Wollacott,et al.  Prediction of amino acid sequence from structure , 2000, Protein science : a publication of the Protein Society.

[50]  B. Matthews,et al.  Design and structural analysis of alternative hydrophobic core packing arrangements in bacteriophage T4 lysozyme. , 1993, Journal of molecular biology.

[51]  P. Schultz,et al.  Immunological origins of binding and catalysis in a Diels-Alderase antibody. , 1998, Science.

[52]  M. Levitt,et al.  De novo protein design. I. In search of stability and specificity. , 1999, Journal of molecular biology.

[53]  R. Sauer,et al.  Are buried salt bridges important for protein stability and conformational specificity? , 1995, Nature Structural Biology.

[54]  P. Koehl,et al.  Polar and nonpolar atomic environments in the protein core: Implications for folding and binding , 1994, Proteins.

[55]  Neil D. Clarke,et al.  Novel metal-binding proteins by design , 1995, Nature Structural Biology.

[56]  R. Goldstein,et al.  Optimizing potentials for the inverse protein folding problem. , 1998, Protein engineering.

[57]  Stephen L. Mayo,et al.  Designing protein β-sheet surfaces by Z-score optimization , 2000 .

[58]  D. Baker,et al.  Native protein sequences are close to optimal for their structures. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[59]  D. Shortle,et al.  Mutant forms of staphylococcal nuclease with altered patterns of guanidine hydrochloride and urea denaturation , 1986, Proteins.

[60]  W. DeGrado,et al.  From synthetic coiled coils to functional proteins: automated design of a receptor for the calmodulin-binding domain of calcineurin. , 1998, Journal of molecular biology.

[61]  C. Lee,et al.  Predicting protein mutant energetics by self-consistent ensemble optimization. , 1994, Journal of molecular biology.

[62]  R. Abagyan,et al.  Biased probability Monte Carlo conformational searches and electrostatic calculations for peptides and proteins. , 1994, Journal of molecular biology.

[63]  J R Desjarlais,et al.  From coiled coils to small globular proteins: Design of a native‐like three‐helix bundle , 1998, Protein science : a publication of the Protein Society.

[64]  Stephen L. Mayo,et al.  Design, structure and stability of a hyperthermophilic protein variant , 1998, Nature Structural Biology.

[65]  J R Desjarlais,et al.  Side-chain and backbone flexibility in protein core design. , 1999, Journal of molecular biology.

[66]  D A Agard,et al.  Computational method for the design of enzymes with altered substrate specificity. , 1991, Journal of molecular biology.

[67]  S. L. Mayo,et al.  Protein design automation , 1996, Protein science : a publication of the Protein Society.

[68]  G. A. Lazar,et al.  De novo design of the hydrophobic core of ubiquitin , 1997, Protein science : a publication of the Protein Society.

[69]  S. L. Mayo,et al.  De novo protein design: fully automated sequence selection. , 1997, Science.

[70]  R. Goldstein Efficient rotamer elimination applied to protein side-chains and related spin glasses. , 1994, Biophysical journal.

[71]  H. Scheraga,et al.  Accessible surface areas as a measure of the thermodynamic parameters of hydration of peptides. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[72]  J R Desjarlais,et al.  De novo design of the hydrophobic cores of proteins , 1995, Protein science : a publication of the Protein Society.

[73]  A. Lesk,et al.  Principles determining the structure of beta-sheet barrels in proteins. I. A theoretical analysis. , 1994, Journal of molecular biology.

[74]  F. Richards,et al.  The crystal structure of a mutant protein with altered but improved hydrophobic core packing. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[75]  H W Hellinga,et al.  Rational design of nascent metalloenzymes. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[76]  Baldomero Oliva,et al.  An automated classification of the structure of protein loops. , 1997, Journal of molecular biology.

[77]  G. A. Lazar,et al.  Solution structure and dynamics of a designed hydrophobic core variant of ubiquitin. , 1999, Structure.

[78]  P. Harbury,et al.  Tanford-Kirkwood electrostatics for protein modeling. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[79]  M. Karplus,et al.  Effective energy function for proteins in solution , 1999, Proteins.

[80]  P. Kollman,et al.  A Second Generation Force Field for the Simulation of Proteins, Nucleic Acids, and Organic Molecules , 1995 .

[81]  P. Kollman,et al.  A Second Generation Force Field for the Simulation of Proteins, Nucleic Acids, and Organic Molecules J. Am. Chem. Soc. 1995, 117, 5179−5197 , 1996 .

[82]  D. Case,et al.  Generalized born models of macromolecular solvation effects. , 2000, Annual review of physical chemistry.

[83]  P G Schultz,et al.  At the crossroads of chemistry and immunology: catalytic antibodies. , 1991, Science.

[84]  F M Richards,et al.  Construction of new ligand binding sites in proteins of known structure. II. Grafting of a buried transition metal binding site into Escherichia coli thioredoxin. , 1991, Journal of molecular biology.

[85]  N. Metropolis,et al.  Equation of State Calculations by Fast Computing Machines , 1953, Resonance.

[86]  H. Farid,et al.  A new approach to the design of uniquely folded thermally stable proteins , 2008, Protein science : a publication of the Protein Society.

[87]  Roland L. Dunbrack,et al.  Bayesian statistical analysis of protein side‐chain rotamer preferences , 1997, Protein science : a publication of the Protein Society.

[88]  Christopher A. Voigt,et al.  Trading accuracy for speed: A quantitative comparison of search algorithms in protein sequence design. , 2000, Journal of molecular biology.

[89]  B. Matthews,et al.  The role of backbone flexibility in the accommodation of variants that repack the core of T4 lysozyme. , 1994, Science.

[90]  Junichi Takagi,et al.  Computational design of an integrin I domain stabilized in the open high affinity conformation , 2000, Nature Structural Biology.

[91]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[92]  B. Dominy,et al.  Development of a generalized Born model parameterization for proteins and nucleic acids , 1999 .

[93]  Designing the hydrophobic core of Thermus flavus malate dehydrogenase based on side-chain packing. , 1998, Protein engineering.

[94]  P M Cullis,et al.  Affinities of amino acid side chains for solvent water. , 1981, Biochemistry.

[95]  D B Gordon,et al.  Branch-and-terminate: a combinatorial optimization algorithm for protein design. , 1999, Structure.

[96]  Richard A. Friesner,et al.  Solvation Free Energies of Peptides: Comparison of Approximate Continuum Solvation Models with Accurate Solution of the Poisson−Boltzmann Equation , 1997 .

[97]  A. Warshel,et al.  Electrostatic effects in macromolecules: fundamental concepts and practical modeling. , 1998, Current opinion in structural biology.

[98]  M. Karplus,et al.  pKa's of ionizable groups in proteins: atomic detail from a continuum electrostatic model. , 1990, Biochemistry.

[99]  J. Ponder,et al.  Tertiary templates for proteins. Use of packing criteria in the enumeration of allowed sequences for different structural classes. , 1987, Journal of molecular biology.

[100]  S L Mayo,et al.  Coupling backbone flexibility and amino acid sequence selection in protein design , 1997, Protein science : a publication of the Protein Society.

[101]  G. A. Lazar,et al.  Rotamer strain as a determinant of protein structural specificity , 1999, Protein science : a publication of the Protein Society.

[102]  H. Scheraga,et al.  Energy parameters in polypeptides. 10. Improved geometrical parameters and nonbonded interactions for use in the ECEPP/3 algorithm, with application to proline-containing peptides , 1994 .

[103]  B. Matthews,et al.  Structural basis of amino acid alpha helix propensity. , 1993, Science.

[104]  R. L. Baldwin,et al.  Helix propensities of the amino acids measured in alanine‐based peptides without helix‐stabilizing side‐chain interactions , 1994, Protein science : a publication of the Protein Society.

[105]  D. Case,et al.  Modification of the Generalized Born Model Suitable for Macromolecules , 2000 .

[106]  M. Levitt,et al.  De novo protein design. II. Plasticity in sequence space. , 1999, Journal of molecular biology.

[107]  M. Karplus,et al.  CHARMM: A program for macromolecular energy, minimization, and dynamics calculations , 1983 .

[108]  F. Crick,et al.  The packing of α‐helices: simple coiled‐coils , 1953 .

[109]  C. M. Summa,et al.  INAUGURAL ARTICLE by a Recently Elected Academy Member:Retrostructural analysis of metalloproteins: Application to the design of a minimal model for diiron proteins , 2000 .