The Protein-Folding Problem, 50 Years On

Protein Folding: Past and Future Fifty years ago the Nobel Prize in chemistry was awarded to Max Perutz and John Kendrew for determining the structure of globular proteins. Since first viewing their structure of myoglobin, scientists have sought to understand protein folding. Dill and MacCallum (p. 1042) review the progress that has been made on three central questions: What is the code that relates sequence to structure? How do proteins fold so fast? Can protein structure be computationally predicted? While we have come some way toward answering these questions, new questions have been gene rated. It is no longer useful to talk about “solving the protein-folding problem”—protein folding has grown into a field of research where the next 50 years promise to be as exciting as the last. The protein-folding problem was first posed about one half-century ago. The term refers to three broad questions: (i) What is the physical code by which an amino acid sequence dictates a protein’s native structure? (ii) How can proteins fold so fast? (iii) Can we devise a computer algorithm to predict protein structures from their sequences? We review progress on these problems. In a few cases, computer simulations of the physical forces in chemically detailed models have now achieved the accurate folding of small proteins. We have learned that proteins fold rapidly because random thermal motions cause conformational changes leading energetically downhill toward the native structure, a principle that is captured in funnel-shaped energy landscapes. And thanks in part to the large Protein Data Bank of known structures, predicting protein structures is now far more successful than was thought possible in the early days. What began as three questions of basic science one half-century ago has now grown into the full-fledged research field of protein physical science.

[1]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[2]  L. Pauling,et al.  The structure of proteins; two hydrogen-bonded helical configurations of the polypeptide chain. , 1951, Proceedings of the National Academy of Sciences of the United States of America.

[3]  F. Young Biochemistry , 1955, The Indian Medical Gazette.

[4]  J. Kendrew,et al.  A Three-Dimensional Model of the Myoglobin Molecule Obtained by X-Ray Analysis , 1958, Nature.

[5]  M. Perutz,et al.  Structure of Hæmoglobin: A Three-Dimensional Fourier Synthesis at 5.5-Å. Resolution, Obtained by X-Ray Analysis , 1960, Nature.

[6]  R. G. Hart,et al.  Structure of Myoglobin: A Three-Dimensional Fourier Synthesis at 2 Å. Resolution , 1960, Nature.

[7]  M. Karplus,et al.  Dynamics of folded proteins , 1977, Nature.

[8]  K. Dill Theory for the folding and stability of globular proteins. , 1985, Biochemistry.

[9]  P. Wolynes,et al.  Spin glasses and the statistical mechanics of protein folding. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[10]  P. S. Kim,et al.  Intermediates in the folding reactions of small proteins. , 1990, Annual review of biochemistry.

[11]  K. Dill Dominant forces in protein folding. , 1990, Biochemistry.

[12]  R. Zwanzig,et al.  Levinthal's paradox. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[13]  J. Onuchic,et al.  Protein folding funnels: a kinetic approach to the sequence-structure relationship. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Matthews Cr PATHWAYS OF PROTEIN FOLDING , 1993 .

[15]  L. H. Bradley,et al.  Protein design by binary patterning of polar and nonpolar amino acids. , 1993, Methods in molecular biology.

[16]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[17]  J. Onuchic,et al.  Funnels, pathways, and the energy landscape of protein folding: A synthesis , 1994, Proteins.

[18]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[19]  K. Dill,et al.  From Levinthal to pathways to funnels , 1997, Nature Structural Biology.

[20]  Samuel H. Gellman,et al.  Foldamers: A Manifesto , 1998 .

[21]  Stanley B. Prusiner,et al.  Nobel Lecture: Prions , 1998 .

[22]  Richard Bonneau,et al.  Ab initio protein structure prediction of CASP III targets using ROSETTA , 1999, Proteins.

[23]  K A Dill,et al.  Designing polymers that mimic biomolecules. , 1999, Current opinion in structural biology.

[24]  Vijay S. Pande,et al.  Screen Savers of the World Unite! , 2000, Science.

[25]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[26]  김삼묘,et al.  “Bioinformatics” 특집을 내면서 , 2000 .

[27]  Richard Bonneau,et al.  Rosetta in CASP4: Progress in ab initio protein structure prediction , 2001, Proteins.

[28]  David C. Jones Predicting novel protein folds by using FRAGFOLD , 2001, Proteins.

[29]  C. Brooks,et al.  From folding theories to folding proteins: a review and assessment of simulation studies of protein folding and unfolding. , 2001, Annual review of physical chemistry.

[30]  Ajay K. Royyuru,et al.  Blue Gene: A vision for protein science using a petaflop supercomputer , 2001, IBM Syst. J..

[31]  Sandor Vajda,et al.  CAPRI: A Critical Assessment of PRedicted Interactions , 2003, Proteins.

[32]  Adam Zemla,et al.  LGA: a method for finding 3D similarities in protein structures , 2003, Nucleic Acids Res..

[33]  Liam J McGuffin,et al.  Assembling novel protein folds from super‐secondary structural fragments , 2003, Proteins.

[34]  C. Dobson Protein folding and misfolding , 2003, Nature.

[35]  J. Ponder,et al.  Force fields for protein simulations. , 2003, Advances in protein chemistry.

[36]  D. Baker,et al.  Design of a Novel Globular Protein Fold with Atomic-Level Accuracy , 2003, Science.

[37]  Johannes Söding,et al.  The HHpred interactive server for protein homology detection and structure prediction , 2005, Nucleic Acids Res..

[38]  P. Bradley,et al.  Toward High-Resolution de Novo Structure Prediction for Small Proteins , 2005, Science.

[39]  Byoung-Chul Lee,et al.  Folding a nonbiological polymer into a compact multihelical structure. , 2005, Journal of the American Chemical Society.

[40]  宁北芳,et al.  疟原虫var基因转换速率变化导致抗原变异[英]/Paul H, Robert P, Christodoulou Z, et al//Proc Natl Acad Sci U S A , 2005 .

[41]  John Moult,et al.  A decade of CASP: progress, bottlenecks and prognosis in protein structure prediction. , 2005, Current opinion in structural biology.

[42]  S Walter Englander,et al.  Protein folding and misfolding: mechanism and principles , 2007, Quarterly Reviews of Biophysics.

[43]  Julia Hockenmaier,et al.  Routes are trees: The parsing perspective on protein folding , 2006, Proteins.

[44]  Vincent A Voelz,et al.  Exploring zipping and assembly as a protein folding principle , 2006, Proteins.

[45]  W Seth Horne,et al.  Foldamers with heterogeneous backbones. , 2008, Accounts of chemical research.

[46]  Alan R. Fersht,et al.  From the first protein structures to our current knowledge of protein folding: delights and scepticisms , 2008, Nature Reviews Molecular Cell Biology.

[47]  K. Dill,et al.  Biomimetic nanostructures: creating a high-affinity zinc-binding site in a folded nonbiological polymer. , 2008, Journal of the American Chemical Society.

[48]  J. P. Grossman,et al.  Anton, a special-purpose machine for molecular dynamics simulation , 2008, CACM.

[49]  K. Dill,et al.  The protein folding problem. , 1993, Annual review of biophysics.

[50]  K. Kirshenbaum,et al.  Photoresponsive peptoid oligomers bearing azobenzene side chains. , 2008, Organic & biomolecular chemistry.

[51]  R. Rosenfeld Nature , 2009, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.

[52]  Mark B Gerstein,et al.  Computational analysis of membrane proteins: the largest class of drug targets. , 2009, Drug discovery today.

[53]  Helen E Blackwell,et al.  Structure-function relationships in peptoids: recent advances toward deciphering the structural requirements for biological function. , 2009, Organic & biomolecular chemistry.

[54]  Charles L. Brooks,et al.  Community-wide assessment of GPCR structure modelling and ligand docking: GPCR Dock 2008 , 2009, Nature Reviews Drug Discovery.

[55]  J. Guthrie,et al.  A blind challenge for computational solvation free energies: introduction and overview. , 2009, The journal of physical chemistry. B.

[56]  Jasmine L. Gallaher,et al.  Computational Design of an Enzyme Catalyst for a Stereoselective Bimolecular Diels-Alder Reaction , 2010, Science.

[57]  Ryan A. Mesch,et al.  Free-floating ultrathin two-dimensional crystals from sequence-specific peptoid polymers. , 2010, Nature materials.

[58]  Adrien Treuille,et al.  Predicting protein structures with a multiplayer online game , 2010, Nature.

[59]  A. Dunker,et al.  Understanding protein non-folding. , 2010, Biochimica et biophysica acta.

[60]  Joseph A. Bank,et al.  Supporting Online Material Materials and Methods Figs. S1 to S10 Table S1 References Movies S1 to S3 Atomic-level Characterization of the Structural Dynamics of Proteins , 2022 .

[61]  Vincent A Voelz,et al.  Taming the complexity of protein folding. , 2011, Current opinion in structural biology.

[62]  Timothy A. Whitehead,et al.  Computational Design of Proteins Targeting the Conserved Stem Region of Influenza Hemagglutinin , 2011, Science.

[63]  Thomas A. Hopf,et al.  Protein 3D Structure Computed from Evolutionary Sequence Variation , 2011, PloS one.

[64]  R. Dror,et al.  How Fast-Folding Proteins Fold , 2011, Science.

[65]  Annelise E Barron,et al.  Peptoids: bio-inspired polymers as potential pharmaceuticals. , 2011, Current pharmaceutical design.

[66]  D. Baker,et al.  Computation-Guided Backbone Grafting of a Discontinuous Motif onto a Protein Scaffold , 2011, Science.

[67]  Krzysztof Fidelis,et al.  CASP9 results compared to those of previous casp experiments , 2011, Proteins.

[68]  A. Tramontano,et al.  Critical assessment of methods of protein structure prediction (CASP)—round IX , 2011, Proteins.

[69]  Torsten Schwede,et al.  Assessment of template based protein structure predictions in CASP9 , 2011, Proteins.

[70]  C. Sander,et al.  Direct-coupling analysis of residue coevolution captures native contacts across many protein families , 2011, Proceedings of the National Academy of Sciences.

[71]  D. Tieleman,et al.  Hydrophobicity scales: a thermodynamic looking glass into lipid-protein interactions. , 2011, Trends in biochemical sciences.

[72]  Sarah Rauscher,et al.  Molecular mechanism of β‐sheet self‐organization at water‐hydrophobic interfaces , 2011, Proteins.

[73]  Stefano Piana,et al.  Refinement of protein structure homology models via long, all‐atom molecular dynamics simulations , 2012, Proteins.

[74]  A. Biegert,et al.  HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment , 2011, Nature Methods.

[75]  Massimiliano Pontil,et al.  PSICOV: precise structural contact prediction using sparse inverse covariance estimation on large multiple sequence alignments , 2012, Bioinform..