The long coming of computational structural biology.

Fifty years ago, the structures of the alpha-helix, the beta-sheet, the alpha-helical coiled coil and the collagen triple helix raised the expectation that protein structure could be understood computationally, using a combination of geometric considerations, model-building and parametric equations. The first crystal structures dispelled this hope, revealing a disconcerting lack of regularity in the folding patterns of proteins. Gradually it became clear that the protein folding problem-namely deducing the structure of a protein from its amino acid sequence-was of exceptional difficulty. Its solution has remained outside our reach to this day and, arguably, it represents the most important unsolved problem in molecular biology. Nevertheless, our ability to understand and predict molecular structure by computational means has made steady progress, suggesting that we will eventually conquer the problem, not by a few heroic insights, but by steady advances in biophysical knowledge, biological databases, software applications and raw computer power. Computational structural biology, whose influence is already pervasive, will come to dominate structural approaches in the next decades.

[1]  A. Kajava,et al.  Review: proteins with repeated sequence--structural prediction and modeling. , 2001, Journal of structural biology.

[2]  Krzysztof Fidelis,et al.  Progress from CASP6 to CASP7 , 2007, Proteins.

[3]  L. Pauling,et al.  Two hydrogen-bonded spiral configurations of the polypeptide chain , 1950 .

[4]  C. Ponting,et al.  Protein repeats: structures, functions, and evolution. , 2001, Journal of structural biology.

[5]  Erik L. L. Sonnhammer,et al.  A Hidden Markov Model for Predicting Transmembrane Helices in Protein Sequences , 1998, ISMB.

[6]  K. Aertgeerts,et al.  Rational design of complex formation between plasminogen activator inhibitor-1 and its target proteinases. , 1997, Journal of structural biology.

[7]  Brendan Borrell,et al.  Chemistry: Power play , 2008, Nature.

[8]  V. Pande,et al.  Heterogeneity even at the speed limit of folding: large-scale molecular dynamics study of a fast-folding variant of the villin headpiece. , 2007, Journal of molecular biology.

[9]  D. Lipman,et al.  Rapid similarity searches of nucleic acid and protein data banks. , 1983, Proceedings of the National Academy of Sciences of the United States of America.

[10]  Peter A. Kollman,et al.  AMBER: Assisted model building with energy refinement. A general program for modeling molecules and their interactions , 1981 .

[11]  T M Handel,et al.  Review: protein design--where we were, where we are, where we're going. , 2001, Journal of structural biology.

[12]  A. Steven,et al.  New HEAT-like repeat motifs in proteins regulating proteasome structure and function. , 2004, Journal of structural biology.

[13]  A. Mclachlan,et al.  Repeating Sequences and Gene Duplication in Proteins , 1989 .

[14]  P. Bradley,et al.  Toward High-Resolution de Novo Structure Prediction for Small Proteins , 2005, Science.

[15]  D E McRee,et al.  XtalView/Xfit--A versatile program for manipulating atomic coordinates and electron density. , 1999, Journal of structural biology.

[16]  John Moult,et al.  Comparative modeling in structural genomics. , 2008, Structure.

[17]  L. Pauling,et al.  The pleated sheet, a new layer configuration of polypeptide chains. , 1951, Proceedings of the National Academy of Sciences of the United States of America.

[18]  C. Levinthal Are there pathways for protein folding , 1968 .

[19]  A. Mclachlan Gene duplication and the origin of repetitive protein structures. , 1987, Cold Spring Harbor symposia on quantitative biology.

[20]  F. Crick,et al.  Is α-Keratin a Coiled Coil? , 1952, Nature.

[21]  D. Parry Coiled-coils in α-helix-containing proteins: analysis of the residue types within the heptad repeat and the use of these data in the prediction of coiled-coils in other proteins , 1982, Bioscience reports.

[22]  C. Anfinsen,et al.  The kinetics of formation of native ribonuclease during oxidation of the reduced polypeptide chain. , 1961, Proceedings of the National Academy of Sciences of the United States of America.

[23]  K Schulten,et al.  VMD: visual molecular dynamics. , 1996, Journal of molecular graphics.

[24]  J M Thornton,et al.  Successful protein fold recognition by optimal sequence threading validated by rigorous blind testing , 1995, Proteins.

[25]  M. Karplus,et al.  CHARMM: A program for macromolecular energy, minimization, and dynamics calculations , 1983 .

[26]  K Y Sanbonmatsu,et al.  High performance computing in biology: multimillion atom simulations of nanoscale systems. , 2007, Journal of structural biology.

[27]  A. Lupas,et al.  Classification of AAA+ proteins. , 2006, Journal of structural biology.

[28]  Marco Punta,et al.  Membrane protein prediction methods. , 2007, Methods.

[29]  Sergei V Strelkov,et al.  Analysis of alpha-helical coiled coils with the program TWISTER reveals a structural mechanism for stutter compensation. , 2002, Journal of structural biology.

[30]  L. Pauling,et al.  Atomic coordinates and structure factors for two helical configurations of polypeptide chains. , 1951, Proceedings of the National Academy of Sciences of the United States of America.

[31]  F. Sanger,et al.  The amide groups of insulin. , 1955, The Biochemical journal.

[32]  Shoshana J. Wodak,et al.  Knowledge Based Potentials for Predicting the Three-Dimensional Conformation of Proteins , 1994 .

[33]  L. Pauling,et al.  Configuration of Polypeptide Chains , 1951, Nature.

[34]  Vijay S. Pande,et al.  Screen Savers of the World Unite! , 2000, Science.

[35]  D. Woolfson,et al.  Generalized Crick equations for modeling noncanonical coiled coils. , 2002, Journal of structural biology.

[36]  Sean R. Eddy,et al.  Profile hidden Markov models , 1998, Bioinform..

[37]  Detlef D. Leipe,et al.  Evolutionary history and higher order classification of AAA+ ATPases. , 2004, Journal of structural biology.

[38]  C. Ponting,et al.  On the evolution of protein folds: are similar motifs in different protein folds the result of convergence, insertion, or relics of an ancient peptide world? , 2001, Journal of structural biology.

[39]  S. Bryant,et al.  Critical assessment of methods of protein structure prediction (CASP): Round II , 1997, Proteins.

[40]  Ceslovas Venclovas,et al.  Progress over the first decade of CASP experiments , 2005, Proteins.

[41]  B Francis,et al.  Power play. , 1989, Advancing clinical care : official journal of NOAADN.

[42]  Eric Schulman,et al.  A Briefer History of Time , 1999 .

[43]  J. Ramshaw,et al.  Gly-X-Y tripeptide frequencies in collagen: a context for host-guest triple-helical peptides. , 1998, Journal of structural biology.

[44]  Johannes Söding,et al.  Protein homology detection by HMM?CHMM comparison , 2005, Bioinform..

[45]  H. Guy,et al.  Structural models of the transmembrane region of voltage-gated and other K+ channels in open, closed, and inactivated conformations. , 1998, Journal of structural biology.

[46]  Johannes Söding,et al.  Comparative analysis of coiled-coil prediction methods. , 2006, Journal of structural biology.

[47]  Mark Israel,et al.  XtalView, protein structure solution and protein graphics, a short history. , 2008, Journal of structural biology.

[48]  A. Mclachlan Tests for comparing related amino-acid sequences. Cytochrome c and cytochrome c 551 . , 1971, Journal of molecular biology.

[49]  B. Rost,et al.  Prediction of protein secondary structure at better than 70% accuracy. , 1993, Journal of molecular biology.

[50]  B. Rost Review: protein secondary structure prediction continues to rise. , 2001, Journal of structural biology.

[51]  A. Lustig,et al.  Removing an interhelical salt bridge abolishes coiled-coil formation in a de novo designed peptide. , 2002, Journal of structural biology.

[52]  Frieeland Judson Horace The eighth day of creation , 1979 .

[53]  G. N. Ramachandran,et al.  Structure of Collagen , 1955, Nature.

[54]  Francis Crick,et al.  The Fourier transform of a coiled-coil , 1953 .

[55]  Richard Bonneau,et al.  Ab initio protein structure prediction of CASP III targets using ROSETTA , 1999, Proteins.

[56]  Nagasuma Chandra,et al.  PocketDepth: a new depth based algorithm for identification of ligand binding sites in proteins. , 2008, Journal of structural biology.

[57]  Sharon Beder,et al.  Power play. , 1994, Nursing standard (Royal College of Nursing (Great Britain) : 1987).

[58]  R. Jernigan,et al.  Global ribosome motions revealed with elastic network model. , 2004, Journal of structural biology.

[59]  Y. Duan,et al.  Folding free-energy landscape of villin headpiece subdomain from molecular dynamics simulations , 2007, Proceedings of the National Academy of Sciences.

[60]  N. Grishin Fold change in evolution of protein structures. , 2001, Journal of structural biology.

[61]  Paul L. Mariani The Eighth Day of Creation: William Carlos Williams' Late Poems , 1975 .

[62]  K. Koretke,et al.  Bioinformatic analysis of ClpS, a protein module involved in prokaryotic and eukaryotic protein degradation. , 2003, Journal of structural biology.

[63]  F. Crick,et al.  The packing of α‐helices: simple coiled‐coils , 1953 .

[64]  G. Wolff,et al.  Molecular modeling of the myosin-S1(A1) isoform. , 2007, Journal of structural biology.

[65]  Andrei N Lupas,et al.  Phylogenetic analysis of AAA proteins. , 2004, Journal of structural biology.

[66]  L. Pauling,et al.  Compound Helical Configurations of Polypeptide Chains: Structure of Proteins of the α-Keratin Type , 1953, Nature.

[67]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[68]  S. Wodak,et al.  Docking and scoring protein complexes: CAPRI 3rd Edition , 2007, Proteins.