Advances in protein structure prediction and de novo protein design: A review

This review provides an exposition of two important problems: (i) structure prediction in protein folding and (ii) de novo protein design. Recent advances in protein folding are reviewed according to a classification of the approaches into comparative modeling, fold recognition, and first-principles methods with and without database information. Advances on the challenging problem of loop structure prediction, the first-principles method ASTRO-FOLD, and developments in force fields are discussed. Finally, recent progress in de novo protein design is presented, with a focus on template flexibility, in silico sequence selection, and successful peptide and protein designs.
