Protein WISDOM: a workbench for in silico de novo design of biomolecules.

The aim of de novo protein design is to find the amino acid sequences that will fold into a desired 3-dimensional structure with improvements in specific properties, such as binding affinity, agonist or antagonist behavior, or stability, relative to the native sequence. Protein design lies at the center of current advances drug design and discovery. Not only does protein design provide predictions for potentially useful drug targets, but it also enhances our understanding of the protein folding process and protein-protein interactions. Experimental methods such as directed evolution have shown success in protein design. However, such methods are restricted by the limited sequence space that can be searched tractably. In contrast, computational design strategies allow for the screening of a much larger set of sequences covering a wide variety of properties and functionality. We have developed a range of computational de novo protein design methods capable of tackling several important areas of protein design. These include the design of monomeric proteins for increased stability and complexes for increased binding affinity. To disseminate these methods for broader use we present Protein WISDOM (http://www.proteinwisdom.org), a tool that provides automated methods for a variety of protein design problems. Structural templates are submitted to initialize the design process. The first stage of design is an optimization sequence selection stage that aims at improving stability through minimization of potential energy in the sequence space. Selected sequences are then run through a fold specificity stage and a binding affinity stage. A rank-ordered list of the sequences for each step of the process, along with relevant designed structures, provides the user with a comprehensive quantitative assessment of the design. Here we provide the details of each design method, as well as several notable experimental successes attained through the use of the methods.

[1]  C A Floudas,et al.  Distance dependent centroid to centroid force fields using high resolution decoys , 2008, Proteins.

[2]  John L. Klepeis,et al.  Free energy calculations for peptides via deterministic global optimization , 1999 .

[3]  John L. Klepeis,et al.  A new class of hybrid global optimization algorithms for peptide structure prediction: integrated hybrids , 2003 .

[4]  Bernd Hartke,et al.  Global optimization , 2011 .

[5]  Christodoulos A. Floudas,et al.  Novel formulations for the sequence selection problem in de novo protein design with flexible templates , 2007, Optim. Methods Softw..

[6]  Jeffrey J. Gray,et al.  Protein-protein docking with simultaneous optimization of rigid-body displacement and side-chain conformations. , 2003, Journal of molecular biology.

[7]  John L. Klepeis,et al.  Analysis and prediction of loop segments in protein structures , 2005, Comput. Chem. Eng..

[8]  J. Ponder,et al.  Tertiary templates for proteins. Use of packing criteria in the enumeration of allowed sequences for different structural classes. , 1987, Journal of molecular biology.

[9]  S L Mayo,et al.  Coupling backbone flexibility and amino acid sequence selection in protein design , 1997, Protein science : a publication of the Protein Society.

[10]  S. L. Mayo,et al.  Automated design of the surface positions of protein helices , 1997, Protein science : a publication of the Protein Society.

[11]  C. Floudas,et al.  ASTRO-FOLD: a combinatorial and global optimization framework for Ab initio prediction of three-dimensional structures of proteins from the amino acid sequence. , 2003, Biophysical journal.

[12]  C. Floudas,et al.  A New Generation of Potent Complement Inhibitors of the Compstatin Family , 2011, Chemical biology & drug design.

[13]  Drexler Ke,et al.  Molecular engineering: An approach to the development of general capabilities for molecular manipulation. , 1981, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Bruce Randall Donald,et al.  A Novel Ensemble-Based Scoring and Search Algorithm for Protein Redesign and Its Application to Modify the Substrate Specificity of the Gramicidin Synthetase A Phenylalanine Adenylation Enzyme , 2005, J. Comput. Biol..

[15]  D. Baker,et al.  Native protein sequences are close to optimal for their structures. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[16]  C A Floudas,et al.  Protein loop structure prediction with flexible stem geometries , 2005, Proteins.

[17]  H. K. Fung,et al.  Discovery of entry inhibitors for HIV-1 via a new de novo protein design framework. , 2010, Biophysical journal.

[18]  David Baker,et al.  Protein–protein docking predictions for the CAPRI experiment , 2003, Proteins.

[19]  P. S. Kim,et al.  High-resolution protein design with backbone freedom. , 1998, Science.

[20]  D. Baker,et al.  Design of a Novel Globular Protein Fold with Atomic-Level Accuracy , 2003, Science.

[21]  H. K. Fung,et al.  New compstatin variants through two de novo protein design frameworks. , 2010, Biophysical journal.

[22]  Christodoulos A. Floudas,et al.  A network flow model for biclustering via optimal re-ordering of data matrices , 2010, J. Glob. Optim..

[23]  Christodoulos A. Floudas,et al.  CONCORD: a consensus method for protein secondary structure prediction via mixed integer linear optimization , 2012, Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[24]  Christodoulos A. Floudas,et al.  Biclustering via optimal re-ordering of data matrices in systems biology: rigorous methods and comparative studies , 2008, BMC Bioinformatics.

[25]  C. Floudas,et al.  Reconstitution and engineering of apoptotic protein interactions on the bacterial cell surface. , 2009, Journal of molecular biology.

[26]  Lynne Regan,et al.  The de novo design of a rubredoxin‐like fe site , 1998, Protein science : a publication of the Protein Society.

[27]  D. Baker,et al.  De novo determination of protein backbone structure from residual dipolar couplings using Rosetta. , 2002, Journal of the American Chemical Society.

[28]  Christodoulos A. Floudas,et al.  Research Challenges, Opportunities and Synergism in Systems Engineering and Computational Biology , 2005 .

[29]  Christodoulos A. Floudas,et al.  A novel high resolution CαCα distance dependent force field based on a high quality decoy set , 2006 .

[30]  K. Wüthrich,et al.  Torsion angle dynamics for NMR structure calculation with the new program DYANA. , 1997, Journal of molecular biology.

[31]  C A Floudas,et al.  Structure prediction of loops with fixed and flexible stems. , 2012, The journal of physical chemistry. B.

[32]  J. L. Klepeis,et al.  Predicting peptide structures using NMR data and deterministic global optimization , 1999 .

[33]  P. Kollman,et al.  A Second Generation Force Field for the Simulation of Proteins, Nucleic Acids, and Organic Molecules , 1995 .

[34]  C. Floudas,et al.  Novel approach for α‐helical topology prediction in globular proteins: Generation of interhelical restraints , 2006 .

[35]  John L. Klepeis,et al.  Prediction of β‐sheet topology and disulfide bridges in polypeptides , 2003, J. Comput. Chem..

[36]  C. Floudas,et al.  Contact prediction for beta and alpha‐beta proteins using integer linear optimization and its impact on the first principles 3D structure prediction method ASTRO‐FOLD , 2010, Proteins.

[37]  Christodoulos A Floudas,et al.  Molecular Dynamics in Drug Design: New Generations of Compstatin Analogs , 2012, Chemical biology & drug design.

[38]  P. Güntert Automated NMR structure calculation with CYANA. , 2004, Methods in molecular biology.

[39]  H. K. Fung,et al.  Computational de novo Peptide and Protein Design: Rigid Templates versus Flexible Templates , 2008 .

[40]  C. Floudas,et al.  β-sheet Topology Prediction with High Precision and Recall for β and Mixed α/β Proteins , 2012, PloS one.

[41]  Christodoulos A. Floudas,et al.  Advances in protein structure prediction and de novo protein design : A review , 2006 .

[42]  John L. Klepeis,et al.  Deterministic Global Optimization and Ab Initio Approaches for the Structure Prediction of Polypeptides, Dynamics of Protein Folding, and Protein‐Protein Interactions , 2002 .

[43]  H. K. Fung,et al.  De novo peptide design with C3a receptor agonist and antagonist activities: theoretical predictions and experimental validation. , 2012, Journal of medicinal chemistry.

[44]  J R Desjarlais,et al.  Side-chain and backbone flexibility in protein core design. , 1999, Journal of molecular biology.

[45]  S. L. Mayo,et al.  Protein design automation , 1996, Protein science : a publication of the Protein Society.

[46]  David Baker,et al.  Protein Structure Prediction Using Rosetta , 2004, Numerical Computer Methods, Part D.

[47]  C. Pabo Molecular technology: Designing proteins and peptides , 1983, Nature.

[48]  Martin S. Taylor,et al.  Toward full-sequence de novo protein design with flexible templates for human beta-defensin-2. , 2008, Biophysical journal.

[49]  Jeffrey J. Gray,et al.  CAPRI rounds 3–5 reveal promising successes and future challenges for RosettaDock , 2005, Proteins.

[50]  Christodoulos A Floudas,et al.  Integrated computational and experimental approach for lead optimization and design of compstatin variants with improved activity. , 2003, Journal of the American Chemical Society.

[51]  J L Klepeis,et al.  Hybrid global optimization algorithms for protein structure prediction: alternating hybrids. , 2003, Biophysical journal.

[52]  Panos M. Pardalos,et al.  Computational Comparison Studies of Quadratic Assignment Like Formulations for the In Silico Sequence Selection Problem in De Novo Protein Design , 2005, J. Comb. Optim..