Practically Useful: What the Rosetta Protein Modeling Suite Can Do for You

The objective of this review is to enable researchers to use the software package Rosetta for biochemical and biomedicinal studies. We provide a brief review of the six most frequent research problems tackled with Rosetta. For each of these six tasks, we provide a tutorial that illustrates a basic Rosetta protocol. The Rosetta method was originally developed for de novo protein structure prediction and is regularly one of the best performers in the community-wide biennial Critical Assessment of Structure Prediction. Predictions for protein domains with fewer than 125 amino acids regularly have a backbone root-mean-square deviation of better than 5.0 Å. More impressively, there are several cases in which Rosetta has been used to predict structures with atomic level accuracy better than 2.5 Å. In addition to de novo structure prediction, Rosetta also has methods for molecular docking, homology modeling, determining protein structures from sparse experimental NMR or EPR data, and protein design. Rosetta has been used to accurately design a novel protein structure, predict the structure of protein−protein complexes, design altered specificity protein−protein and protein−DNA interactions, and stabilize proteins and protein complexes. Most recently, Rosetta has been used to solve the X-ray crystallographic phase problem.

[1]  David E. Kim,et al.  Physically realistic homology models built with ROSETTA can be more accurate than their templates. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[2]  M. Karplus,et al.  Effective energy function for proteins in solution , 1999, Proteins.

[3]  Jeffrey J. Gray,et al.  Modeling the structure of mAb 14B7 bound to the anthrax protective antigen , 2007, Proteins.

[4]  C. Levinthal Are there pathways for protein folding , 1968 .

[5]  J. Thornton,et al.  PROCHECK: a program to check the stereochemical quality of protein structures , 1993 .

[6]  Jeffrey J. Gray,et al.  Incorporating biochemical information and backbone flexibility in RosettaDock for CAPRI rounds 6–12 , 2007, Proteins.

[7]  Stephen L Mayo,et al.  A de novo designed protein–protein interface , 2007, Protein science : a publication of the Protein Society.

[8]  Jens Meiler,et al.  New algorithms and an in silico benchmark for computational enzyme design , 2006, Protein science : a publication of the Protein Society.

[9]  F. Avilés,et al.  The three‐dimensional structure of human procarboxypeptidase A2. Deciphering the basis of the inhibition, activation and intrinsic activity of the zymogen , 1997, The EMBO journal.

[10]  Eric Gouaux,et al.  Crystal structure of a bacterial homologue of Na+/Cl--dependent neurotransmitter transporters , 2005, Nature.

[11]  D. Baker,et al.  The 3D profile method for identifying fibril-forming segments of proteins. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[12]  D. Baker,et al.  Computational design of a new hydrogen bond network and at least a 300-fold specificity switch at a protein-protein interface. , 2006, Journal of molecular biology.

[13]  Richard Bonneau,et al.  De novo prediction of three-dimensional structures for major protein families. , 2002, Journal of molecular biology.

[14]  D. Baker,et al.  Improved recognition of native‐like protein structures using a combination of sequence‐dependent and sequence‐independent features of proteins , 1999, Proteins.

[15]  D. Baker,et al.  Rapid protein fold determination using unassigned NMR data , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[16]  Adrian A Canutescu,et al.  Cyclic coordinate descent: A robotics algorithm for protein loop closure , 2003, Protein science : a publication of the Protein Society.

[17]  C L Brooks,et al.  Ligand-protein database: linking protein-ligand complex structures to binding data. , 2001, Journal of medicinal chemistry.

[18]  Tanja Kortemme,et al.  Computer-aided design of functional protein interactions. , 2009, Nature chemical biology.

[19]  David Baker,et al.  proteins STRUCTURE O FUNCTION O BIOINFORMATICS Improving NMR protein structure quality by Rosetta refinement: A molecular , 2022 .

[20]  D. Baker,et al.  Design of a Novel Globular Protein Fold with Atomic-Level Accuracy , 2003, Science.

[21]  Jeffrey J. Gray,et al.  Protein-protein docking with simultaneous optimization of rigid-body displacement and side-chain conformations. , 2003, Journal of molecular biology.

[22]  D. Baker,et al.  A large scale test of computational protein design: folding and stability of nine completely redesigned globular proteins. , 2003, Journal of molecular biology.

[23]  P. Bradley,et al.  Toward High-Resolution de Novo Structure Prediction for Small Proteins , 2005, Science.

[24]  D. Baker,et al.  RosettaHoles: Rapid assessment of protein core packing for structure prediction, refinement, design, and validation , 2008, Protein science : a publication of the Protein Society.

[25]  F. Young Biochemistry , 1955, The Indian Medical Gazette.

[26]  Jeffrey J. Gray,et al.  Structural model of the mAb 806-EGFR complex using computational docking followed by computational and experimental mutagenesis. , 2006, Structure.

[27]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1978, Archives of biochemistry and biophysics.

[28]  Richard Bonneau,et al.  Rosetta in CASP4: Progress in ab initio protein structure prediction , 2001, Proteins.

[29]  Timothy D. Fenn,et al.  Rab and Arl GTPase Family Members Cooperate in the Localization of the Golgin GCC185 , 2008, Cell.

[30]  Tanja Kortemme,et al.  Design of Multi-Specificity in Protein Interfaces , 2007, PLoS Comput. Biol..

[31]  Lars Malmström,et al.  Automated prediction of CASP‐5 structures using the Robetta server , 2003, Proteins.

[32]  Jens Meiler,et al.  Rosetta predictions in CASP5: Successes, failures, and prospects for complete automation , 2003, Proteins.

[33]  Eric A. Althoff,et al.  De Novo Computational Design of Retro-Aldol Enzymes , 2008, Science.

[34]  B. Stoddard,et al.  Design, activity, and structure of a highly specific artificial endonuclease. , 2002, Molecular cell.

[35]  Oliver F. Lange,et al.  Structure prediction for CASP8 with all‐atom refinement using Rosetta , 2009, Proteins.

[36]  O. Schueler‐Furman,et al.  Progress in protein–protein docking: Atomic resolution predictions in the CAPRI experiment using RosettaDock with an improved treatment of side‐chain flexibility , 2005, Proteins.

[37]  K. Takano ON SOLUTION OF , 1983 .

[38]  D. Baker,et al.  Toward high-resolution prediction and design of transmembrane helical protein structures , 2007, Proceedings of the National Academy of Sciences.

[39]  Jonathan W. Essex,et al.  A review of protein-small molecule docking methods , 2002, J. Comput. Aided Mol. Des..

[40]  D. Baker,et al.  Simultaneous prediction of protein folding and docking at high resolution , 2009, Proceedings of the National Academy of Sciences.

[41]  David Baker,et al.  Blind docking of pharmaceutically relevant compounds using RosettaLigand , 2009, Protein science : a publication of the Protein Society.

[42]  D Baker,et al.  Local sequence-structure correlations in proteins. , 1996, Current opinion in biotechnology.

[43]  Eric A. Althoff,et al.  Kemp elimination catalysts by computational enzyme design , 2008, Nature.

[44]  Jeffrey J. Gray,et al.  Conformer selection and induced fit in flexible backbone protein-protein docking using computational and NMR ensembles. , 2008, Journal of molecular biology.

[45]  Lars Malmström,et al.  Prediction of CASP6 structures using automated robetta protocols , 2005, Proteins.

[46]  B. Kuhlman,et al.  Computational design of a single amino acid sequence that can switch between two distinct protein folds. , 2006, Journal of the American Chemical Society.

[47]  Yi Liu,et al.  RosettaDesign server for protein design , 2006, Nucleic Acids Res..

[48]  D. Baker,et al.  De novo protein structure generation from incomplete chemical shift assignments , 2009, Journal of biomolecular NMR.

[49]  David Baker,et al.  Protein Structure Prediction Using Rosetta , 2004, Numerical Computer Methods, Part D.

[50]  Ian W. Davis,et al.  RosettaLigand docking with full ligand and receptor flexibility. , 2009, Journal of molecular biology.

[51]  David Baker,et al.  Protein-protein docking with backbone flexibility. , 2007, Journal of molecular biology.

[52]  Roland L. Dunbrack,et al.  Backbone-dependent rotamer library for proteins. Application to side-chain prediction. , 1993, Journal of molecular biology.

[53]  D. Baker,et al.  High-resolution Structural and Thermodynamic Analysis of Extreme Stabilization of Human Procarboxypeptidase by Computational Protein Design , 2007, Journal of molecular biology.

[54]  David Baker,et al.  Protein structure prediction and analysis using the Robetta server , 2004, Nucleic Acids Res..

[55]  D. Baker,et al.  Modeling structurally variable regions in homologous proteins with rosetta , 2004, Proteins.

[56]  Carol A Rohl,et al.  Protein structure estimation from minimal restraints using Rosetta. , 2005, Methods in enzymology.

[57]  Jens Meiler,et al.  ROSETTALIGAND: Protein–small molecule docking with full side‐chain flexibility , 2006, Proteins.

[58]  David E. Kim,et al.  Free modeling with Rosetta in CASP6 , 2005, Proteins.

[59]  J. Meiler,et al.  A model for the solution structure of the rod arrestin tetramer. , 2008, Structure.

[60]  Jack Snoeyink,et al.  Rotamer-Pair Energy Calculations Using a Trie Data Structure , 2005, WABI.

[61]  A. Bax,et al.  Protein backbone angle restraints from searching a database for chemical shift and sequence homology , 1999, Journal of biomolecular NMR.

[62]  C Kooperberg,et al.  Assembly of protein tertiary structures from fragments with similar local sequences using simulated annealing and Bayesian scoring functions. , 1997, Journal of molecular biology.

[63]  R. Blakely,et al.  Structural determinants of species‐selective substrate recognition in human and Drosophila serotonin transporters revealed through computational docking studies , 2009, Proteins.

[64]  B. Stoddard,et al.  Computational Thermostabilization of an Enzyme , 2005, Science.

[65]  D. Baker,et al.  De novo protein structure determination using sparse NMR data , 2000, Journal of biomolecular NMR.

[66]  E. Coutsias,et al.  Sub-angstrom accuracy in protein loop reconstruction by robotics-inspired conformational sampling , 2009, Nature Methods.

[67]  Tanja Kortemme,et al.  Multi‐constraint computational design suggests that native sequences of germline antibody H3 loops are nearly optimal for conformational flexibility , 2009, Proteins.

[68]  P. Bradley,et al.  High-resolution structure prediction and the crystallographic phase problem , 2007, Nature.

[69]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[70]  D. Baker,et al.  Clustering of low-energy conformations near the native structures of small proteins. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[71]  D. J. Price,et al.  Assessing scoring functions for protein-ligand interactions. , 2004, Journal of medicinal chemistry.

[72]  P. Stewart,et al.  Hybrid approaches: applying computational methods in cryo-electron microscopy. , 2009, Current opinion in structural biology.

[73]  David Baker,et al.  Macromolecular modeling with rosetta. , 2008, Annual review of biochemistry.

[74]  D. Baker,et al.  De novo determination of protein backbone structure from residual dipolar couplings using Rosetta. , 2002, Journal of the American Chemical Society.

[75]  Lars Malmström,et al.  Structure prediction for CASP7 targets using extensive all‐atom refinement with Rosetta@home , 2007, Proteins.

[76]  Jens Meiler,et al.  De novo high-resolution protein structure determination from sparse spin-labeling EPR data. , 2008, Structure.

[77]  Oliver F. Lange,et al.  Consistent blind protein structure generation from NMR chemical shift data , 2008, Proceedings of the National Academy of Sciences.

[78]  Jens Meiler,et al.  Small Molecule Rotamers Enable Simultaneous Optimization of Small Molecule and Protein Degrees of Freedom in ROSETTALIGAND Docking , 2008 .

[79]  D. Baker,et al.  Native protein sequences are close to optimal for their structures. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[80]  Jeffrey J. Gray,et al.  SnugDock: Paratope Structural Optimization during Antibody-Antigen Docking Compensates for Errors in Antibody Homology Models , 2010, PLoS Comput. Biol..

[81]  D. Baker,et al.  Computational redesign of protein-protein interaction specificity , 2004, Nature Structural &Molecular Biology.

[82]  P. Stewart,et al.  EM-fold: De novo folding of alpha-helical proteins guided by intermediate-resolution electron microscopy density maps. , 2009, Structure.

[83]  David Baker,et al.  Prospects for de novo phasing with de novo protein models , 2009, Acta crystallographica. Section D, Biological crystallography.

[84]  H. V. van Vlijmen,et al.  Structure activity relationships of monocyte chemoattractant proteins in complex with a blocking antibody. , 2006, Protein engineering, design & selection : PEDS.

[85]  D. Baker,et al.  Refinement of protein structures into low-resolution density maps using rosetta. , 2009, Journal of molecular biology.

[86]  Sergey Lyskov,et al.  The RosettaDock server for local protein–protein docking , 2008, Nucleic Acids Res..

[87]  A. Sali,et al.  Statistical potential for assessment and prediction of protein structures , 2006, Protein science : a publication of the Protein Society.

[88]  Chaok Seok,et al.  A kinematic view of loop closure , 2004, J. Comput. Chem..