proteins STRUCTURE O FUNCTION O BIOINFORMATICS Improved prediction of protein side-chain conformations with SCWRL4

Determination of side‐chain conformations is an important step in protein structure prediction and protein design. Many such methods have been presented, although only a small number are in widespread use. SCWRL is one such method, and the SCWRL3 program (2003) has remained popular because of its speed, accuracy, and ease‐of‐use for the purpose of homology modeling. However, higher accuracy at comparable speed is desirable. This has been achieved in a new program SCWRL4 through: (1) a new backbone‐dependent rotamer library based on kernel density estimates; (2) averaging over samples of conformations about the positions in the rotamer library; (3) a fast anisotropic hydrogen bonding function; (4) a short‐range, soft van der Waals atom–atom interaction potential; (5) fast collision detection using k‐discrete oriented polytopes; (6) a tree decomposition algorithm to solve the combinatorial problem; and (7) optimization of all parameters by determining the interaction graph within the crystal environment using symmetry operators of the crystallographic space group. Accuracies as a function of electron density of the side chains demonstrate that side chains with higher electron density are easier to predict than those with low‐electron density and presumed conformational disorder. For a testing set of 379 proteins, 86% of χ1 angles and 75% of χ1+2 angles are predicted correctly within 40° of the X‐ray positions. Among side chains with higher electron density (25–100th percentile), these numbers rise to 89 and 80%. The new program maintains its simple command‐line interface, designed for homology modeling, and is now available as a dynamic‐linked library for incorporation into other software programs. Proteins 2009. © 2009 Wiley‐Liss, Inc.

[1]  J. Kirkwood Statistical Mechanics of Fluid Mixtures , 1935 .

[2]  M. Karplus,et al.  CHARMM: A program for macromolecular energy, minimization, and dynamics calculations , 1983 .

[3]  Derek G. Corneil,et al.  Complexity of finding embeddings in a k -tree , 1987 .

[4]  William L. Briggs,et al.  A multigrid tutorial , 1987 .

[5]  C. D. Kemp,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[6]  Hydroxyl hydrogen conformations in trypsin determined by the neutron diffraction solvent difference map method: relative importance of steric and electrostatic factors in defining hydrogen-bonding geometries. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[7]  Johan Desmet,et al.  The dead-end elimination theorem and its use in protein side-chain positioning , 1992, Nature.

[8]  C. Sander,et al.  Evaluation of protein models by atomic solvation preference. , 1992, Journal of molecular biology.

[9]  Simon J. Hubbard,et al.  Department of Biochemistry and Molecular Biology , 2006 .

[10]  Roland L. Dunbrack,et al.  Backbone-dependent rotamer library for proteins. Application to side-chain prediction. , 1993, Journal of molecular biology.

[11]  P. Koehl,et al.  Application of a self-consistent mean field theory to predict protein side-chains conformation and estimate their conformational entropy. , 1994, Journal of molecular biology.

[12]  Roland L. Dunbrack,et al.  Conformational analysis of the backbone-dependent rotamer preferences of protein sidechains , 1994, Nature Structural Biology.

[13]  R. Goldstein Efficient rotamer elimination applied to protein side-chains and related spin glasses. , 1994, Biophysical journal.

[14]  A. Leach,et al.  Ligand docking to proteins with discrete side-chain flexibility. , 1994, Journal of molecular biology.

[15]  David T. Jones,et al.  De novo protein design using pairwise potentials and a genetic algorithm , 1994, Protein science : a publication of the Protein Society.

[16]  S. L. Mayo,et al.  Protein design automation , 1996, Protein science : a publication of the Protein Society.

[17]  P A Kollman,et al.  Modeling protein stability: a theoretical analysis of the stability of T4 lysozyme mutants. , 1997, Protein engineering.

[18]  Roland L. Dunbrack,et al.  Prediction of protein side-chain rotamers from a backbone-dependent rotamer library: a new homology modeling tool. , 1997, Journal of molecular biology.

[19]  I Lasters,et al.  All in one: a highly detailed rotamer library improves both accuracy and speed in the modelling of sidechains by dead-end elimination. , 1997, Folding & design.

[20]  Roland L. Dunbrack,et al.  Bayesian statistical analysis of protein side‐chain rotamer preferences , 1997, Protein science : a publication of the Protein Society.

[21]  Gary L. Miller,et al.  Separators for sphere-packings and nearest neighbor graphs , 1997, JACM.

[22]  M. Sternberg,et al.  Rapid refinement of protein interfaces incorporating solvation: application to the docking problem. , 1998, Journal of molecular biology.

[23]  Joseph S. B. Mitchell,et al.  Efficient Collision Detection Using Bounding Volume Hierarchies of k-DOPs , 1998, IEEE Trans. Vis. Comput. Graph..

[24]  J. Mendes,et al.  Improved modeling of side‐chains in proteins with rotamer‐based methods: A flexible rotamer model , 1999, Proteins.

[25]  William L. Briggs,et al.  A multigrid tutorial, Second Edition , 2000 .

[26]  J. Richardson,et al.  The penultimate rotamer library , 2000, Proteins.

[27]  António M. Baptista,et al.  Implicit solvation in the self-consistent mean field theory method: sidechain modelling and prediction of folding free energies of protein mutants , 2001, J. Comput. Aided Mol. Des..

[28]  Z. Xiang,et al.  Extending the accuracy limits of prediction for side-chain conformations. , 2001, Journal of molecular biology.

[29]  Roland L. Dunbrack Rotamer libraries in the 21st century. , 2002, Current opinion in structural biology.

[30]  George A. Kaminski,et al.  Force Field Validation Using Protein Side Chain Prediction , 2002 .

[31]  Z. Xiang,et al.  On the role of the crystal environment in determining protein side-chain conformations. , 2002, Journal of molecular biology.

[32]  N. Grishin,et al.  Side‐chain modeling with an optimized scoring function , 2002, Protein science : a publication of the Protein Society.

[33]  Guoli Wang,et al.  PISCES: a protein sequence culling server , 2003, Bioinform..

[34]  D. Baker,et al.  An orientation-dependent hydrogen bonding potential improves prediction of specificity and structure for proteins and protein-protein complexes. , 2003, Journal of molecular biology.

[35]  Tom L Blundell,et al.  Advantages of fine-grained side chain conformer libraries. , 2003, Protein engineering.

[36]  Hao Chen,et al.  Beyond the rotamer library: Genetic algorithm combined with the disturbing mutation process for upbuilding protein side‐chains , 2003, Proteins.

[37]  D. Baker,et al.  Design of a Novel Globular Protein Fold with Atomic-Level Accuracy , 2003, Science.

[38]  Adrian A Canutescu,et al.  Access the most recent version at doi: 10.1110/ps.03154503 References , 2003 .

[39]  Jeffrey J. Gray,et al.  Protein-protein docking with simultaneous optimization of rigid-body displacement and side-chain conformations. , 2003, Journal of molecular biology.

[40]  A Joshua Wand,et al.  Improved side‐chain prediction accuracy using an ab initio potential energy function and a very large rotamer library , 2004, Protein science : a publication of the Protein Society.

[41]  T. A. Jones,et al.  The Uppsala Electron-Density Server. , 2004, Acta crystallographica. Section D, Biological crystallography.

[42]  D. Baker,et al.  Modeling structurally variable regions in homologous proteins with rosetta , 2004, Proteins.

[43]  David Baker,et al.  Protein Structure Prediction Using Rosetta , 2004, Numerical Computer Methods, Part D.

[44]  Jinbo Xu,et al.  Rapid Protein Side-Chain Packing via Tree Decomposition , 2005, RECOMB.

[45]  Roland L. Dunbrack,et al.  MollDE: a homology modeling framework you can click with , 2005, Bioinform..

[46]  Roland L Dunbrack,et al.  Assessment of fold recognition predictions in CASP6 , 2005, Proteins.

[47]  Mona Singh,et al.  Solving and analyzing side-chain positioning problems using linear and integer programming , 2005, Bioinform..

[48]  Guoli Wang,et al.  PISCES: recent improvements to a PDB sequence culling server , 2005, Nucleic Acids Res..

[49]  J. Maurice Rojas,et al.  Practical conversion from torsion space to Cartesian space for in silico protein synthesis , 2005, J. Comput. Chem..

[50]  C. Camacho,et al.  Modeling side‐chains using molecular dynamics improve recognition of binding region in CAPRI targets , 2005, Proteins.

[51]  Leszek Rychlewski,et al.  FFAS03: a server for profile–profile sequence alignments , 2005, Nucleic Acids Res..

[52]  Jack Snoeyink,et al.  Rotamer-Pair Energy Calculations Using a Trie Data Structure , 2005, WABI.

[53]  Jens Meiler,et al.  ROSETTALIGAND: Protein–small molecule docking with full side‐chain flexibility , 2006, Proteins.

[54]  Roland L. Dunbrack,et al.  Statistical and conformational analysis of the electron density of protein side chains , 2006, Proteins.

[55]  Zoran Obradovic,et al.  Statistical analysis of interface similarity in crystals of homologous proteins. , 2008, Journal of molecular biology.

[56]  Adrian A Canutescu,et al.  SCWRL and MolIDE: computer programs for side-chain conformation prediction and homology modeling , 2008, Nature Protocols.

[57]  Jianpeng Ma,et al.  OPUS‐Rota: A fast and accurate method for side‐chain modeling , 2008, Protein science : a publication of the Protein Society.

[58]  G. Sheldrick A short history of SHELX. , 2008, Acta crystallographica. Section A, Foundations of crystallography.