Prediction of protein side-chain rotamers from a backbone-dependent rotamer library: a new homology modeling tool.

Modeling by homology is the most accurate computational method for translating an amino acid sequence into a protein structure. Homology modeling can be divided into two sub-problems: placing the polypeptide backbone and adding the side-chains. We present a method for rapidly predicting the conformations of protein side-chains, starting from main-chain coordinates alone. The method uses fewer than ten rotamers per residue from a backbone-dependent rotamer library and a search to remove steric conflicts. The method was first tested on 299 high-resolution crystal structures by rebuilding side-chains onto the experimentally determined backbone structures. A total of 77% of chi1 and 66% of chi(1 + 2) dihedral angles were predicted within 40 degrees of their crystal structure values. We then tested the method on the entire database of known structures in the Protein Data Bank. The predictive accuracy of the algorithm was strongly correlated with the resolution of the structures. To simulate a realistic homology modeling problem, pairs of structures sharing between 30% and 90% sequence identity were identified, and 9424 homology models were created using three different modeling strategies. One strategy resulted in 82% of chi1 and 72% of chi(1 + 2) dihedral angles predicted within 40 degrees of the target crystal structure values, suggesting that the backbone movements associated with this degree of sequence identity are not large enough to disrupt the predictive ability of our method for non-native backbones. These results compare favorably with existing methods over a comprehensive data set.
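The accuracy figures quoted above count a dihedral angle as correct when it lies within 40 degrees of its crystal structure value. The sketch below shows one way such a score could be computed; it is an illustration under stated assumptions, not the authors' code. The function names, the input layout (one `(chi1, chi2)` pair per residue, with `chi2` set to `None` for side-chains that lack it), the inclusive cutoff, and the reading of "chi(1 + 2)" as both angles being correct for the same residue are all assumptions introduced here.

```python
def angular_difference(a, b):
    """Smallest absolute difference between two angles in degrees (0 to 180).

    Dihedral angles are periodic, so 175 and -170 differ by 15, not 345.
    """
    d = abs(a - b) % 360.0
    return 360.0 - d if d > 180.0 else d


def chi_accuracy(predicted, observed, tolerance=40.0):
    """Return (chi1 fraction correct, chi(1 + 2) fraction correct).

    predicted, observed: equal-length lists of (chi1, chi2) tuples in degrees.
    chi2 may be None for residues without a second side-chain dihedral.
    Assumption: an angle is correct if it is within `tolerance` (inclusive),
    and chi(1 + 2) is correct only if both chi1 and chi2 are correct.
    """
    chi1_hits = chi1_total = 0
    chi12_hits = chi12_total = 0
    for (p1, p2), (o1, o2) in zip(predicted, observed):
        chi1_correct = angular_difference(p1, o1) <= tolerance
        chi1_total += 1
        chi1_hits += int(chi1_correct)
        if p2 is not None and o2 is not None:
            chi2_correct = angular_difference(p2, o2) <= tolerance
            chi12_total += 1
            chi12_hits += int(chi1_correct and chi2_correct)
    chi1_fraction = chi1_hits / chi1_total if chi1_total else None
    chi12_fraction = chi12_hits / chi12_total if chi12_total else None
    return chi1_fraction, chi12_fraction


# Made-up angles for four residues, for illustration only.
predicted = [(-60.0, 180.0), (60.0, None), (175.0, 70.0), (-65.0, -60.0)]
observed = [(-70.0, 170.0), (180.0, None), (-170.0, 65.0), (-60.0, 90.0)]
chi1_fraction, chi12_fraction = chi_accuracy(predicted, observed)
print(chi1_fraction, chi12_fraction)
```

The periodic difference matters most for the trans rotamer near 180 degrees, where a prediction of 175 and an observed value of -170 describe nearly the same conformation.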
