Confronting the problem of interconnected structural changes in the comparative modeling of proteins

Comparative models of three proteins have been built using a variety of computational methods, heavily supplemented by visual inspection. We consider the accuracy obtained to be worse than expected. A careful analysis of the models shows that a major reason for the poor results is the interconnectedness of the structural differences between the target proteins and the template structures they were modeled from. Side chain conformations are often determined by details of the structure remote in the sequence, and can be influenced by relatively small main chain changes. Almost all of the regions of substantial main chain conformational change interact with at least one other such region, so that they often cannot be modeled independently. Visual inspection is sometimes effective in correcting errors in sequence alignment and in spotting when an alternative template structure is more appropriate. We expect some improvements in the near future through the development of structure‐based sequence alignment tools, side chain interconnectedness rotamer choice algorithms, and a better understanding of the context sensitivity of conformational features. © 1995 Wiley‐Liss, Inc.

[1]  U. Pieper,et al.  Structural evidence for the evolutionary divergence of mycoplasma from gram-positive bacteria: the histidine-containing phosphocarrier protein. , 1995, Structure.

[2]  T. A. Jones,et al.  Crystal structures of cellular retinoic acid binding proteins I and II in complex with all-trans-retinoic acid and a synthetic retinoid. , 1995, Structure.

[3]  J. Moult,et al.  Ab initio structure prediction for small polypeptides and protein fragments using genetic algorithms , 1995, Proteins.

[4]  C. Sander,et al.  Database algorithm for generating protein backbone and side-chain co-ordinates from a C alpha trace application to model building and detection of co-ordinate errors. , 1991, Journal of molecular biology.

[5]  D A Agard,et al.  Modeling side-chain conformation for homologous proteins using an energy-based rotamer search. , 1993, Journal of molecular biology.

[6]  A. Mclachlan Gene duplications in the structural evolution of chymotrypsin. , 1979, Journal of molecular biology.

[7]  J Moult,et al.  Analysis of the steric strain in the polypeptide backbone of protein molecules , 1991, Proteins.

[8]  M. James,et al.  A critical assessment of comparative molecular modeling of tertiary structures of proteins * , 1995, Proteins.

[9]  J Moult,et al.  An analysis of protein folding pathways. , 1991, Biochemistry.

[10]  A C Martin,et al.  Modeling antibody hypervariable loops: a combined algorithm. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[11]  J. Wootton,et al.  Construction of validated, non-redundant composite protein sequence databases. , 1990, Protein engineering.

[12]  K. Fidelis,et al.  Comparison of systematic search and database methods for constructing segments of protein structure. , 1994, Protein engineering.

[13]  M. Karplus,et al.  Prediction of the folding of short polypeptide segments by uniform conformational sampling , 1987, Biopolymers.

[14]  J. Risler,et al.  Amino acid substitutions in structurally related proteins. A pattern recognition approach. Determination of a new and efficient scoring matrix. , 1988, Journal of molecular biology.

[15]  G. Barton Protein multiple sequence alignment and flexible pattern matching. , 1990, Methods in enzymology.

[16]  Robert E. Bruccoleri,et al.  Chain closure with bond angle variations , 1985 .

[17]  R J Read,et al.  Critical evaluation of comparative model building of Streptomyces griseus trypsin. , 1984, Biochemistry.

[18]  J. Moult,et al.  An algorithm for determining the conformation of polypeptide segments in proteins by systematic search , 1986, Proteins.

[19]  J. Greer Comparative modeling methods: Application to the family of the mammalian serine proteases , 1990, Proteins.

[20]  D. Lipman,et al.  Rapid and sensitive protein similarity searches. , 1985, Science.

[21]  J. Ponder,et al.  Tertiary templates for proteins. Use of packing criteria in the enumeration of allowed sequences for different structural classes. , 1987, Journal of molecular biology.

[22]  M. Sternberg,et al.  A strategy for the rapid multiple alignment of protein sequences. Confidence levels from tertiary structure comparisons. , 1987, Journal of molecular biology.