Dynamic domain threading

A modeling method is described that avoids the need to consider the domain structure of the template used for modeling, and automatically extracts compact fragments of structure that would be of a suitable size to build the model. This aids automation as the size or nature of the template structure can be ignored and does not have to be broken into domain (or multi‐domain) units beforehand. The approach leads to the generation of a large number of models each based on slightly differing domain definitions and this variation was further increased by considering alternative secondary structure predictions. Each model, of which there may be thousands, takes the form of a complete alpha‐carbon trace and some methods (including residue burial) were investigated for their power to discriminate good models from bad models using decoys. The method is also compared to an earlier retroviral capsid modeling problem for which the X‐ray structure is now known. Some potential extensions of the approach to more distant modeling problems are discussed. Proteins 2006. © 2006 Wiley‐Liss, Inc.

[1]  Liam J. McGuffin,et al.  The PSIPRED protein structure prediction server , 2000, Bioinform..

[2]  W. Taylor Protein structure comparison using iterated double dynamic programming , 2008, Protein science : a publication of the Protein Society.

[3]  William R Taylor,et al.  Modelling zinc-binding proteins with GADGET: genetic algorithm and distance geometry for exploring topology. , 2003, Journal of molecular biology.

[4]  David C. Jones,et al.  GenTHREADER: an efficient and reliable protein fold recognition method for genomic sequences. , 1999, Journal of molecular biology.

[5]  William R. Taylor,et al.  The rapid generation of mutation data matrices from protein sequences , 1992, Comput. Appl. Biosci..

[6]  Franca Fraternali,et al.  Parameter optimized surfaces (POPS): analysis of key interactions and conformational changes in the ribosome , 2002, Nucleic Acids Res..

[7]  Chris Sander,et al.  Dali/FSSP classification of three-dimensional protein folds , 1997, Nucleic Acids Res..

[8]  F. Avbelj,et al.  Use of a potential of mean force to analyze free energy contributions in protein folding. , 1992, Biochemistry.

[9]  I. Taylor,et al.  High-resolution structure of a retroviral capsid hexameric amino-terminal domain , 2004, Nature.

[10]  Franca Fraternali,et al.  POPS: a fast algorithm for solvent accessible surface areas at atomic and residue level , 2003, Nucleic Acids Res..

[11]  Kuang Lin,et al.  Threading Using Neural nEtwork (TUNE): the measure of protein sequence-structure compatibility , 2002, Bioinform..

[12]  W R Taylor,et al.  Residual colours: a proposal for aminochromography. , 1997, Protein engineering.

[13]  William R Taylor,et al.  A structural pattern‐based method for protein fold recognition , 2004, Proteins.

[14]  William R Taylor,et al.  A Fourier analysis of symmetry in protein structure. , 2002, Protein engineering.

[15]  D. Eisenberg,et al.  A method to identify protein sequences that fold into a known three-dimensional structure. , 1991, Science.

[16]  William R. Taylor,et al.  A deeply knotted protein structure and how it might fold , 2000, Nature.

[17]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[18]  W R Taylor,et al.  Protein structural domain identification. , 1999, Protein engineering.

[19]  Tim J. P. Hubbard,et al.  SCOP: a structural classification of proteins database , 1998, Nucleic Acids Res..

[20]  J A Bangham,et al.  Data-sieving hydrophobicity plots. , 1988, Analytical biochemistry.

[21]  R A Sayle,et al.  RASMOL: biomolecular graphics for all. , 1995, Trends in biochemical sciences.

[22]  William R. Taylor,et al.  Protein Geometry, Classification, Topology and Symmetry: A Computational Analysis of Structure , 2004 .

[23]  R. Samudrala,et al.  An all-atom distance-dependent conditional probability discriminatory function for protein structure prediction. , 1998, Journal of molecular biology.

[24]  W. Taylor,et al.  Folding polypeptide α‐carbon backbones by distance geometry methods , 1994 .

[25]  D. T. Jones,et al.  A new approach to protein fold recognition , 1992, Nature.

[26]  N. D. Clarke,et al.  Identification of protein folds: Matching hydrophobicity patterns of sequence sets with solvent accessibility patterns of known structures , 1990, Proteins.

[27]  W. Taylor,et al.  Multiple sequence threading: an analysis of alignment quality and stability. , 1997, Journal of molecular biology.

[28]  A. D. McLachlan,et al.  Profile analysis: detection of distantly related proteins. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[29]  I. Jonassen,et al.  Searching the protein structure databank with weak sequence patterns and structural constraints. , 2000, Journal of molecular biology.

[30]  W R Taylor,et al.  Defining linear segments in protein structure. , 2001, Journal of molecular biology.

[31]  D T Jones,et al.  A systematic comparison of protein structure classifications: SCOP, CATH and FSSP. , 1999, Structure.

[32]  I. Jonassen,et al.  Discovery of local packing motifs in protein structures , 1999, Proteins.

[33]  William R. Taylor,et al.  Ab initio modelling of the N-terminal domain of the secretin receptors , 2003, Comput. Biol. Chem..

[34]  F E Cohen,et al.  Novel method for the rapid evaluation of packing in protein structures. , 1990, Journal of molecular biology.

[35]  J. Skolnick,et al.  A distance‐dependent atomic knowledge‐based potential for improved protein structure selection , 2001, Proteins.

[36]  William R Taylor,et al.  Consensus structural models for the amino terminal domain of the retrovirus restriction gene Fv1 and the Murine Leukaemia Virus capsid proteins , 2004, BMC Structural Biology.

[37]  W. Taylor,et al.  Identification of protein sequence homology by consensus template alignment. , 1986, Journal of molecular biology.

[38]  M. Bock,et al.  Retroviral Capsid Determinants of Fv1 NB and NR Tropism , 2004, Journal of Virology.

[39]  M. Sippl Calculation of conformational ensembles from potentials of mean force. An approach to the knowledge-based prediction of local structures in globular proteins. , 1990, Journal of molecular biology.