FlexProt: Alignment of Flexible Protein Structures Without a Predefinition of Hinge Regions

FlexProt is a novel technique for the alignment of flexible proteins. Unlike all previous algorithms designed to solve the problem of structural comparisons allowing hinge-bending motions, FlexProt does not require an a priori knowledge of the location of the hinge(s). FlexProt carries out the flexible alignment, superimposing the matching rigid subpart pairs, and detects the flexible hinge regions simultaneously. A large number of methods are available to handle rigid structural alignment. However, proteins are flexible molecules, which may appear in different conformations. Hence, protein structural analysis requires algorithms that can deal with molecular flexibility. Here, we present a method addressing specifically a flexible protein alignment task. First, the method efficiently detects maximal congruent rigid fragments in both molecules. Transforming the task into a graph theoretic problem, our method proceeds to calculate the optimal arrangement of previously detected maximal congruent rigid fragments. The fragment arrangement does not violate the protein sequence order. A clustering procedure is performed on fragment-pairs which have the same 3-D rigid transformation regardless of insertions and deletions (such as loops and turns) which separate them. Although the theoretical worst case complexity of the algorithm is O(n(6)), in practice FlexProt is highly efficient. It performs a structural comparison of a pair of proteins 300 amino acids long in about seven seconds on a standard desktop PC (400 MHz Pentium II processor with 256MB internal memory). We have performed extensive experiments with the algorithm. An assortment of these results is presented here. FlexProt can be accessed via WWW at bioinfo3d.cs.tau.ac.il/FlexProt/.

[1]  Ruth Nussinov,et al.  Alignment of Flexible Protein Structures , 2000, ISMB.

[2]  W. Kabsch A solution for the best rotation to relate two sets of vectors , 1976 .

[3]  Chris Sander,et al.  Protein folds and families: sequence and structure alignments , 1999, Nucleic Acids Res..

[4]  Berthold K. P. Horn,et al.  Closed-form solution of absolute orientation using unit quaternions , 1987 .

[5]  Thomas H. Cormen,et al.  Introduction to algorithms [2nd ed.] , 2001 .

[6]  William R. Taylor,et al.  Structure Comparison and Structure Patterns , 2000, J. Comput. Biol..

[7]  Jack R. Collins,et al.  Flap opening in HIV-1 protease simulated by ‘activated’ molecular dynamics , 1995, Nature Structural Biology.

[8]  Haim J. Wolfson,et al.  Model-Based Object Recognition by Geometric Hashing , 1990, ECCV.

[9]  H. Wolfson,et al.  Detection of non-topological motifs in protein structures. , 1996, Protein engineering.

[10]  Paul W. Fitzjohn,et al.  Incorporation of flexibility into rigid‐body docking: Applications in rounds 3–5 of CAPRI , 2005, Proteins.

[11]  Dan Gusfield,et al.  Algorithms on Strings, Trees, and Sequences - Computer Science and Computational Biology , 1997 .

[12]  Dan Gusfield Algorithms on Strings, Trees, and Sequences - Computer Science and Computational Biology , 1997 .

[13]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[14]  M. Gerstein,et al.  A database of macromolecular motions. , 1998, Nucleic acids research.

[15]  H. Wolfson,et al.  Multiple diverse ligands binding at a single protein site : A matter of pre-existing populations , 2001 .

[16]  R. Nussinov,et al.  Folding funnels and binding mechanisms. , 1999, Protein engineering.

[17]  M. Sternberg,et al.  The relationship between the flexibility of proteins and their conformational states on forming protein-protein complexes with an application to protein-protein docking. , 2005, Journal of molecular biology.

[18]  Ruth Nussinov,et al.  MUSTA - A General, Efficient, Automated Method for Multiple Structure Alignment and Detection of Common Motifs: Application to Proteins , 2001, J. Comput. Biol..

[19]  Micha Sharir,et al.  Identification of Partially Obscured Objects in Two and Three Dimensions by Matching Noisy Characteristic Curves , 1987 .

[20]  Christian Lemmen,et al.  Computational methods for the structural alignment of molecules , 2000, J. Comput. Aided Mol. Des..

[21]  S J Remington,et al.  A general method to assess similarity of protein structures, with applications to T4 bacteriophage lysozyme. , 1978, Proceedings of the National Academy of Sciences of the United States of America.

[22]  Günther Bauer,et al.  High Resolution X-Ray Diffraction , 1996 .

[23]  H. Wolfson,et al.  Flexible protein alignment and hinge detection , 2002, Proteins.

[24]  M G Rossmann,et al.  Comparison of protein structures. , 1985, Methods in enzymology.

[25]  Ruth Nussinov,et al.  3-D Substructure Matching in Protein Molecules , 1992, CPM.

[26]  Haim J. Wolfson,et al.  Generalizing the generalized hough transform , 1991, Pattern Recognit. Lett..

[27]  Trevor J. Hastie,et al.  3-D curve matching using splines , 1991, J. Field Robotics.

[28]  H Wolfson,et al.  Flexible structural comparison allowing hinge‐bending, swiveling motions , 1999, Proteins.

[29]  K Schulten,et al.  Protein domain movements: detection of rigid domains and visualization of hinges in comparisons of atomic coordinates , 1997, Proteins.

[30]  R Abagyan,et al.  A new method for modeling large‐scale rearrangements of protein domains , 1997, Proteins.

[31]  H. Wolfson,et al.  Structural Comparison Allowing Hinge Bending, Swiveling Motions , 1999 .

[32]  Paul J. Besl,et al.  A Method for Registration of 3-D Shapes , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[33]  C. Sander,et al.  Detection of common three‐dimensional substructures in proteins , 1991, Proteins.

[34]  T. Blundell,et al.  High‐resolution X‐ray diffraction study of the complex between endothiapepsin and an oligopeptide inhibitor: the analysis of the inhibitor binding and description of the rigid body shift in the enzyme. , 1989, The EMBO journal.

[35]  A. Lesk,et al.  Structural mechanisms for domain movements in proteins. , 1994, Biochemistry.

[36]  T. Bhat,et al.  Structure of human cathepsin D: comparison of inhibitor binding and subdomain displacement with other aspartic proteases. , 1995, Advances in experimental medicine and biology.

[37]  R. Nussinov,et al.  Folding and binding cascades: Dynamic landscapes and population shifts , 2008, Protein science : a publication of the Protein Society.

[38]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[39]  R M Stroud,et al.  Domain flexibility in retroviral proteases: structural implications for drug resistant mutations. , 1998, Biochemistry.

[40]  R. Jernigan,et al.  Proteins with similar architecture exhibit similar large-scale dynamic behavior. , 2000, Biophysical journal.

[41]  Kurt Mehlhorn,et al.  The LEDA Platform of Combinatorial and Geometric Computing , 1997, ICALP.

[42]  John W. Erickson,et al.  Conformational switching in an aspartic proteinase , 1998, Nature Structural Biology.

[43]  R. Kelley,et al.  Hinge bending within the cytokine receptor superfamily revealed by the 2.4 Å crystal structure of the extracellular domain of rabbit tissue factor , 1998, Protein science : a publication of the Protein Society.

[44]  C. Sander,et al.  Protein structure comparison by alignment of distance matrices. , 1993, Journal of molecular biology.