Fast Hinge Detection Algorithms for Flexible Protein Structures

Analysis of conformational changes is one of the keys to the understanding of protein functions and interactions. For the analysis, we often compare two protein structures, taking flexible regions like hinge regions into consideration. The Root Mean Square Deviation (RMSD) is the most popular measure for comparing two protein structures, but it is only for rigid structures without hinge regions. In this paper, we propose a new measure called RMSD considering hinges (RMSDh) and its variant RMSDh(k) for comparing two flexible proteins with hinge regions. We also propose novel efficient algorithms for computing them, which can detect the hinge positions at the same time. The RMSDh is suitable for cases where there is one small hinge region in each of the two target structures. The new algorithm for computing the RMSDh runs in linear time, which is the same as the time complexity for computing the RMSD and is faster than any of previous algorithms for hinge detection. The RMSDh(k) is designed for comparing structures with more than one hinge region. The RMSDh(k) measure considers at most k small hinge region, i.e., the RMSDh(k) value should be small if the two structures are similar except for at most k hinge regions. To compute the value, we propose an O(kn2)-time and O(n)-space algorithm based on a new dynamic programming technique. With the same computational time and space, we can enumerate the predicted hinge positions. We also test our algorithms against actual flexible protein structures, and show that the hinge positions can be correctly detected by our algorithms.

[1]  J. Mccammon,et al.  HIV‐1 protease molecular dynamics of a wild‐type and of the V82F/I84V mutant: Possible contributions to drug resistance and a potential new target site for drugs , 2004, Protein science : a publication of the Protein Society.

[2]  Ruth Nussinov,et al.  HingeProt: Automated prediction of hinges in protein structures , 2008, Proteins.

[3]  M Gerstein,et al.  Analysis of protein loop closure. Two types of hinges produce one motion in lactate dehydrogenase. , 1991, Journal of molecular biology.

[4]  Shoshana J. Wodak,et al.  Advanced pairwise structure alignments of proteins and analysis of conformational changes , 2002, Bioinform..

[5]  Mark Gerstein,et al.  FlexOracle: predicting flexible hinges by identification of stable domains , 2007, BMC Bioinformatics.

[6]  Heather A Carlson,et al.  Gaussian-weighted RMSD superposition of proteins: a structural comparison for flexible proteins and predicted protein structures. , 2006, Biophysical journal.

[7]  W. Kabsch A solution for the best rotation to relate two sets of vectors , 1976 .

[8]  Adam Godzik,et al.  Flexible structure alignment by chaining aligned fragment pairs allowing twists , 2003, ECCB.

[9]  Micha Sharir,et al.  Identification of Partially Obscured Objects in Two and Three Dimensions by Matching Noisy Characteristic Curves , 1987 .

[10]  K. S. Arun,et al.  Least-Squares Fitting of Two 3-D Point Sets , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Tetsuo Shibuya,et al.  Efficient Substructure RMSD Query Algorithms , 2007, J. Comput. Biol..

[12]  Mark Gerstein,et al.  The Database of Macromolecular Motions: new features added at the decade mark , 2005, Nucleic Acids Res..

[13]  Richard A. Lee,et al.  A comprehensive and non-redundant database of protein domain movements , 2005, Bioinform..

[14]  Christian Lemmen,et al.  Computational methods for the structural alignment of molecules , 2000, J. Comput. Aided Mol. Des..

[15]  I. Rayment,et al.  Three-dimensional structure of adenosylcobinamide kinase/adenosylcobinamide phosphate guanylyltransferase from Salmonella typhimurium determined to 2.3 A resolution,. , 1998, Biochemistry.

[16]  Mark Gerstein,et al.  Hinge Atlas: relating protein sequence to sites of structural flexibility , 2007, BMC Bioinformatics.

[17]  Daniel S. Hirschberg,et al.  A linear space algorithm for computing maximal common subsequences , 1975, Commun. ACM.

[18]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[19]  Robert B. Fisher,et al.  Estimating 3-D rigid body transformations: a comparison of four major algorithms , 1997, Machine Vision and Applications.

[20]  K Schulten,et al.  Protein domain movements: detection of rigid domains and visualization of hinges in comparisons of atomic coordinates , 1997, Proteins.

[21]  David Hsu,et al.  Protein Conformational Flexibility Analysis with Noisy Data , 2007, RECOMB.

[22]  H. Wolfson,et al.  From structure to function: methods and applications. , 2005, Current protein & peptide science.

[23]  Mark Gerstein,et al.  MolMovDB: analysis and visualization of conformational change and structural flexibility , 2003, Nucleic Acids Res..

[24]  Ruth Nussinov,et al.  FlexProt: Alignment of Flexible Protein Structures Without a Predefinition of Hinge Regions , 2004, J. Comput. Biol..

[25]  W. Kabsch A discussion of the solution for the best rotation to relate two sets of vectors , 1978 .

[26]  Enoch S. Huang,et al.  Automatic and accurate method for analysis of proteins that undergo hinge-mediated domain and loop movements , 1993, Current Biology.

[27]  M J Rooman,et al.  Automatic analysis of protein conformational changes by multiple linkage clustering. , 1995, Journal of molecular biology.

[28]  D. Jacobs,et al.  Protein flexibility predictions using graph theory , 2001, Proteins.

[29]  David Hsu,et al.  Protein Conformational Flexibility Analysis with Noisy Data , 2007, RECOMB.

[30]  William R. Taylor,et al.  Structure Comparison and Structure Patterns , 2000, J. Comput. Biol..