Pairwise protein structure alignment based on an orientation-independent representation of the backbone geometry

Determining structural similarities between proteins is an important problem since it can help identify functional and evolutionary relationships. In this paper, an algorithm is proposed to align two protein structures. Given the protein backbones, the algorithm finds a rigid motion of one backbone onto the other such that large substructures are matched. The algorithm uses a representation of the backbones that is independent of their relative orientations in space and applies dynamic programming to this representation to compute an initial alignment, which is then refined iteratively. Experiments indicate that the algorithm is competitive with two well-known algorithms, namely DALI by L. Holm and C. Sander (1993) and LOCK by A.P. Singh and D.L. Brutlag (1997).

[1]  A. M. Lesk,et al.  A toolkit for computational molecular biology. II. On the optimal superposition of two sets of coordinates , 1986 .

[2]  Jon M. Kleinberg,et al.  Fast detection of common geometric substructure in proteins , 1999, J. Comput. Biol..

[3]  C Sander,et al.  Mapping the Protein Universe , 1996, Science.

[4]  William R. Taylor,et al.  Structure Comparison and Structure Patterns , 2000, J. Comput. Biol..

[5]  E. Shakhnovich,et al.  Pseudodihedrals: Simplified protein backbone representation with knowledge‐based energy , 1994, Protein science : a publication of the Protein Society.

[6]  C. Branden,et al.  Introduction to protein structure , 1991 .

[7]  Jon M. Kleinberg,et al.  Fast Detection of Common Geometric Substructure in Proteins , 1999, J. Comput. Biol..

[8]  C. Sander,et al.  Protein structure comparison by alignment of distance matrices. , 1993, Journal of molecular biology.

[9]  T L Blundell,et al.  Phylogenetic relationships from three-dimensional protein structures. , 1990, Methods in enzymology.

[10]  Klara Kedem,et al.  Finding the Consensus Shape for a Protein Family , 2002, SCG '02.

[11]  김동규,et al.  [서평]「Algorithms on Strings, Trees, and Sequences」 , 2000 .

[12]  Philip E. Bourne,et al.  The Protein Data Bank (PDB) | NIST , 2002 .

[13]  W R Taylor,et al.  Protein structure alignment. , 1989, Journal of molecular biology.

[14]  Douglas L. Brutlag,et al.  Hierarchical Protein Structure Superposition Using Both Secondary Structure and Atomic Representations , 1997, ISMB.

[15]  P E Bourne,et al.  Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. , 1998, Protein engineering.

[16]  Ruth Nussinov,et al.  3-D Substructure Matching in Protein Molecules , 1992, CPM.

[17]  Gene H. Golub,et al.  Matrix computations , 1983 .

[18]  Chris Sander,et al.  The FSSP database: fold classification based on structure-structure alignment of proteins , 1996, Nucleic Acids Res..

[19]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[20]  F. Cohen,et al.  A surface of minimum area metric for the structural comparison of proteins. , 1996, Journal of molecular biology.

[21]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[22]  W. Taylor Protein structure comparison using iterated double dynamic programming , 2008, Protein science : a publication of the Protein Society.