ROSS: A rapid protein structure alignment algorithm

Protein structure comparison is very important in bioinformatics as the function of a protein is determined by its structure. Finding relationships between proteins is very necessary. The databases for proteins are growing rapidly. Therefore there is a strong need for rapid algorithms to handle large number of requests made by biologists. This means that protein structure comparison is an active research area. Relative Orientation of Secondary Structure (ROSS) algorithm for protein structure alignment is proposed. ROSS is a sequential (preserving order) sequence-independent structure alignment algorithm. This work proposes a data reduction and a hierarchical approach for the structure alignment of proteins. The data reduction is done in the pre-processing stage to compute small descriptors for proteins and store them in a repository. This reduction speeds up the process of aligning the structure of a query protein to a large number of proteins. The comparison process is performed hierarchically. The reduced representation depends on the vectors representing the axes of the Secondary Structure Elements (SSE's) and the inter-direction between their centroids. The accuracy of the algorithm is similar to the well known algorithms Secondary-Structure Matching (SSM) and Vector Alignment Search Tool (VAST), used through their websites. The speed of the algorithm is around fifty times that of TM Align algorithm. This speed enables searching for similarities within large datasets instantly. Moreover, the algorithm achieves reduction ratio around 30:1.

[1]  Jafar Razmara,et al.  TS-AMIR: a topology string alignment method for intensive rapid protein structure comparison , 2012, Algorithms for Molecular Biology.

[2]  P E Bourne,et al.  Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. , 1998, Protein engineering.

[3]  Adam Godzik,et al.  Flexible structure alignment by chaining aligned fragment pairs allowing twists , 2003, ECCB.

[4]  Liisa Holm,et al.  Advances and pitfalls of protein structural alignment. , 2009, Current opinion in structural biology.

[5]  J F Gibrat,et al.  Surprising similarities in structure comparison. , 1996, Current opinion in structural biology.

[6]  J. Skolnick,et al.  TM-align: a protein structure alignment algorithm based on the TM-score , 2005, Nucleic acids research.

[7]  Andrew J. Martin,et al.  The ups and downs of protein topology; rapid comparison of protein structure. , 2000, Protein engineering.

[8]  Douglas L. Brutlag,et al.  Hierarchical Protein Structure Superposition Using Both Secondary Structure and Atomic Representations , 1997, ISMB.

[9]  William R Taylor,et al.  Protein fold comparison by the alignment of topological strings. , 2003, Protein engineering.

[10]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[11]  C. Sander,et al.  Protein structure comparison by alignment of distance matrices. , 1993, Journal of molecular biology.

[12]  Dmitrij Frishman,et al.  STRIDE: a web server for secondary structure assignment from known atomic coordinates of proteins , 2004, Nucleic Acids Res..

[13]  K Henrick,et al.  Electronic Reprint Biological Crystallography Secondary-structure Matching (ssm), a New Tool for Fast Protein Structure Alignment in Three Dimensions Biological Crystallography Secondary-structure Matching (ssm), a New Tool for Fast Protein Structure Alignment in Three Dimensions , 2022 .