ProSMoS server: a pattern-based search using interaction matrix representation of protein structures

Assessing structural similarity and defining common regions through comparison of protein spatial structures is an important task in functional and evolutionary studies of proteins. There are many servers that compare structures and define sub-structures in common between proteins through superposition and closeness of either coordinates or contacts. However, a natural way to analyze a structure for experts working on structure classification is to look for specific three-dimensional (3D) motifs and patterns instead of finding common features in two proteins. Such motifs can be described by the architecture and topology of major secondary structural elements (SSEs) without consideration of subtle differences in 3D coordinates. Despite the importance of motif-based structure searches, currently there is a shortage of servers to perform this task. Widely known TOPS does not fully address this problem, as it finds only topological match but does not take into account other important spatial properties, such as interactions and chirality. Here, we implemented our approach to protein structure pattern search (ProSMoS) as a web-server. ProSMoS converts 3D structure into an interaction matrix representation including the SSE types, handednesses of connections between SSEs, coordinates of SSE starts and ends, types of interactions between SSEs and β-sheet definitions. For a user-defined structure pattern, ProSMoS lists all structures from a database that contain this pattern. ProSMoS server will be of interest to structural biologists who would like to analyze very general and distant structural similarities. The ProSMoS web server is available at: http://prodata.swmed.edu/ProSMoS/.

[1]  Arthur M Lesk,et al.  Contact patterns between helices and strands of sheet define protein folding patterns , 2007, Proteins.

[2]  Liisa Holm,et al.  Searching protein structure databases with DaliLite v.3 , 2008, Bioinform..

[3]  D. O’Leary,et al.  Secondary structure spatial conformation footprint: a novel method for fast protein structure comparison and classification , 2006, BMC Structural Biology.

[4]  J. Rubinstein,et al.  Bacterial polysaccharide co-polymerases share a common framework for control of polymer length , 2008, Nature Structural &Molecular Biology.

[5]  Peter J. Stuckey,et al.  Structural search and retrieval using a tableau representation of protein folding patterns , 2008, Bioinform..

[6]  N. Boutonnet,et al.  Structural classification of alphabetabeta and betabetaalpha supersecondary structure units in proteins. , 1998, Proteins.

[7]  David R. Gilbert,et al.  Protein structure topological comparison, discovery and matching service , 2005, Bioinform..

[8]  Nick V. Grishin,et al.  PALSSE: A program to delineate linear secondary structural elements from protein structures , 2005, BMC Bioinformatics.

[9]  K. Koretke,et al.  A CTP-dependent archaeal riboflavin kinase forms a bridge in the evolution of cradle-loop barrels. , 2007, Structure.

[10]  E. Koonin,et al.  Emergence of diverse biochemical activities in evolutionarily conserved structural scaffolds of proteins. , 2003, Current opinion in chemical biology.

[11]  James E. Bray,et al.  The CATH database: an extended protein family resource for structural and functional genomics , 2003, Nucleic Acids Res..

[12]  Tim J. P. Hubbard,et al.  Data growth and its impact on the SCOP database: new developments , 2007, Nucleic Acids Res..

[13]  Rosemarie Swanson,et al.  Algorithms for Finding the Axis of a Helix: Fast Rotational and Parametric Least-squares Methods , 1996, Comput. Chem..

[14]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[15]  P. Koehl,et al.  Protein structure similarities. , 2001, Current opinion in structural biology.

[16]  Yi Zhong,et al.  Searching for three-dimensional secondary structural patterns in proteins with ProSMoS , 2007, Bioinform..

[17]  Marianne Rooman,et al.  Structural classification of αββ and ββα supersecondary structure units in proteins , 1998 .

[18]  N. Grishin,et al.  Mh1 domain of Smad is a degraded homing endonuclease. , 2001, Journal of molecular biology.

[19]  Frances M. G. Pearl,et al.  The CATH domain structure database. , 2005, Methods of biochemical analysis.

[20]  William R. Taylor,et al.  Structure Comparison and Structure Patterns , 2000, J. Comput. Biol..

[21]  K Henrick,et al.  Electronic Reprint Biological Crystallography Secondary-structure Matching (ssm), a New Tool for Fast Protein Structure Alignment in Three Dimensions Biological Crystallography Secondary-structure Matching (ssm), a New Tool for Fast Protein Structure Alignment in Three Dimensions , 2022 .