P3S: Protein Structure Similarity Search

Similarity search in protein structure databases is an important task of computational biology. To reduce the time required to search for similar structures, indexing techniques are being often introduced. However, as the indexing phase is computationally very expensive, it becomes useful only when a large number of searches are expected (so that the expensive indexing cost is amortized by cheaper search cost). This is a typical situation for a public similarity search service. In this article we introduce the P3S web application (http://siret.cz/p3s) allowing, given a query structure, to identify the set of the most similar structures in a database. The result set can be browsed interactively, including visual inspection of the structure superposition, or it can be downloaded as a zip archive. P3S employs the SProt similarity measure and an indexing technique based on the LAESA method, both introduced recently by our group. Together with the measure and the index, the method presents an effective and efficient tool for querying protein structure databases.

[1]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[2]  Didier Rognan,et al.  sc-PDB: a database for identifying variations and multiplicity of 'druggable' binding sites in proteins , 2011, Bioinform..

[3]  David C. Jones,et al.  CATH--a hierarchic classification of protein domain structures. , 1997, Structure.

[4]  David Hoksza,et al.  SProt: sphere-based protein structure similarity algorithm , 2011, Proteome Science.

[5]  Ismail Hakki Toroslu,et al.  Integrated search and alignment of protein structures , 2008, Bioinform..

[6]  Chi-Ching Lee,et al.  iSARST: an integrated SARST web server for rapid protein structural similarity searches , 2009, Nucleic Acids Res..

[7]  Dusanka Janezic,et al.  ProBiS: a web server for detection of structurally similar protein binding sites , 2010, Nucleic Acids Res..

[8]  Zong Hong Zhang,et al.  deconSTRUCT: general purpose protein database search on the substructure level , 2010, Nucleic Acids Res..

[9]  Jinn-Moon Yang,et al.  Kappa-alpha plot derived structural alphabet and BLOSUM-like substitution matrix for rapid search of protein structure database , 2007, Genome Biology.

[10]  Liisa Holm,et al.  Dali server: conservation mapping in 3D , 2010, Nucleic Acids Res..

[11]  A. Lesk,et al.  The relation between the divergence of sequence and structure in proteins. , 1986, The EMBO journal.

[12]  K Henrick,et al.  Electronic Reprint Biological Crystallography Secondary-structure Matching (ssm), a New Tool for Fast Protein Structure Alignment in Three Dimensions Biological Crystallography Secondary-structure Matching (ssm), a New Tool for Fast Protein Structure Alignment in Three Dimensions , 2022 .

[13]  Zhiping Weng,et al.  FAST: A novel protein structure alignment algorithm , 2004, Proteins.

[14]  Kian-Lee Tan,et al.  Rapid 3D protein structure database searching using information retrieval techniques , 2004, Bioinform..

[15]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[16]  James Reinders,et al.  Intel threading building blocks - outfitting C++ for multi-core processor parallelism , 2007 .

[17]  J F Gibrat,et al.  Surprising similarities in structure comparison. , 1996, Current opinion in structural biology.

[18]  Jinn-Moon Yang,et al.  Protein structure database search and evolutionary classification , 2006, Nucleic acids research.

[19]  Patrice Koehl,et al.  The ASTRAL Compendium in 2004 , 2003, Nucleic Acids Res..

[20]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[21]  Chih-Hung Chang,et al.  Protein structural similarity search by Ramachandran codes , 2007, BMC Bioinformatics.

[22]  P E Bourne,et al.  Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. , 1998, Protein engineering.