A more efficient method for domain repeat detection in WD-40 proteins

Structural biology is a branch of molecular biology and biochemistry, aiming to understand the interaction of molecules like proteins by observing their structures. Crystallization is one of the most widely used methods to identify protein structure, yet it is laborious and time-consuming. Researchers are seeking assistance from computers. This paper implements an improved WDSP program for recognizing and predicting secondary structure of WD40 repeat proteins, which is a large protein family in eukaryotes. The original WDSP works well on predicting WD40 protein structures but it also suffers from low computational efficiency. We propose a more computationally efficient WDSP namely FWDSP by imposing clustering and a specific local searching to the original WDSP. Experiment results on three datasets of WD40 proteins demonstrate the effectiveness and efficiency of FWDSP.

[1]  Burkhard Rost,et al.  PHD - an automatic mail server for protein secondary structure prediction , 1994, Comput. Appl. Biosci..

[2]  J. Gibrat,et al.  GOR method for predicting protein secondary structure from amino acid sequence. , 1996, Methods in enzymology.

[3]  M Ouali,et al.  Cascaded multiple classifiers for secondary structure prediction , 2000, Protein science : a publication of the Protein Society.

[4]  Jérôme Gouzy,et al.  ProDom: Automated Clustering of Homologous Domains , 2002, Briefings Bioinform..

[5]  Robert D. Finn,et al.  The Pfam protein families database , 2004, Nucleic Acids Res..

[6]  Amos Bairoch,et al.  The PROSITE database , 2005, Nucleic Acids Res..

[7]  J. Piškur,et al.  Textbook of Structural Biology , 2009 .

[8]  R. Russell,et al.  WD40 proteins propel cellular networks. , 2010, Trends in biochemical sciences.

[9]  Jaap Heringa,et al.  Protein secondary structure prediction. , 2010, Methods in molecular biology.

[10]  L. Holm,et al.  The Pfam protein families database , 2005, Nucleic Acids Res..

[11]  Daniel W. A. Buchan,et al.  Scalable web services for the PSIPRED Protein Analysis Workbench , 2013, Nucleic Acids Res..

[12]  F. Jiang,et al.  A Method for WD40 Repeat Detection and Secondary Structure Prediction , 2013, PloS one.

[13]  Yang Wang,et al.  WDSPdb: a database for WD40-repeat proteins , 2014, Nucleic Acids Res..