A Novel Approach For Classifying Protein Structures Based On Fuzzy Decision Tree

The processes in the organisms are influenced by protein molecules. Based on the functions of proteins, they can be used to design drugs for various diseases. There are methods that can be used to classify protein structures where the classification is manual done by domain experts, but they are not able to provide classification with reasonable speed, require intensive humans' effort and is time consuming. Thus, there is obvious necessity for rapid methods that would afford accurate classification of protein structures in an automated way. In this paper, we introduce an approach for classifying protein structures. First, for each protein its ray based descriptor is extracted, which gives evidence how the backbone of the protein is positioned with respect to the center of the protein. Then, a prediction model is created by using the fuzzy decision tree classifier. For evaluation, we used a part from the SCOP database, which holds information for the classification of proteins gathered in manual way. We present experimental results from the evaluation of the proposed approach.

[1]  Srinivasan Parthasarathy,et al.  A multi-level approach to SCOP fold recognition , 2005, Fifth IEEE Symposium on Bioinformatics and Bioengineering (BIBE'05).

[2]  Georgina Mirceva,et al.  Efficient Approaches for Retrieving Protein Tertiary Structures , 2012, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[3]  S. Bryant,et al.  Threading a database of protein cores , 1995, Proteins.

[4]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[5]  Jinn-Moon Yang,et al.  fastSCOP: a fast web server for recognizing protein structural domains and SCOP superfamilies , 2007, Nucleic Acids Res..

[6]  P E Bourne,et al.  Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. , 1998, Protein engineering.

[7]  S. B. Needleman,et al.  A general method applicable to the search for similarities in the amino acid sequence of two proteins. , 1970, Journal of molecular biology.

[8]  Chris Sander,et al.  The FSSP database: fold classification based on structure-structure alignment of proteins , 1996, Nucleic Acids Res..

[9]  Cezary Z. Janikow,et al.  Fuzzy decision trees: issues and methods , 1998, IEEE Trans. Syst. Man Cybern. Part B.

[10]  J. Ross Quinlan,et al.  Decision trees and decision-making , 1990, IEEE Trans. Syst. Man Cybern..

[11]  C. Sander,et al.  Dali: a network tool for protein structure comparison. , 1995, Trends in biochemical sciences.

[12]  N. Grishin,et al.  COMPASS: a tool for comparison of multiple protein alignments with assessment of statistical significance. , 2003, Journal of molecular biology.

[13]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[14]  Osvaldo Olmea,et al.  MAMMOTH (Matching molecular models obtained from theory): An automated method for model comparison , 2002, Protein science : a publication of the Protein Society.

[15]  David C. Jones,et al.  CATH--a hierarchic classification of protein domain structures. , 1997, Structure.

[16]  C. Sander,et al.  Protein structure comparison by alignment of distance matrices. , 1993, Journal of molecular biology.

[17]  Yuan Qi,et al.  SCOPmap: Automated assignment of protein structures to evolutionary superfamilies , 2004, BMC Bioinformatics.

[18]  Chi-Ren Shyu,et al.  Efficient protein tertiary structure retrievals and classifications using content based comparison algorithms , 2007 .

[19]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[20]  Jinn-Moon Yang,et al.  Protein structure database search and evolutionary classification , 2006, Nucleic acids research.