Classification of 3D Proteins Based on a Structure Information Feature

In molecular biology, classifying 3D protein structures is an important problem. In this paper, taking a geometric view, we propose a feature called SID (DI, structure information dimension) that describes 3D protein structures for similarity search and classification. The approach reduces the 3D protein structure matching problem to a comparison of probability distributions. Experimental results show that the proposed approach is effective for pre-classification of 3D proteins and requires less computation and storage than existing methods. The proposed feature is invariant to translation, rotation, and scaling of 3D protein structures.
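To make the idea of reducing structure matching to a comparison of probability distributions concrete, the following is a minimal sketch. It does not implement the SID feature, whose definition is not given here. Instead it substitutes a simpler stand-in descriptor: a normalized histogram of pairwise atom distances. The function names (`distance_histogram`, `l1_distance`, `transform`), the bin count, and the synthetic point sets are all assumptions made for illustration. The stand-in shares the invariance property claimed for the feature: translating, rotating, or uniformly scaling the coordinates leaves the histogram essentially unchanged.

```python
import math
import random


def distance_histogram(points, bins=16):
    """Stand-in descriptor (NOT the paper's SID): a probability
    distribution over pairwise distances, each divided by the largest
    pairwise distance so the result does not depend on scale."""
    dists = [math.dist(p, q)
             for i, p in enumerate(points)
             for q in points[i + 1:]]
    d_max = max(dists)
    hist = [0] * bins
    for d in dists:
        # Clamp so that d == d_max falls in the last bin.
        idx = min(int(d / d_max * bins), bins - 1)
        hist[idx] += 1
    return [count / len(dists) for count in hist]


def l1_distance(p, q):
    """L1 distance between two discrete distributions of equal length."""
    return sum(abs(a - b) for a, b in zip(p, q))


def transform(points, angle, scale, shift):
    """Rotate about the z-axis, scale uniformly, then translate."""
    c, s = math.cos(angle), math.sin(angle)
    return [(scale * (c * x - s * y) + shift[0],
             scale * (s * x + c * y) + shift[1],
             scale * z + shift[2])
            for x, y, z in points]


# Hypothetical data: a random point cloud standing in for atom coordinates.
random.seed(0)
structure_a = [(random.gauss(0, 1), random.gauss(0, 1), random.gauss(0, 1))
               for _ in range(60)]
# The same structure after rotation, scaling, and translation.
structure_b = transform(structure_a, angle=0.7, scale=2.5, shift=(10.0, -3.0, 4.0))
# A differently shaped structure: points along a straight line.
structure_c = [(float(i), 0.0, 0.0) for i in range(60)]

hist_a = distance_histogram(structure_a)
hist_b = distance_histogram(structure_b)
hist_c = distance_histogram(structure_c)

print("same shape, transformed:", l1_distance(hist_a, hist_b))
print("different shape:        ", l1_distance(hist_a, hist_c))
```

Each descriptor is a fixed-length vector regardless of the number of atoms, so comparing two structures takes a number of operations proportional to the bin count rather than requiring a structural alignment. That is one plausible reading of the reduced computation and storage cost mentioned above, though the actual costs depend on the SID definition.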
