Exploring Protein Architecture using 3D Shape-based Signatures

Consider the scenario where, for a prescription drug designed to treat a terminal illness, a particular protein has been successfully identified as a crucial, beneficial component in the drug compound. However, this protein has contra-indications and causes severe adverse effects in a certain subset of the population. If another protein from the same family, with similar structure and functionality, but without these adverse effects, can be found, the subsequent modification of the harmful drug has obvious benefits. This paper describes a new indexing and similarity search system to retrieve such protein structure family members, based on their 3D shape. Our approach is translation, scale and rotation invariant, which eliminates the need for prior structure alignment. Our experimental evaluation against seven (7) diverse protein families indicate that our system accurately and precisely locate all members of a family. We further illustrate this by showing that our system precisely retrieves the Homo Sapiens Hemoglobin family members, against a database containing 26,000 protein structures.

[1]  T. Ohkawa,et al.  A method of comparing protein structures based on matrix representation of secondary structure pairwise topology , 1999, Proceedings 1999 International Conference on Information Intelligence and Systems (Cat. No.PR00446).

[2]  Tim J. P. Hubbard,et al.  SCOP: a Structural Classification of Proteins database , 1999, Nucleic Acids Res..

[3]  Chi-Ren Shyu,et al.  A Fast Protein Structure Retrieval System Using Image-Based Distance Matrices and Multidimensional Index , 2004, BIBE.

[4]  Bing-Yu Chen,et al.  A web-based three-dimensional protein retrieval system by matching visual similarity , 2005, Bioinform..

[5]  Saiful Akbar,et al.  Exploiting Geometrical Properties on Protein Similarity Search , 2006, 17th International Workshop on Database and Expert Systems Applications (DEXA'06).

[6]  Sung Hee Park,et al.  A Protein Structure Retrieval System Using 3D Edge Histogram , 2004 .

[7]  Kenji Inoue,et al.  Logical cluster construction in a grid environment for similar protein retrieval , 2004, 2004 International Symposium on Applications and the Internet Workshops. 2004 Workshops..

[8]  Ming Ouhyoung,et al.  A web-based protein retrieval system by matching visual similarity , 2005, Conference, Emerging Information Technology 2005..