Shape modeling and matching in identifying 3D protein structures

In this paper, we describe a novel geometric approach in the process of recovering 3D protein structures from scalar volumes. The input to our method is a sequence of @a-helices that make up a protein, and a low-resolution protein density volume where possible locations of @a-helices have been detected. Our task is to identify the correspondence between the two sets of helices, which will shed light on how the protein folds in space. The central theme of our approach is to cast the correspondence problem as that of shape matching between the 3D volume and the 1D sequence. We model both shapes as attributed relational graphs, and formulate a constrained inexact graph matching problem. To compute the matching, we developed an optimal algorithm based on the A*-search with several choices of heuristic functions. As demonstrated in a suite of synthetic and authentic inputs, the shape-modeling approach is capable of identifying helix correspondences in noise-abundant volumes at high accuracy with minimal or no user intervention.

[1]  Andrew K. C. Wong,et al.  An algorithm for graph optimal monomorphism , 1990, IEEE Trans. Syst. Man Cybern..

[2]  Ali Shokoufandeh,et al.  Retrieving Articulated 3-D Models Using Medial Surfaces and Their Graph Spectra , 2005, EMMCVPR.

[3]  Daniel Cohen-Or,et al.  Salient geometric features for partial shape matching and similarity , 2006, TOGS.

[4]  Jianpeng Ma,et al.  Determining protein topology from skeletons of secondary structures. , 2005, Journal of molecular biology.

[5]  Joan Serrat,et al.  Multilocal Creaseness Based on the Level-Set Extrinsic Curvature , 2000, Comput. Vis. Image Underst..

[6]  Mario Vento,et al.  Thirty Years Of Graph Matching In Pattern Recognition , 2004, Int. J. Pattern Recognit. Artif. Intell..

[7]  Radu Horaud,et al.  Symbolic image matching by simulated annealing , 1990, BMVC.

[8]  Yang Zhang,et al.  The protein structure prediction problem could be solved using the current PDB library. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Attila Kuba,et al.  A Parallel 3D 12-Subiteration Thinning Algorithm , 1999, Graph. Model. Image Process..

[10]  A. Sali 100,000 protein structures for the biologist , 1998, Nature Structural Biology.

[11]  M. Dhome,et al.  Inexact matching using neural networks , 1994 .

[12]  M J Sippl,et al.  Who solved the protein folding problem? , 1999, Structure.

[13]  Bert Jüttler,et al.  Geometric Modeling and Processing , 2009, Comput. Aided Geom. Des..

[14]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[15]  Kuo-Chin Fan,et al.  Genetic-based search for error-correcting graph isomorphism , 1997, IEEE Trans. Syst. Man Cybern. Part B.

[16]  Zeyun Yu,et al.  A Structure Tensor Approach for 3D Image Skeletonization: Applications in Protein Secondary Structure Analysis , 2006, 2006 International Conference on Image Processing.

[17]  Horst Bunke,et al.  A New Algorithm for Error-Tolerant Subgraph Isomorphism Detection , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  M. Baker,et al.  Electron cryomicroscopy of biological machines at subnanometer resolution. , 2005, Structure.

[19]  Horst Bunke,et al.  Error Correcting Graph Matching: On the Influence of the Underlying Cost Function , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  J. Frank,et al.  The ribosome at improved resolution: new techniques for merging and orientation refinement in 3D cryo-electron microscopy of biological particles. , 1994, Ultramicroscopy.

[21]  Gabriella Sanniti di Baja,et al.  Computing skeletons in three dimensions , 1999, Pattern Recognit..

[22]  Horst Bunke,et al.  Recent Advances in Graph Matching , 1997, Int. J. Pattern Recognit. Artif. Intell..

[23]  Helen M Berman,et al.  Large macromolecular complexes in the Protein Data Bank: a status report. , 2005, Structure.

[24]  King-Sun Fu,et al.  A distance measure between attributed relational graphs for pattern recognition , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[25]  A. Sali,et al.  Statistical potential for assessment and prediction of protein structures , 2006, Protein science : a publication of the Protein Society.

[26]  Azriel Rosenfeld,et al.  A Min-Max Medial Axis Transformation , 1981, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  Giovanni Soda,et al.  Exploiting the past and the future in protein secondary structure prediction , 1999, Bioinform..

[28]  Ming Ouhyoung,et al.  On Visual Similarity Based 3D Model Retrieval , 2003, Comput. Graph. Forum.

[29]  Thomas A. Funkhouser,et al.  Partial matching of 3D shapes with priority-driven search , 2006, SGP '06.

[30]  David Baker,et al.  Protein Structure Prediction Using Rosetta , 2004, Numerical Computer Methods, Part D.

[31]  Gilles Bertrand,et al.  A parallel thinning algorithm for medial surfaces , 1995, Pattern Recognit. Lett..

[32]  William J. Christmas,et al.  Structural Matching in Computer Vision Using Probabilistic Relaxation , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[33]  Robert M. Haralick,et al.  Structural Descriptions and Inexact Matching , 1981, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  King-Sun Fu,et al.  Error-Correcting Isomorphisms of Attributed Relational Graphs for Pattern Analysis , 1979, IEEE Transactions on Systems, Man, and Cybernetics.

[35]  M. Baker,et al.  Bridging the information gap: computational tools for intermediate resolution structure interpretation. , 2001, Journal of molecular biology.

[36]  Gilles Bertrand,et al.  Topological operators for grayscale image processing , 2001, J. Electronic Imaging.

[37]  Č. Venclovas,et al.  Comparative modeling in CASP6 using consensus approach to template selection, sequence‐structure alignment, and structure assessment , 2005, Proteins.

[38]  John P. Overington,et al.  Derivation of rules for comparative protein modeling from a database of protein structure alignments , 1994, Protein science : a publication of the Protein Society.

[39]  Manfred J. Sippl,et al.  Boltzmann's principle, knowledge-based mean fields and protein folding. An approach to the computational determination of protein structures , 1993, J. Comput. Aided Mol. Des..

[40]  John Moult,et al.  A decade of CASP: progress, bottlenecks and prognosis in protein structure prediction. , 2005, Current opinion in structural biology.

[41]  Gabriella Sanniti di Baja,et al.  Curve skeletonization of surface-like objects in 3D images guided by voxel classification , 2002, Pattern Recognit. Lett..

[42]  M. Baker,et al.  Identification of secondary structure elements in intermediate-resolution density maps. , 2007, Structure.

[43]  P. Foggia,et al.  Performance evaluation of the VF graph matching algorithm , 1999, Proceedings 10th International Conference on Image Analysis and Processing.

[44]  Thomas A. Funkhouser,et al.  The Princeton Shape Benchmark , 2004, Proceedings Shape Modeling Applications, 2004..

[45]  Julian R. Ullmann,et al.  An Algorithm for Subgraph Isomorphism , 1976, J. ACM.

[46]  Nils J. Nilsson,et al.  Principles of Artificial Intelligence , 1980, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[47]  Matthew L. Baker,et al.  Computing a Family of Skeletons of Volumetric Models for Shape Description , 2006, GMP.

[48]  Michael Garland,et al.  Curvature maps for local shape comparison , 2005, International Conference on Shape Modeling and Applications 2005 (SMI' 05).

[49]  Horst Bunke,et al.  Inexact graph matching for structural pattern recognition , 1983, Pattern Recognit. Lett..

[50]  Sven J. Dickinson,et al.  Skeleton based shape matching and retrieval , 2003, 2003 Shape Modeling International..

[51]  Radu Horaud,et al.  Stereo Correspondence Through Feature Grouping and Maximal Cliques , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[52]  Henrik I. Christensen,et al.  Pattern Recognition in Practice IV: Multiple Paradigms, Comparative Studies and Hybrid Systems , 1994 .