An Innovative Protocol for Comparing Protein Binding Sites via Atomic Grid Maps

This paper deals with a novel computational approach that aims to measure the similarities of protein binding sites through comparison of atomic grid maps. The assessment of structural similarity between proteins is a longstanding goal in biology and in structure-based drug design. Instead of focusing on standard structural alignment techniques, mostly based on superposition of common structural elements, the proposed approach starts from a physicochemical description of the proteins’ binding site. We call these atomic grid maps. These maps are preprocessed to reduce the dimensionality of the data while retaining the relevant information. Then, we devise an alignment-based similarity measure, based on a rigid registration algorithm (the Iterative Closest Point –ICP). The proposed approach, tested on a real dataset involving 22 proteins, has shown encouraging results in comparison with standard procedures.

[1]  Ofer Melnik,et al.  Mixed group ranks: preference and confidence in classifier combination , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  David S. Goodsell,et al.  AutoDock4 and AutoDockTools4: Automated docking with selective receptor flexibility , 2009, J. Comput. Chem..

[3]  Sargur N. Srihari,et al.  Decision Combination in Multiple Classifier Systems , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Josef Kittler,et al.  Combining multiple classifiers by averaging or by multiplying? , 2000, Pattern Recognit..

[5]  Jiri Matas,et al.  On Combining Classifiers , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Emanuele Trucco,et al.  Introductory techniques for 3-D computer vision , 1998 .

[7]  Richard Morphy,et al.  The physicochemical challenges of designing multiple ligands. , 2006, Journal of medicinal chemistry.

[8]  Paul J. Besl,et al.  A Method for Registration of 3-D Shapes , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  N. Mantel The detection of disease clustering and a generalized regression approach. , 1967, Cancer research.

[10]  Haruki Nakamura,et al.  Announcing the worldwide Protein Data Bank , 2003, Nature Structural Biology.

[11]  Arun Ross,et al.  Multimodal biometrics: An overview , 2004, 2004 12th European Signal Processing Conference.

[12]  J. Jung,et al.  Protein structure alignment using environmental profiles. , 2000, Protein engineering.

[13]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[14]  R. Abagyan,et al.  Systematic Exploitation of Multiple Receptor Conformations for Virtual Ligand Screening , 2011, PloS one.

[15]  Robert B. Fisher,et al.  A Comparison of Four Algorithms for Estimating 3-D Rigid Transformations , 1995, BMVC.

[16]  Angelo D. Favia,et al.  Theoretical and computational approaches to ligand-based drug discovery. , 2011, Frontiers in bioscience.

[17]  Gérard G. Medioni,et al.  Object modeling by registration of multiple range images , 1991, Proceedings. 1991 IEEE International Conference on Robotics and Automation.

[18]  Takeshi Kawabata,et al.  MATRAS: a program for protein 3D structure comparison , 2003, Nucleic Acids Res..

[19]  P E Bourne,et al.  Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. , 1998, Protein engineering.

[20]  Janet M. Thornton,et al.  Methods to characterize the structure of enzyme binding sites , 2008 .

[21]  Maxim Totrov,et al.  Ligand binding site superposition and comparison based on Atomic Property Fields: identification of distant homologues, convergent evolution and PDB-wide clustering of binding sites , 2011, BMC Bioinformatics.

[22]  Anil K. Jain,et al.  Clustering ensembles: models of consensus and weak partitions , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Robert P. W. Duin,et al.  Experiments with Classifier Combining Rules , 2000, Multiple Classifier Systems.

[24]  A. D. McLachlan,et al.  Rapid comparison of protein structures , 1982 .

[25]  A. McNaught,et al.  Compendium of chemical terminology. IUPAC recommendations , 1997 .

[26]  Ana L. N. Fred,et al.  Combining multiple clusterings using evidence accumulation , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27]  C. Sander,et al.  Protein structure comparison by alignment of distance matrices. , 1993, Journal of molecular biology.

[28]  Yu Chen,et al.  A novel approach to structural alignment using realistic structural and environmental information , 2005, Protein science : a publication of the Protein Society.

[29]  Jesús Avila,et al.  The role of GSK3 in Alzheimer disease , 2009, Brain Research Bulletin.

[30]  Helen C. Shen,et al.  Personal Verification Using Palmprint and Hand Geometry Biometric , 2003, AVBPA.

[31]  Fabio Roli,et al.  A theoretical and experimental analysis of linear combiners for multiple classifier systems , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32]  Manuele Bicego,et al.  Multimodal Phylogeny for Taxonomy: Integrating Information from nucleotide and amino acid Sequences , 2007, J. Bioinform. Comput. Biol..