Fast Computation of High-resolution Solvent Excluded Protein Surface with OpenMP

The solvent-excluded surface of proteins is extremely useful when studying their properties and interactions as it represents the portion of the outer protein contour that is available to interact with the solvent and other molecules. Given their simplicity and ability to represent geometrical and physico- chemical properties of proteins, voxelised surface representations have received a lot of interest in bioinformatics and computational biology applications such as protein-protein docking, interaction interface prediction and ligand-binding pocket prediction. Computing voxelised surfaces for large proteins can be challenging, as space-demanding data structures with associated high computational costs are required. In this paper we present a fast, OpenMP-based parallel algorithm for the computation of high-resolution voxelised solvent-excluded protein surfaces. The methodology is based on a region-growing implementation of the approximate Euclidean Distance Transform algorithm with Hierarchical Queues. The geometrical relationship between the solvent-accessible and solvent-excluded surfaces allows us to obtain the latter very efficiently by computing distance map values only for a small subset of the overall voxels representing the protein. The algorithm computes the contribution to the overall outer surface for each atom in parallel. The proposed methodology was experimentally compared to two previous MPI- based parallel implementations showing overall better speedup and efficiency metrics as well as lower surface computation times.

[1]  Chandrajit L. Bajaj,et al.  Quality meshing of implicit solvation models of biomolecular structures , 2006, Comput. Aided Geom. Des..

[2]  G. Klebe,et al.  A new method to detect related function among proteins independent of sequence and fold homology. , 2002, Journal of molecular biology.

[3]  Scott Grandison,et al.  The Application of 3D Zernike Moments for the Description of "Model-Free" Molecular Structure, Functional Motion, and Structural Reliability , 2009, J. Comput. Biol..

[4]  Bin Li,et al.  Characterization of local geometry of protein surfaces with the visibility criterion , 2008, Proteins.

[5]  F. Ferrè,et al.  Protein surface similarities: a survey of methods to describe and compare protein surfaces , 2000, Cellular and Molecular Life Sciences CMLS.

[6]  Carlo Ferrari,et al.  Parallel Computation of Voxelized Macromolecular Surfaces by Spatial Slicing , 2015, 2015 IEEE Trustcom/BigDataSE/ISPA.

[7]  David W Ritchie,et al.  Recent progress and future directions in protein-protein docking. , 2008, Current protein & peptide science.

[8]  M. Schmid Structural proteomics : the potential of high-throughput structure determination , 2002 .

[9]  M. L. Connolly Analytical molecular surface calculation , 1983 .

[10]  Daisuke Kihara,et al.  Molecular surface representation using 3D Zernike descriptors for protein shape comparison and docking. , 2011, Current protein & peptide science.

[11]  M. Spagnuolo,et al.  Between algorithm and model: different Molecular Surface definitions for the Poisson-Boltzmann based electrostatic characterization of biomolecules in solution. , 2013, Communications in computational physics.

[12]  Carlo Ferrari,et al.  Computing Discrete Fine-Grained Representations of Protein Surfaces , 2015, CIBB.

[13]  Fei He,et al.  A rapid 3D seed-filling algorithm based on scan slice , 2010, Comput. Graph..

[14]  Jean-Claude Spehner,et al.  Fast and robust computation of molecular surfaces , 1995, SCG '95.

[15]  D. Moras,et al.  Defining and characterizing protein surface using alpha shapes , 2009, Proteins.

[16]  B. Lee,et al.  The interpretation of protein structures: estimation of static accessibility. , 1971, Journal of molecular biology.

[17]  Sheng-You Huang,et al.  Exploring the potential of global protein-protein docking: an overview and critical assessment of current programs for automatic ab initio docking. , 2015, Drug discovery today.

[18]  K. Kinoshita,et al.  Identification of the ligand binding sites on the molecular surface of proteins , 2005, Protein science : a publication of the Protein Society.

[19]  Daisuke Kihara,et al.  PL-PatchSurfer: A Novel Molecular Local Surface-Based Method for Exploring Protein-Ligand Interactions , 2014, International journal of molecular sciences.

[20]  Shuangye Yin,et al.  Fast screening of protein surfaces using geometric invariant fingerprints , 2009, Proceedings of the National Academy of Sciences.

[21]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[22]  B. Li,et al.  Rapid comparison of properties on protein surface , 2008, Proteins.

[23]  M. Rask-Andersen,et al.  Trends in the exploitation of novel drug targets , 2011, Nature Reviews Drug Discovery.

[24]  Olivier Cuisenaire,et al.  Region Growing Euclidean Distance Transforms , 1997, ICIAP.

[25]  Conrad C. Huang,et al.  Van der Waals Surfaces in Molecular Modeling: Implementation with Real-Time Computer Graphics , 1983, Science.

[26]  Donald A. Adjeroh,et al.  Protein Surface Characterization Using an Invariant Descriptor , 2011, Int. J. Biomed. Imaging.

[27]  Carlo Ferrari,et al.  Identifying Similar Surface Patches on Proteins Using a Spin-Image Surface Representation , 2005, CPM.

[28]  Jack Bresenham,et al.  Algorithm for computer control of a digital plotter , 1965, IBM Syst. J..

[29]  Carlo Ferrari,et al.  Computing voxelised representations of macromolecular surfaces , 2018, Int. J. High Perform. Comput. Appl..

[30]  M. Paoli,et al.  Crystal structure of T state haemoglobin with oxygen bound at all four haems. , 1996, Journal of molecular biology.

[31]  Carlo Ferrari,et al.  Exploring the potential of 3D Zernike descriptors and SVM for protein–protein interface prediction , 2018, BMC Bioinformatics.

[32]  Daisuke Kihara,et al.  Pairwise and multimeric protein-protein docking using the LZerD program suite. , 2014, Methods in molecular biology.

[33]  G. Schneider,et al.  PocketPicker: analysis of ligand binding-sites with shape descriptors , 2007, Chemistry Central Journal.

[34]  Alexander D. MacKerell,et al.  All-atom empirical potential for molecular modeling and dynamics studies of proteins. , 1998, The journal of physical chemistry. B.

[35]  Daisuke Kihara,et al.  Large-scale binding ligand prediction by improved patch-based method Patch-Surfer2.0 , 2015, Bioinform..

[36]  M. Schroeder,et al.  LIGSITEcsc: predicting ligand binding sites using the Connolly surface and degree of conservation , 2006, BMC Structural Biology.