Structuprint: a scalable and extensible tool for two-dimensional representation of protein surfaces

BackgroundThe term ‘molecular cartography’ encompasses a family of computational methods for two-dimensional transformation of protein structures and analysis of their physicochemical properties. The underlying algorithms comprise multiple manual steps, whereas the few existing implementations typically restrict the user to a very limited set of molecular descriptors.ResultsWe present Structuprint, a free standalone software that fully automates the rendering of protein surface maps, given - at the very least - a directory with a PDB file and an amino acid property. The tool comes with a default database of 328 descriptors, which can be extended or substituted by user-provided ones. The core algorithm comprises the generation of a mould of the protein surface, which is subsequently converted to a sphere and mapped to two dimensions, using the Miller cylindrical projection. Structuprint is partly optimized for multicore computers, making the rendering of animations of entire molecular dynamics simulations feasible.ConclusionsStructuprint is an efficient application, implementing a molecular cartography algorithm for protein surfaces. According to the results of a benchmark, its memory requirements and execution time are reasonable, allowing it to run even on low-end personal computers. We believe that it will be of use - primarily but not exclusively - to structural biologists and computational biochemists.

[1]  G. Rose,et al.  Molecular cartography of globular proteins with application to antigenic sites , 1986, Biopolymers.

[2]  A. Balaban Highly discriminating distance-based topological index , 1982 .

[3]  Tudor I. Oprea,et al.  Property distribution of drug-related chemical databases* , 2000, J. Comput. Aided Mol. Des..

[4]  T. Kohzuma,et al.  Novel Insight into the Copper-Ligand Geometry in the Crystal Structure of Ulva pertusa Plastocyanin at 1.6-Å Resolution , 1999, The Journal of Biological Chemistry.

[5]  P. Jurs,et al.  Development and use of charged partial surface area structural descriptors in computer-assisted quantitative structure-property relationship studies , 1990 .

[6]  Molecular cartography of proteins: surface relief analysis of the calf eye lens protein gamma-crystallin. , 1989, Protein engineering.

[7]  J. Kazius,et al.  Derivation and validation of toxicophores for mutagenicity prediction. , 2005, Journal of medicinal chemistry.

[8]  Michel Petitjean,et al.  Applications of the radius-diameter diagram to the classification of topological and geometrical shapes of chemical compounds , 1992, J. Chem. Inf. Comput. Sci..

[9]  K. M. Smith,et al.  Novel software tools for chemical diversity , 1998 .

[10]  V. Pande,et al.  Heterogeneity even at the speed limit of folding: large-scale molecular dynamics study of a fast-folding variant of the villin headpiece. , 2007, Journal of molecular biology.

[11]  Gordon M. Crippen,et al.  Prediction of Physicochemical Parameters by Atomic Contributions , 1999, J. Chem. Inf. Comput. Sci..

[12]  Janusz M Bujnicki,et al.  SURF’s UP! — Protein classification by surface comparisons , 2007, Journal of Biosciences.

[13]  Guillaume Levieux,et al.  Towards real-time interactive visualization modes of molecular surfaces: examples with udock , 2015, 2015 IEEE 1st International Workshop on Virtual and Augmented Reality for Molecular Science (VARMS@IEEEVR).

[14]  C. Orengo,et al.  Protein families and their evolution-a structural perspective. , 2005, Annual review of biochemistry.

[15]  H. Wiener Structural determination of paraffin boiling points. , 1947, Journal of the American Chemical Society.

[16]  M Kokkinidis,et al.  Protein plasticity to the extreme: changing the topology of a 4-alpha-helical bundle with a single amino acid substitution. , 1999, Structure.

[17]  P. Selzer,et al.  Fast calculation of molecular polar surface area as a sum of fragment-based contributions and its application to the prediction of drug transport properties. , 2000, Journal of medicinal chemistry.

[18]  John P. Snyder,et al.  Map Projections: A Working Manual , 2012 .

[19]  R. C. Weast CRC Handbook of Chemistry and Physics , 1973 .

[20]  Conrad C. Huang,et al.  UCSF Chimera—A visualization system for exploratory research and analysis , 2004, J. Comput. Chem..

[21]  Roman G. Efremov,et al.  Deciphering Fine Molecular Details of Proteins' Structure and Function with a Protein Surface Topography (PST) Method , 2014, J. Chem. Inf. Model..

[22]  Hadley Wickham,et al.  ggplot2 - Elegant Graphics for Data Analysis (2nd Edition) , 2017 .

[23]  O. M. Miller Notes on Cylindrical World Map Projections , 1942 .

[24]  "Iso-depth contour map" of a molecular surface. , 1994, Journal of molecular graphics.

[25]  Alexandru T. Balaban,et al.  Chemical graphs , 1979 .

[26]  L. Hall,et al.  The Molecular Connectivity Chi Indexes and Kappa Shape Indexes in Structure‐Property Modeling , 2007 .

[27]  F. Lombardo,et al.  Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings , 1997 .

[28]  Sophia Kossida,et al.  Space Constrained Homology Modelling: The Paradigm of the RNA-Dependent RNA Polymerase of Dengue (Type II) Virus , 2013, Comput. Math. Methods Medicine.

[29]  Tingjun Hou,et al.  ADME evaluation in drug discovery , 2002, Journal of molecular modeling.

[30]  Yanli Wang,et al.  Structure-Based Virtual Screening for Drug Discovery: a Problem-Centric Review , 2012, The AAPS Journal.

[31]  Tingjun Hou,et al.  ADME Evaluation in Drug Discovery. 4. Prediction of Aqueous Solubility Based on Atom Contribution Approach , 2004, J. Chem. Inf. Model..

[32]  D. Cox,et al.  An Analysis of Transformations , 1964 .

[33]  A. Sacan,et al.  Protein surface representation and analysis by dimension reduction , 2012, Proteome Science.

[34]  Alexandros Stamatakis,et al.  RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies , 2014, Bioinform..

[35]  A Godzik,et al.  Surface map comparison: studying function diversity of homologous proteins. , 2001, Journal of molecular biology.

[36]  Chuong B. Do,et al.  ProbCons: Probabilistic consistency-based multiple sequence alignment. , 2005, Genome research.