GenProBiS: web server for mapping of sequence variants to protein binding sites

Abstract Discovering potentially deleterious sequence variants is important for research and for generating new hypotheses in human and veterinary medicine and in drug discovery. The GenProBiS web server maps sequence variants to protein structures from the Protein Data Bank (PDB), and further to protein–protein, protein–nucleic acid, protein–compound, and protein–metal ion binding sites. A protein–compound binding site is understood here in the broadest sense, which includes glycosylation and other post-translational modification sites. Binding sites were defined in two steps: local structural comparison of whole protein structures using the Protein Binding Sites (ProBiS) algorithm, followed by transposition of ligands from the similar binding sites found to the query protein using the ProBiS-ligands approach, with new improvements introduced in GenProBiS. Binding site surfaces were generated as three-dimensional grids encompassing the space occupied by the predicted ligands. The server allows intuitive visual exploration of comprehensively mapped variants, such as human somatic missense mutations related to cancer and non-synonymous single nucleotide polymorphisms from 21 species, within the predicted binding site regions of about 80 000 PDB protein structures, using fast WebGL graphics. The GenProBiS web server is open and free to all users at http://genprobis.insilab.org.
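The grid-based binding site representation described above can be illustrated with a minimal sketch. It is not the GenProBiS implementation: the function names, the grid spacing, the ligand radius, the distance cutoff, and the residue labels are all assumptions chosen for illustration. The sketch builds a regular grid around the atoms of a predicted ligand, keeps the points close to any ligand atom, and then reports which variant residues have an atom near that grid.

```python
from itertools import product
from math import ceil, dist, floor


def ligand_grid(ligand_atoms, spacing=1.0, radius=2.0):
    """Return grid points lying within `radius` of any ligand atom.

    `spacing` and `radius` (in angstroms) are illustrative values,
    not parameters taken from GenProBiS.
    """
    xs, ys, zs = zip(*ligand_atoms)

    def axis(values):
        # Grid coordinates along one axis, padded by `radius` on both sides.
        lo = floor((min(values) - radius) / spacing)
        hi = ceil((max(values) + radius) / spacing)
        return [i * spacing for i in range(lo, hi + 1)]

    return [
        point
        for point in product(axis(xs), axis(ys), axis(zs))
        if any(dist(point, atom) <= radius for atom in ligand_atoms)
    ]


def variants_in_site(variant_residues, grid, cutoff=3.0):
    """Return sorted labels of residues with an atom within `cutoff` of the grid.

    `variant_residues` maps a hypothetical residue label to a list of
    (x, y, z) atom coordinates; `cutoff` is an assumed value.
    """
    return sorted(
        label
        for label, atoms in variant_residues.items()
        if any(dist(atom, point) <= cutoff for atom in atoms for point in grid)
    )


# Toy data: a two-atom "ligand" and two hypothetical variant residues.
ligand = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0)]
residues = {
    "res_near": [(3.0, 0.0, 0.0)],
    "res_far": [(20.0, 0.0, 0.0)],
}

grid = ligand_grid(ligand)
hits = variants_in_site(residues, grid)
print(hits)
```

The brute-force distance checks keep the sketch short; for whole structures a spatial index such as a k-d tree or cell list would be the more practical choice.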
