StructMAn: annotation of single-nucleotide polymorphisms in the structural context

The next generation sequencing technologies produce unprecedented amounts of data on the genetic sequence of individual organisms. These sequences carry a substantial amount of variation that may or may be not related to a phenotype. Phenotypically important part of this variation often comes in form of protein-sequence altering (non-synonymous) single nucleotide variants (nsSNVs). Here we present StructMAn, a Web-based tool for annotation of human and non-human nsSNVs in the structural context. StructMAn analyzes the spatial location of the amino acid residue corresponding to nsSNVs in the three-dimensional (3D) protein structure relative to other proteins, nucleic acids and low molecular-weight ligands. We make use of all experimentally available 3D structures of query proteins, and also, unlike other tools in the field, of structures of proteins with detectable sequence identity to them. This allows us to provide a structural context for around 20% of all nsSNVs in a typical human sequencing sample, for up to 60% of nsSNVs in genes related to human diseases and for around 35% of nsSNVs in a typical bacterial sample. Each nsSNV can be visualized and inspected by the user in the corresponding 3D structure of a protein or protein complex. The StructMAn server is available at http://structman.mpi-inf.mpg.de.

[1]  Sabine C. Mueller,et al.  BALL-SNP: combining genetic and structural information to identify candidate non-synonymous single nucleotide polymorphisms , 2015, Genome Medicine.

[2]  B. Rost,et al.  SNAP: predict effect of non-synonymous polymorphisms on function , 2007, Nucleic acids research.

[3]  P. Bork,et al.  Towards a structural basis of human non-synonymous single nucleotide polymorphisms. , 2000, Trends in genetics : TIG.

[4]  Steven Henikoff,et al.  SIFT: predicting amino acid changes that affect protein function , 2003, Nucleic Acids Res..

[5]  P. Bork,et al.  A method and server for predicting damaging missense mutations , 2010, Nature Methods.

[6]  Predrag Radivojac,et al.  MutDB: update on development of tools for the biochemical analysis of genetic variation , 2007, Nucleic Acids Res..

[7]  Simon Kasif,et al.  topoSNP: a topographic database of non-synonymous single nucleotide polymorphisms with and without known disease association , 2004, Nucleic Acids Res..

[8]  The Uniprot Consortium,et al.  UniProt: a hub for protein information , 2014, Nucleic Acids Res..

[9]  O. Lichtarge,et al.  A formal perturbation equation between genotype and phenotype determines the Evolutionary Action of protein-coding variations on fitness , 2014, Genome research.

[10]  Valentin A. Ilyin,et al.  Structure SNP (StSNP): a web server for mapping and modeling nsSNPs on protein structures with linkage to metabolic pathways , 2007, Nucleic Acids Res..

[11]  M. Campbell,et al.  PANTHER: a library of protein families and subfamilies indexed by function. , 2003, Genome research.

[12]  R. Russell,et al.  The relationship between sequence and interaction divergence in proteins. , 2003, Journal of molecular biology.

[13]  Henning Hermjakob,et al.  The Reactome pathway knowledgebase , 2013, Nucleic Acids Res..

[14]  María Martín,et al.  UniProt: A hub for protein information , 2015 .

[15]  Kenny Q. Ye,et al.  An integrated map of genetic variation from 1,092 human genomes , 2012, Nature.

[16]  David T. W. Jones,et al.  Mechismo: predicting the mechanistic impact of mutations and modifications on molecular interactions , 2014, Nucleic acids research.

[17]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[18]  Mark Diekhans,et al.  MuPIT interactive: webserver for mapping variant positions to annotated, interactive 3D structures , 2013, Human Genetics.

[19]  Shuxing Zhang,et al.  Computational prediction of protein hot spot residues. , 2012, Current pharmaceutical design.

[20]  P. Stenson,et al.  Human Gene Mutation Database (HGMD®): 2003 update , 2003, Human mutation.

[21]  Shamil R Sunyaev,et al.  Inferring causality and functional significance of human coding DNA variants. , 2012, Human molecular genetics.

[22]  Jofre Tenorio-Laranga,et al.  dSysMap: exploring the edgetic role of disease mutations , 2015, Nature Methods.

[23]  Chris Morley,et al.  Open Babel: An open chemical toolbox , 2011, J. Cheminformatics.

[24]  István A. Kovács,et al.  Widespread Macromolecular Interaction Perturbations in Human Genetic Disorders , 2015, Cell.

[25]  Henning Hermjakob,et al.  The Reactome pathway Knowledgebase , 2015, Nucleic acids research.

[26]  Ricardo Villamarín-Salomón,et al.  ClinVar: public archive of interpretations of clinically relevant variants , 2015, Nucleic Acids Res..

[27]  D. Altshuler,et al.  A map of human genome variation from population-scale sequencing , 2010, Nature.

[28]  R. Altman,et al.  Collective judgment predicts disease-associated single nucleotide variants , 2013, BMC Genomics.

[29]  R. Altman,et al.  WS-SNPs&GO: a web server for predicting the deleterious effect of human protein variants using functional annotation , 2013, BMC Genomics.

[30]  Subha Madhavan,et al.  SNP2Structure: A Public and Versatile Resource for Mapping and Three-Dimensional Modeling of Missense SNPs on Human Protein Structures , 2015, Computational and structural biotechnology journal.

[31]  J. Moult,et al.  Structural and functional impact of cancer-related missense somatic mutations. , 2011, Journal of molecular biology.

[32]  François Schiettecatte,et al.  OMIM.org: Online Mendelian Inheritance in Man (OMIM®), an online catalog of human genes and genetic disorders , 2014, Nucleic Acids Res..

[33]  Matthew Meyerson,et al.  Structures of lung cancer-derived EGFR mutants and inhibitor complexes: mechanism of activation and insights into differential inhibitor sensitivity. , 2007, Cancer cell.

[34]  Mingming Jia,et al.  COSMIC: exploring the world's knowledge of somatic mutations in human cancer , 2014, Nucleic Acids Res..

[35]  Wei Xu,et al.  The design, synthesis, and biological evaluation of potent receptor tyrosine kinase inhibitors. , 2012, Bioorganic & medicinal chemistry letters.

[36]  Andrew C. R. Martin,et al.  Human Mutation , 2020 .

[37]  Elizabeth M. Smigielski,et al.  dbSNP: the NCBI database of genetic variation , 2001, Nucleic Acids Res..

[38]  Yun Liu,et al.  LS-SNP/PDB: annotated non-synonymous SNPs mapped to Protein Data Bank structures , 2009, Bioinform..

[39]  François Stricher,et al.  The FoldX web server: an online force field , 2005, Nucleic Acids Res..

[40]  Peng Yue,et al.  SNPs3D: Candidate gene and SNP selection for association studies , 2006, BMC Bioinformatics.

[41]  T. Nonaka,et al.  JAK3 inhibitor VI is a mutant specific inhibitor for epidermal growth factor receptor with the gatekeeper mutation T790M. , 2015, World journal of biological chemistry.

[42]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[43]  E. Boerwinkle,et al.  dbNSFP v3.0: A One‐Stop Database of Functional Predictions and Annotations for Human Nonsynonymous and Splice‐Site SNVs , 2016, Human mutation.