Mechismo: predicting the mechanistic impact of mutations and modifications on molecular interactions

Systematic interrogation of mutation or protein modification data is important to identify sites with functional consequences and to deduce global consequences from large data sets. Mechismo (mechismo.russellab.org) enables simultaneous consideration of thousands of 3D structures and biomolecular interactions to predict rapidly mechanistic consequences for mutations and modifications. As useful functional information often only comes from homologous proteins, we benchmarked the accuracy of predictions as a function of protein/structure sequence similarity, which permits the use of relatively weak sequence similarities with an appropriate confidence measure. For protein–protein, protein–nucleic acid and a subset of protein–chemical interactions, we also developed and benchmarked a measure of whether modifications are likely to enhance or diminish the interactions, which can assist the detection of modifications with specific effects. Analysis of high-throughput sequencing data shows that the approach can identify interesting differences between cancers, and application to proteomics data finds potential mechanistic insights for how post-translational modifications can alter biomolecular interactions.

[1]  T. Blundell,et al.  Comparative protein modelling by satisfaction of spatial restraints. , 1993, Journal of molecular biology.

[2]  David S. Wishart,et al.  DrugBank 4.0: shedding new light on drug metabolism , 2013, Nucleic Acids Res..

[3]  B. Rost,et al.  Analysing six types of protein-protein interfaces. , 2003, Journal of molecular biology.

[4]  M. Y. Kim,et al.  Acetylation of estrogen receptor alpha by p300 at lysines 266 and 268 enhances the deoxyribonucleic acid binding and transactivation activities of the receptor. , 2006, Molecular endocrinology.

[5]  M. Mann,et al.  Phosphoproteome Analysis of E. coli Reveals Evolutionary Conservation of Bacterial Ser/Thr/Tyr Phosphorylation*S , 2008, Molecular & Cellular Proteomics.

[6]  Shekhar C Mande,et al.  Exploiting 3D structural templates for detection of metal‐binding sites in protein structures , 2008, Proteins.

[7]  M. Swindells,et al.  Protein clefts in molecular recognition and function. , 1996, Protein science : a publication of the Protein Society.

[8]  P. Bork,et al.  Structure-Based Assembly of Protein Complexes in Yeast , 2004, Science.

[9]  Adam J. Smith,et al.  The Database of Interacting Proteins: 2004 update , 2004, Nucleic Acids Res..

[10]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[11]  C. Sander,et al.  Predicting the functional impact of protein mutations: application to cancer genomics , 2011, Nucleic acids research.

[12]  Cathy H. Wu,et al.  The Universal Protein Resource (UniProt): an expanding universe of protein information , 2005, Nucleic Acids Res..

[13]  Haiyuan Yu,et al.  Three-dimensional reconstruction of protein networks provides insight into human genetic disease , 2012, Nature Biotechnology.

[14]  Eunseog Youn,et al.  Connecting protein interaction data, mutations, and disease using bioinformatics. , 2009, Methods in molecular biology.

[15]  Ioannis Xenarios,et al.  DIP: The Database of Interacting Proteins: 2001 update , 2001, Nucleic Acids Res..

[16]  Tim J. P. Hubbard,et al.  Data growth and its impact on the SCOP database: new developments , 2007, Nucleic Acids Res..

[17]  Livia Perfetto,et al.  MINT, the molecular interaction database: 2009 update , 2009, Nucleic Acids Res..

[18]  Steven E. Brenner,et al.  SCOPe: Structural Classification of Proteins—extended, integrating SCOP and ASTRAL data and classification of new structures , 2013, Nucleic Acids Res..

[19]  A. Fischer,et al.  Novel mutations in the RFXANK gene: RFX complex containing in-vitro-generated RFXANK mutant binds the promoter without transactivating MHC II , 2003, Immunogenetics.

[20]  Torsten Schwede,et al.  The SWISS-MODEL Repository and associated resources , 2008, Nucleic Acids Res..

[21]  S. Henikoff,et al.  Predicting the effects of amino acid substitutions on protein function. , 2006, Annual review of genomics and human genetics.

[22]  Chris Morley,et al.  Open Babel: An open chemical toolbox , 2011, J. Cheminformatics.

[23]  Michael Schroeder,et al.  SCOPPI: a structural classification of protein–protein interfaces , 2005, Nucleic Acids Res..

[24]  David S. Goodsell,et al.  Promoting a structural view of biology for varied audiences: an overview of RCSB PDB resources and experiences , 2010, Journal of applied crystallography.

[25]  P. Bork,et al.  Proteome Organization in a Genome-Reduced Bacterium , 2009, Science.

[26]  E. Birney,et al.  Pfam: the protein families database , 2013, Nucleic Acids Res..

[27]  B. Honig,et al.  Structure-based prediction of protein-protein interactions on a genome-wide scale , 2012, Nature.

[28]  Lennart Martens,et al.  The Ontology Lookup Service: bigger and better , 2010, Nucleic Acids Res..

[29]  Lincoln D. Stein,et al.  Pancreatic cancer genomes reveal aberrations in axon guidance pathway genes , 2012, Nature.

[30]  M. Vidal,et al.  Edgetic perturbation models of human inherited disorders , 2009, Molecular systems biology.

[31]  M. Harding,et al.  The architecture of metal coordination groups in proteins. , 2004, Acta crystallographica. Section D, Biological crystallography.

[32]  Arnaud Céol,et al.  3did: identification and classification of domain-based interactions of known three-dimensional structure , 2010, Nucleic Acids Res..

[33]  Zsuzsanna Dosztányi,et al.  IUPred: web server for the prediction of intrinsically unstructured regions of proteins based on estimated energy content , 2005, Bioinform..

[34]  R. Eils,et al.  Recurrent RHOA mutations in pediatric Burkitt lymphoma treated according to the NHL‐BFM protocols , 2014, Genes, chromosomes & cancer.

[35]  M. Sternberg,et al.  Protein–protein interaction sites are hot spots for disease‐associated nonsynonymous SNPs , 2012, Human mutation.

[36]  A. Gonzalez-Perez,et al.  Improving the assessment of the outcome of nonsynonymous SNVs with a consensus deleteriousness score, Condel. , 2011, American journal of human genetics.

[37]  Gapped BLAST and PSI-BLAST: A new , 1997 .

[38]  P. Uetz,et al.  The binary protein-protein interaction landscape of Escherichia coli , 2014, Nature Biotechnology.

[39]  M. Sternberg,et al.  The effects of non-synonymous single nucleotide polymorphisms (nsSNPs) on protein-protein interactions. , 2013, Journal of molecular biology.

[40]  R. Spang,et al.  Recurrent mutation of the ID3 gene in Burkitt lymphoma identified by integrated genome, exome and transcriptome sequencing , 2012, Nature Genetics.

[41]  Y. Fukushima,et al.  Three novel DNMT3B mutations in Japanese patients with ICF syndrome. , 2002, American journal of medical genetics.

[42]  Jaroslav Bendl,et al.  PredictSNP: Robust and Accurate Consensus Classifier for Prediction of Disease-Related Mutations , 2014, PLoS Comput. Biol..

[43]  P. Aloy,et al.  Interactome3D: adding structural details to protein networks , 2013, Nature Methods.

[44]  J. Heath,et al.  Apert syndrome mutations in fibroblast growth factor receptor 2 exhibit increased affinity for FGF ligand. , 1998, Human molecular genetics.

[45]  R. Russell,et al.  The relationship between sequence and interaction divergence in proteins. , 2003, Journal of molecular biology.

[46]  L. Aaltonen,et al.  Diagnostic Cancer Genome Sequencing and the Contribution of Germline Variants , 2013, Science.

[47]  Rafael C. Jimenez,et al.  The IntAct molecular interaction database in 2012 , 2011, Nucleic Acids Res..

[48]  Daniel R. Zerbino,et al.  Ensembl 2014 , 2013, Nucleic Acids Res..

[49]  Gary D Bader,et al.  BIND--The Biomolecular Interaction Network Database. , 2001, Nucleic acids research.

[50]  Christie S. Chang,et al.  The BioGRID interaction database: 2013 update , 2012, Nucleic Acids Res..

[51]  I. Adzhubei,et al.  Predicting Functional Effect of Human Missense Mutations Using PolyPhen‐2 , 2013, Current protocols in human genetics.

[52]  Ozlem Keskin,et al.  Architectures and functional coverage of protein-protein interfaces. , 2008, Journal of molecular biology.

[53]  Patrick Aloy,et al.  Interrogating protein interaction networks through structural biology , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[54]  A. Shilatifard,et al.  Histone H3 lysine-to-methionine mutants as a paradigm to study chromatin signaling , 2014, Science.

[55]  M. Mann,et al.  Decoding signalling networks by mass spectrometry-based proteomics , 2010, Nature Reviews Molecular Cell Biology.

[56]  Julie C. Mitchell,et al.  Community‐wide evaluation of methods for predicting the effect of mutations on protein–protein interactions , 2013, Proteins.

[57]  M. Sippl Calculation of conformational ensembles from potentials of mean force. An approach to the knowledge-based prediction of local structures in globular proteins. , 1990, Journal of molecular biology.

[58]  Ben M. Webb,et al.  ModBase, a database of annotated comparative protein structure models and associated resources , 2013, Nucleic Acids Res..

[59]  Matthew J. Betts,et al.  Dissecting the genomic complexity underlying medulloblastoma , 2012, Nature.

[60]  Robert B. Russell,et al.  Combinations of Protein-Chemical Complex Structures Reveal New Targets for Established Drugs , 2011, PLoS Comput. Biol..

[61]  Dan M. Bolser,et al.  Visualisation and graph-theoretic analysis of a large-scale protein structural interactome , 2003, BMC Bioinformatics.

[62]  S. Henikoff,et al.  Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm , 2009, Nature Protocols.

[63]  G. Hong,et al.  Nucleic Acids Research , 2015, Nucleic Acids Research.

[64]  A. Barabasi,et al.  High-Quality Binary Protein Interaction Map of the Yeast Interactome Network , 2008, Science.