Structural dynamics is a determinant of the functional significance of missense variants

Significance Discrimination of clinically relevant mutations from neutral mutations is of paramount importance in precision medicine and pharmacogenomics. Our study shows that current computational predictions of pathogenicity, mostly based on analysis of sequence conservation, may be improved by considering the changes in the structural dynamics of the protein due to point mutations. We introduce and demonstrate the utility of a classifier that takes advantage of efficient evaluation of structural dynamics by elastic network models. Accurate evaluation of the effect of point mutations on protein function is essential to assessing the genesis and prognosis of many inherited diseases and cancer types. Currently, a wealth of computational tools has been developed for pathogenicity prediction. Two major types of data are used to this aim: sequence conservation/evolution and structural properties. Here, we demonstrate in a systematic way that another determinant of the functional impact of missense variants is the protein’s structural dynamics. Measurable improvement is shown in pathogenicity prediction by taking into consideration the dynamical context and implications of the mutation. Our study suggests that the class of dynamics descriptors introduced here may be used in conjunction with existing features to not only increase the prediction accuracy of the impact of variants on biological function, but also gain insight into the physical basis of the effect of missense variants.

[1]  Ying Liu,et al.  ATPase Subdomain IA Is a Mediator of Interdomain Allostery in Hsp70 Molecular Chaperones , 2014, PLoS Comput. Biol..

[2]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[3]  Keith A. Boroevich,et al.  Systematic analysis of mutation distribution in three dimensional protein structures identifies cancer driver genes , 2016, Scientific Reports.

[4]  Jaroslav Bendl,et al.  PredictSNP: Robust and Accurate Consensus Classifier for Prediction of Disease-Related Mutations , 2014, PLoS Comput. Biol..

[5]  J. Shendure,et al.  Needles in stacks of needles: finding disease-causal variants in a wealth of genomic data , 2011, Nature Reviews Genetics.

[6]  Ivet Bahar,et al.  ProDy: Protein Dynamics Inferred from Theory and Experiments , 2011, Bioinform..

[7]  Cathy H. Wu,et al.  UniProt: the Universal Protein knowledgebase , 2004, Nucleic Acids Res..

[8]  T. Andrews,et al.  Comparison of predicted and actual consequences of missense mutations , 2015, Proceedings of the National Academy of Sciences.

[9]  Ivet Bahar,et al.  iGNM 2.0: the Gaussian network model database for biomolecular structural dynamics , 2015, Nucleic Acids Res..

[10]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[11]  Ivet Bahar,et al.  Adaptability of protein structures to enable functional interactions and evolutionary implications. , 2015, Current opinion in structural biology.

[12]  Ivet Bahar,et al.  Toward a molecular understanding of the anisotropic response of proteins to external forces: insights from elastic network models. , 2008, Biophysical journal.

[13]  Y. Bromberg,et al.  Computational predictors fail to identify amino acid substitution effects at rheostat positions , 2017, Scientific Reports.

[14]  Luca Ponzoni,et al.  Unifying view of mechanical and functional hotspots across class A GPCRs , 2017, PLoS Comput. Biol..

[15]  M. Wall,et al.  Allostery in a coarse-grained model of protein dynamics. , 2005, Physical review letters.

[16]  Gert Vriend,et al.  A series of PDB related databases for everyday needs , 2010, Nucleic Acids Res..

[17]  Nitzan Kol,et al.  G23D: Online tool for mapping and visualization of genomic variants on 3D protein structures , 2016, BMC Genomics.

[18]  I. Adzhubei,et al.  Predicting Functional Effect of Human Missense Mutations Using PolyPhen‐2 , 2013, Current protocols in human genetics.

[19]  S. Sunyaev,et al.  PSIC: profile extraction from sequence alignments with position-specific counts of independent observations. , 1999, Protein engineering.

[20]  Ivet Bahar,et al.  Mechanisms of CFTR Functional Variants That Impair Regulated Bicarbonate Permeation and Increase Risk for Pancreatitis but Not for Cystic Fibrosis , 2014, PLoS genetics.

[21]  Tom R. Gaunt,et al.  Predicting the Functional, Molecular, and Phenotypic Consequences of Amino Acid Substitutions using Hidden Markov Models , 2012, Human mutation.

[22]  D. Covell,et al.  Correlation between native-state hydrogen exchange and cooperative residue fluctuations from a simple model. , 1998, Biochemistry.

[23]  Jana Marie Schwarz,et al.  MutationTaster2: mutation prediction for the deep-sequencing age , 2014, Nature Methods.

[24]  Karsten M. Borgwardt,et al.  The Evaluation of Tools Used to Predict the Impact of Missense Variants Is Hindered by Two Types of Circularity , 2015, Human mutation.

[25]  C. Sander,et al.  Predicting the functional impact of protein mutations: application to cancer genomics , 2011, Nucleic acids research.

[26]  Luca Ponzoni,et al.  SPECTRUS: A Dimensionality Reduction Approach for Identifying Dynamical Domains in Protein Complexes from Limited Structural Datasets. , 2015, Structure.

[27]  A. Gonzalez-Perez,et al.  Improving the assessment of the outcome of nonsynonymous SNVs with a consensus deleteriousness score, Condel. , 2011, American journal of human genetics.

[28]  Thierry Soussi,et al.  Investigation and prediction of the severity of p53 mutants using parameters from structural calculations , 2009, The FEBS journal.

[29]  Anaïs Mottaz,et al.  Bioinformatics Applications Note Databases and Ontologies Easy Retrieval of Single Amino-acid Polymorphisms and Phenotype Information Using Swissvar , 2022 .

[30]  Mauno Vihinen,et al.  VariBench: A Benchmark Database for Variations , 2013, Human mutation.

[31]  Ali Rana Atilgan,et al.  Perturbation-Response Scanning Reveals Ligand Entry-Exit Mechanisms of Ferric Binding Protein , 2009, PLoS Comput. Biol..

[32]  I. Bahar,et al.  Global dynamics of proteins: bridging between structure and function. , 2010, Annual review of biophysics.

[33]  Steven Henikoff,et al.  SIFT: predicting amino acid changes that affect protein function , 2003, Nucleic Acids Res..

[34]  P. Bork,et al.  A method and server for predicting damaging missense mutations , 2010, Nature Methods.

[35]  I. Bahar,et al.  The intrinsic dynamics of enzymes plays a dominant role in determining the structural changes induced upon inhibitor binding , 2009, Proceedings of the National Academy of Sciences.

[36]  Johnny S. H. Kwan,et al.  Predicting Mendelian Disease-Causing Non-Synonymous Single Nucleotide Variants in Exome Sequencing Studies , 2013, PLoS genetics.

[37]  R. Brüschweiler Collective protein dynamics and nuclear spin relaxation , 1995 .

[38]  E. Lander,et al.  Comprehensive assessment of cancer missense mutation clustering in protein structures , 2015, Proceedings of the National Academy of Sciences.

[39]  W. Youden,et al.  Index for rating diagnostic tests , 1950, Cancer.

[40]  I. Bahar,et al.  Structural changes involved in protein binding correlate with intrinsic motions of proteins in the unbound state. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[41]  Tjerk P. Straatsma,et al.  NWChem: A comprehensive and scalable open-source solution for large scale molecular simulations , 2010, Comput. Phys. Commun..

[42]  Serafim Batzoglou,et al.  Identifying a High Fraction of the Human Genome to be under Selective Constraint Using GERP++ , 2010, PLoS Comput. Biol..

[43]  J. Shendure,et al.  A general framework for estimating the relative pathogenicity of human genetic variants , 2014, Nature Genetics.

[44]  Justin C. Fay,et al.  Identification of deleterious mutations within three human genomes. , 2009, Genome research.

[45]  Ivet Bahar,et al.  DynOmics: dynamics of structural proteome and beyond , 2017, Nucleic Acids Res..