Computational Saturation Screen Reveals the Landscape of Mutations in Human Fumarate Hydratase

Single amino acid substitutions within protein structures often manifest with clinical conditions in humans. The mutation of a single amino can significantly alter protein folding and stability, or change protein dynamics to influence function. The chemical engineering field has developed a large toolset for predicting the influence of point mutations with the aim of guiding the design of improved and more stable proteins. Here, we reverse this general protocol and adapt these tools for the prediction of damaging mutations within proteins. Mutations to fumarate hydratase (FH), an enzyme of the citric acid cycle, can lead to human diseases. The inactivation of FH by mutation causes leiomyomas and renal cell carcinoma by subsequent fumarate buildup and reduction in available malate. We present a scheme for accurately predicting the clinical effects of every possible mutation in FH by adaptation to a database of characterized damaging and benign mutations. Using energy prediction tools Rosetta and FoldX coupled with molecular dynamics simulations, we accurately predict individual mutations as well as mutational hotspots with a high disruptive capability in FH. Furthermore, through dynamic analysis, we find that hinge regions of the protein can be stabilized or destabilized by mutations, with mechanistic implications for the functional ability of the enzyme. Finally, we categorize all potential mutations in FH into functional groups, predicting which known mutations in the human population are loss of function, therefore having clinical implications, and validate our findings through metabolomics data of characterized human cell lines.

[1]  Leland McInnes,et al.  UMAP: Uniform Manifold Approximation and Projection , 2018, J. Open Source Softw..

[2]  P. Bork,et al.  A method and server for predicting damaging missense mutations , 2010, Nature Methods.

[3]  François Stricher,et al.  The FoldX web server: an online force field , 2005, Nucleic Acids Res..

[4]  Burkhard Rost,et al.  Correlating protein function and stability through the analysis of single amino acid substitutions , 2009, BMC Bioinformatics.

[5]  K. Lindorff-Larsen,et al.  Computational and cellular studies reveal structural destabilization and degradation of MLH1 variants in Lynch syndrome , 2019, eLife.

[6]  D. Baker,et al.  Role of conformational sampling in computing mutation‐induced changes in protein structure and stability , 2011, Proteins.

[7]  R. Huber,et al.  Crystallographic refinement and atomic models of two different forms of citrate synthase at 2.7 and 1.7 A resolution. , 1984, Journal of molecular biology.

[8]  Jean-Pierre Bayley,et al.  The FH mutation database: an online database of fumarate hydratase mutations involved in the MCUL (HLRCC) tumor syndrome and congenital fumarase deficiency , 2008, BMC Medical Genetics.

[9]  J. Shendure,et al.  The origins, determinants, and consequences of human mutations , 2015, Science.

[10]  A. Rowan,et al.  Missense mutations in fumarate hydratase in multiple cutaneous and uterine leiomyomatosis and renal cell cancer. , 2005, The Journal of molecular diagnostics : JMD.

[11]  Eran Segal,et al.  A Systematic p53 Mutation Library Links Differential Functional Impact to Cancer Mutation Pattern and Evolutionary Conservation. , 2018, Molecular cell.

[12]  M. Simpson,et al.  Germline FH mutations presenting with pheochromocytoma. , 2014, The Journal of clinical endocrinology and metabolism.

[13]  S. Dhanasekaran,et al.  Hereditary Leiomyomatosis and Renal Cell Carcinoma Syndrome (HLRCC): A Contemporary Review and Practical Discussion of the Differential Diagnosis for HLRCC-Associated Renal Cell Carcinoma. , 2018, Archives of pathology & laboratory medicine.

[14]  A. Clarke,et al.  Structural basis of substrate specificity in malate dehydrogenases: crystal structure of a ternary complex of porcine cytoplasmic malate dehydrogenase, alpha-ketomalonate and tetrahydoNAD. , 1999, Journal of molecular biology.

[15]  The role of the allosteric B site in the fumarase reaction. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[16]  Bartek Wilczynski,et al.  Biopython: freely available Python tools for computational molecular biology and bioinformatics , 2009, Bioinform..

[17]  E. Shaulian,et al.  Fumarase: A Mitochondrial Metabolic Enzyme and a Cytosolic/Nuclear Component of the DNA Damage Response , 2010, PLoS biology.

[18]  Berk Hess,et al.  GROMACS: High performance molecular simulations through multi-level parallelism from laptops to supercomputers , 2015 .

[19]  Joshua M. Korn,et al.  Next-generation characterization of the Cancer Cell Line Encyclopedia , 2019, Nature.

[20]  Xian-Ming Pan,et al.  New method for accurate prediction of solvent accessibility from protein sequence , 2001, Proteins.

[21]  Luis Serrano,et al.  FoldX 5.0: working with RNA, small molecules and a new graphical interface , 2019, Bioinform..

[22]  Huan‐Xiang Zhou,et al.  Prediction of solvent accessibility and sites of deleterious mutations from protein sequence , 2005, Nucleic acids research.

[23]  I. Bahar,et al.  Gaussian Dynamics of Folded Proteins , 1997 .

[24]  Jens Rudat,et al.  FoldX as Protein Engineering Tool: Better Than Random Based Approaches? , 2018, Computational and structural biotechnology journal.

[25]  Frank DiMaio,et al.  Prediction of Protein Mutational Free Energy: Benchmark and Sampling Improvements Increase Classification Accuracy , 2020, Frontiers in Bioengineering and Biotechnology.

[26]  C. Frezza,et al.  Fumarate hydratase in cancer: A multifaceted tumour suppressor. , 2020, Seminars in cell & developmental biology.

[27]  Ivet Bahar,et al.  ProDy: Protein Dynamics Inferred from Theory and Experiments , 2011, Bioinform..

[28]  Jingjing Ye,et al.  Statistical method on nonrandom clustering with application to somatic mutations in cancer , 2010, BMC Bioinformatics.

[29]  Philip M. Kim,et al.  Predicting changes in protein stability caused by mutation using sequence‐and structure‐based methods in a CAGI5 blind challenge , 2019, Human mutation.

[30]  K. Kavanagh,et al.  Structural basis of fumarate hydratase deficiency , 2011, Journal of Inherited Metabolic Disease.

[31]  O. Pines,et al.  Fumarase is involved in DNA double-strand break resection through a functional interaction with Sae2 , 2018, Current Genetics.

[32]  M. Nonato,et al.  Structural, biochemical and biophysical characterization of recombinant human fumarate hydratase , 2019, The FEBS journal.

[33]  Tomoyoshi Soga,et al.  The emerging role of fumarate as an oncometabolite , 2012, Front. Oncol..

[34]  C. Stout,et al.  Crystal structures of aconitase with isocitrate and nitroisocitrate bound. , 1993, Biochemistry.

[35]  Oliver Beckstein,et al.  MDAnalysis: A toolkit for the analysis of molecular dynamics simulations , 2011, J. Comput. Chem..

[36]  Andreas P. Eichenberger,et al.  Definition and testing of the GROMOS force-field versions 54A7 and 54B7 , 2011, European Biophysics Journal.

[37]  G. G. Galli,et al.  The landscape of cancer cell line metabolism , 2019, Nature Medicine.

[38]  I. Leigh,et al.  Fumarate hydratase mutations and predisposition to cutaneous leiomyomas, uterine leiomyomas and renal cancer , 2005, The British journal of dermatology.

[39]  Roland L. Dunbrack,et al.  The Rosetta all-atom energy function for macromolecular modeling and design , 2017, bioRxiv.

[40]  S. Bouvier,et al.  Systematic mutation of bacteriophage T4 lysozyme. , 1991, Journal of molecular biology.

[41]  W. Anderson,et al.  Crystal structure of T4-lysozyme generated from synthetic coding DNA expressed in Escherichia coli. , 1988, Protein engineering.

[42]  Adam A. Margolin,et al.  The Cancer Cell Line Encyclopedia enables predictive modeling of anticancer drug sensitivity , 2012, Nature.