Predicting disease-associated mutation of metal-binding sites in proteins using a deep learning approach

Metalloproteins play important roles in many biological processes. Mutations at the metal-binding sites may functionally disrupt metalloproteins, initiating severe diseases; however, there seemed to be no effective approach to predict such mutations until now. Here we develop a deep learning approach to successfully predict disease-associated mutations that occur at the metal-binding sites of metalloproteins. We generate energy-based affinity grid maps and physiochemical features of the metal-binding pockets (obtained from different databases as spatial and sequential features) and subsequently implement these features into a multichannel convolutional neural network. After training the model, the multichannel convolutional neural network can successfully predict disease-associated mutations that occur at the first and second coordination spheres of zinc-binding sites with an area under the curve of 0.90 and an accuracy of 0.82. Our approach stands for the first deep learning approach for the prediction of disease-associated metal-relevant site mutations in metalloproteins, providing a new platform to tackle human diseases. Metals can bind to proteins to fulfil important biological functions. Predicting the features of mutated binding sites can thus help us understand the connection between specific mutations and their role in diseases.

[1]  L. Ji,et al.  Insights into metalloenzyme microenvironments: biomimetic metal complexes with a functional second coordination sphere. , 2013, Chemical Society reviews.

[2]  E. Capriotti,et al.  Functional annotations improve the predictive score of human disease‐related mutations in proteins , 2009, Human mutation.

[3]  Dong-Sheng Cao,et al.  propy: a tool to generate various modes of Chou's PseAAC , 2013, Bioinform..

[4]  Andrew J. Grimm,et al.  Interpreting missense variants: comparing computational methods in human disease genes CDKN2A, MLH1, MSH2, MECP2, and tyrosinase (TYR) , 2007, Human mutation.

[5]  J. Bartek,et al.  The DNA-damage response in human biology and disease , 2009, Nature.

[6]  Wei Gu,et al.  Modes of p53 Regulation , 2009, Cell.

[7]  S. Shete,et al.  Genetic mapping of a third Li-Fraumeni syndrome predisposition locus to human chromosome 1q23. , 2005, Cancer research.

[8]  Kathryn L Haas,et al.  Application of metal coordination chemistry to explore and manipulate cell biology. , 2009, Chemical reviews.

[9]  C. Lim,et al.  Metal binding affinity and selectivity in metalloproteins: insights from computational studies. , 2008, Annual review of biophysics.

[10]  Yi Lu,et al.  Design of functional metalloproteins , 2009, Nature.

[11]  A. Bush,et al.  Biological metals and metal-targeting compounds in major neurodegenerative diseases. , 2014, Chemical Society reviews.

[12]  David S. Goodsell,et al.  Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function , 1998 .

[13]  A. Mondragón,et al.  Molecular Basis of Metal-Ion Selectivity and Zeptomolar Sensitivity by CueR , 2003, Science.

[14]  Robert Sabatier,et al.  IMGT standardized criteria for statistical analysis of immunoglobulin V‐REGION amino acid properties , 2004, Journal of molecular recognition : JMR.

[15]  J. den Hertog,et al.  Noonan syndrome gain-of-function mutations in NRAS cause zebrafish gastrulation defects , 2011, Disease Models & Mechanisms.

[16]  Yi Lu,et al.  Understanding and Modulating Metalloenzymes with Unnatural Amino Acids, Non-Native Metal Ions, and Non-Native Metallocofactors. , 2019, Accounts of chemical research.

[17]  J. Hopper,et al.  Genetic and histopathologic evaluation of BRCA1 and BRCA2 DNA sequence variants of unknown clinical significance. , 2006, Cancer research.

[18]  Cathy H. Wu,et al.  The Universal Protein Resource (UniProt): an expanding universe of protein information , 2005, Nucleic Acids Res..

[19]  Ronen Levy,et al.  First‐ and second‐shell metal binding residues in human proteins are disproportionately associated with disease‐related SNPs , 2011, Human mutation.

[20]  Mi Zhou,et al.  nsSNPAnalyzer: identifying disease-associated nonsynonymous single nucleotide polymorphisms , 2005, Nucleic Acids Res..

[21]  B. Rost,et al.  SNAP: predict effect of non-synonymous polymorphisms on function , 2007, Nucleic acids research.

[22]  Deanna M. Church,et al.  ClinVar: public archive of relationships among sequence variation and human phenotype , 2013, Nucleic Acids Res..

[23]  Gabriele Cavallaro,et al.  MetalPDB: a database of metal sites in biological macromolecular structures , 2012, Nucleic Acids Res..

[24]  P. Bork,et al.  A method and server for predicting damaging missense mutations , 2010, Nature Methods.

[25]  Hongzhe Sun,et al.  Metallomic and metalloproteomic strategies in elucidating the molecular mechanisms of metallodrugs. , 2015, Dalton transactions.

[26]  C E Lipscomb,et al.  Medical Subject Headings (MeSH). , 2000, Bulletin of the Medical Library Association.

[27]  Thomas V. O'Halloran,et al.  Transition Metal Speciation in the Cell: Insights from the Chemistry of Metal Ion Receptors , 2003, Science.

[28]  L. Luzzatto,et al.  Transcriptional Functionality of Germ Line p53 Mutants Influences Cancer Phenotype , 2007, Clinical Cancer Research.

[29]  K. Waldron,et al.  How do bacterial cells ensure that metalloproteins get the correct metal? , 2009, Nature Reviews Microbiology.

[30]  Hongzhe Sun,et al.  UreE-UreG Complex Facilitates Nickel Transfer and Preactivates GTPase of UreG in Helicobacter pylori*♦ , 2015, The Journal of Biological Chemistry.

[31]  Mathias Dunkel,et al.  CancerResource—updated database of cancer-relevant proteins, mutations and interacting drugs , 2015, Nucleic Acids Res..

[32]  M. Vihinen,et al.  Performance of mutation pathogenicity prediction methods on missense variants , 2011, Human mutation.

[33]  Hongzhe Sun,et al.  Nickel translocation between metallochaperones HypA and UreE in Helicobacter pylori. , 2014, Metallomics : integrated biometal science.

[34]  D. Lacey,et al.  Regulation of Protein Catabolism by Muscle-Specific and Cytokine-Inducible Ubiquitin Ligase E3α-II during Cancer Cachexia , 2004, Cancer Research.

[35]  Lawrence C. Brody,et al.  BRCA1 regulates the G2/M checkpoint by activating Chk1 kinase upon DNA damage , 2002, Nature Genetics.

[36]  M. Rauh,et al.  Deficiency of UBR1, a ubiquitin ligase of the N-end rule pathway, causes pancreatic dysfunction, malformations and mental retardation (Johanson-Blizzard syndrome) , 2005, Nature Genetics.

[37]  Dianne Ford,et al.  Metalloproteins and metal sensing , 2009, Nature.