A less‐biased analysis of metalloproteins reveals novel zinc coordination geometries

Zinc metalloproteins are involved in many biological processes and play crucial biochemical roles across all domains of life. Local structure around the zinc ion, especially the coordination geometry (CG), is dictated by the protein sequence and is often directly related to the function of the protein. Current methodologies in characterizing zinc metalloproteins' CG consider only previously reported CG models based mainly on nonbiological chemical context. Exceptions to these canonical CG models are either misclassified or discarded as “outliers.” Thus, we developed a less‐biased method that directly handles potential exceptions without pre‐assuming any CG model. Our study shows that numerous exceptions could actually be further classified and that new CG models are needed to characterize them. Also, these new CG models are cross‐validated by strong correlation between independent structural and functional annotation distance metrics, which is partially lost if these new CGs models are ignored. Furthermore, these new CG models exhibit functional propensities distinct from the canonical CG models. Proteins 2015; 83:1470–1487. © 2015 The Authors. Proteins: Structure, Function, and Bioinformatics Published by Wiley Periodicals, Inc.

[1]  Pedro Alexandrino Fernandes,et al.  Analysis of zinc‐ligand bond lengths in metalloproteins: Trends and patterns , 2007, Proteins.

[2]  J. A. Hartigan,et al.  A k-means clustering algorithm , 1979 .

[3]  Antonio Rosato,et al.  Metalloproteomes: a bioinformatic approach. , 2009, Accounts of chemical research.

[4]  J. Hirose,et al.  [Carbonic anhydrase]. , 1983, Tanpakushitsu kakusan koso. Protein, nucleic acid, enzyme.

[5]  Antonio Rosato,et al.  A hint to search for metalloproteins in gene banks , 2004, Bioinform..

[6]  M. Harding,et al.  Small revisions to predicted distances around metal sites in proteins. , 2006, Acta crystallographica. Section D, Biological crystallography.

[7]  K. Nechvíle The High Resolution , 2005 .

[8]  A. Takeda Zinc homeostasis and functions of zinc in the brain , 2001, Biometals.

[9]  W. Maret,et al.  Zinc transfer potentials of the alpha - and beta-clusters of metallothionein are affected by domain interactions in the whole molecule. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[10]  I. Bertini,et al.  A bioinformatics view of zinc enzymes. , 2012, Journal of inorganic biochemistry.

[11]  Zexian Liu,et al.  Computationally characterizing and comprehensive analysis of zinc-binding sites in proteins. , 2014, Biochimica et Biophysica Acta.

[12]  Fionn Murtagh,et al.  Ward’s Hierarchical Agglomerative Clustering Method: Which Algorithms Implement Ward’s Criterion? , 2011, Journal of Classification.

[13]  Angela Milne,et al.  Determination of Mn, Fe, Co, Ni, Cu, Zn, Cd and Pb in seawater using high resolution magnetic sector inductively coupled mass spectrometry (HR-ICP-MS). , 2010, Analytica chimica acta.

[14]  L. Rink,et al.  Zinc homeostasis and immunity. , 2007, Trends in immunology.

[15]  C. Fierke,et al.  Function and mechanism of zinc metalloenzymes. , 2000, The Journal of nutrition.

[16]  Gaetano T. Montelione,et al.  3.11 News & Views 031 CDS , 2005 .

[17]  Liang Tong,et al.  The ROQ domain of Roquin recognizes mRNA constitutive decay element and double-stranded RNA , 2014, Nature Structural &Molecular Biology.

[18]  M. Zastrow,et al.  Designing Hydrolytic Zinc Metalloenzymes , 2014, Biochemistry.

[19]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[20]  Haruki Nakamura,et al.  Announcing the worldwide Protein Data Bank , 2003, Nature Structural Biology.

[21]  Gabriele Cavallaro,et al.  MetalPDB: a database of metal sites in biological macromolecular structures , 2012, Nucleic Acids Res..

[22]  B. Randolf,et al.  Molecular Dynamics Simulation Methods including Quantum Effects , 2008 .

[23]  Janet M. Thornton,et al.  Metal-MACiE: a database of metals involved in biological catalysis , 2009, Bioinform..

[24]  Bo Li,et al.  Separation and identification of zinc-chelating peptides from sesame protein hydrolysate using IMAC-Zn²⁺ and LC-MS/MS. , 2012, Food chemistry.

[25]  R. Rowlett Structure and catalytic mechanism of β-carbonic anhydrases. , 2014, Sub-cellular biochemistry.

[26]  W. Maret,et al.  Zinc ions modulate protein tyrosine phosphatase 1B activity. , 2014, Metallomics : integrated biometal science.

[27]  Susumu Goto,et al.  KEGG for integration and interpretation of large-scale molecular data sets , 2011, Nucleic Acids Res..

[28]  K Nadassy,et al.  Analysis of zinc binding sites in protein crystal structures , 1998, Protein science : a publication of the Protein Society.

[29]  Y. Pang,et al.  Zinc′s Exclusive Tetrahedral Coordination Governed by Its Electronic Structure , 1999 .

[30]  V. Moreno,et al.  Nickel(II) and copper(II) – l-cysteine, l-methionine, l-tryptophan-nucleotide ternary complexes , 1998 .

[31]  Maria João Ramos,et al.  The carboxylate shift in zinc enzymes: a computational study. , 2007, Journal of the American Chemical Society.

[32]  Fionn Murtagh,et al.  Multidimensional clustering algorithms , 1985 .

[33]  A Klug,et al.  Repetitive zinc‐binding domains in the protein transcription factor IIIA from Xenopus oocytes. , 1985, The EMBO journal.

[34]  W. Maret Zinc biochemistry: from a single zinc enzyme to a key element of life. , 2013, Advances in nutrition.

[35]  Matthew Fraser,et al.  InterProScan 5: genome-scale protein function classification , 2014, Bioinform..

[36]  John A. Tainer,et al.  MDB: the Metalloprotein Database and Browser at The Scripps Research Institute , 2002, Nucleic Acids Res..

[37]  C. Poulter,et al.  Zinc is an essential cofactor for type I isopentenyl diphosphate:dimethylallyl diphosphate isomerase. , 2003, Journal of the American Chemical Society.

[38]  W. Maret,et al.  Zinc transfer potentials of the α- and β-clusters of metallothionein are affected by domain interactions in the whole molecule , 2000 .

[39]  Gabriele Cavallaro,et al.  FindGeo: a tool for determining metal coordination geometry , 2012, Bioinform..

[40]  Kun-Yi Hsin,et al.  Mespeus--a database of metal interactions with proteins. , 2014, Methods in molecular biology.

[41]  Kirti Patel,et al.  Analysis of the structural consensus of the zinc coordination centers of metalloprotein structures. , 2007, Biochimica et biophysica acta.

[42]  C. Pabo,et al.  High-resolution structures of variant Zif268-DNA complexes: implications for understanding zinc finger-DNA recognition. , 1998, Structure.

[43]  Malcolm D. Walkinshaw,et al.  Metals in protein structures: a review of their principal features , 2010 .