Eight Unexpected Selenoprotein Families in ABC transport, in Organometallic Biochemistry in Clostridium difficile and other anaerobes, and in Methylmercury Biosynthesis

A novel protein family related to mercury resistance protein MerB, which cleaves Hg-C bonds of organomercurial compounds, is a newly recognized selenoprotein, typically seen truncated in sequence databases at CU (cysteine-selenocysteine) dipeptide sites fifty residues before the true C-terminus. Inspection shows this protein occurs in a nine-gene neighborhood conserved in more than fifty bacterial species, taxonomically diverse but exclusively anaerobic, including spirochetes, deltaproteobacteria, and Gram-positive spore-formers Clostridium difficile and C. botulinum. Three included families are novel selenoproteins in most instances, including two ABC transporter subunits, one a substrate-binding protein with another CU motif, the other a permease subunit with selenocysteine at the substrate-gating position. Phylogenetic profiling shows a strong pattern of co-occurrence with Stickland metabolism selenoproteins, but an even closer link to a group of 8Fe-9S cofactor-type double-cubane proteins. These 8Fe-9S enzymes vary in count and in genome location but frequently sit next to the nine-gene locus. We have named the locus SAO, because of the Selenocysteine-Assisted Organometallic (SAO) biochemistry implied by an uptake ABC transporter with apparent metal-binding selenocysteines, complementary metal efflux pump SaoE, the MerB-like cytosolic enzyme now called SaoL, and comparative genomics signatures suggesting energy metabolism rather than metal resistance. Hypothesizing cycles of formation and dismutation of organometallic compounds involved in fermentative metabolism, we examined methylmercury formation proteins, and discovered most HgcA proteins are selenoproteins as well, with a CU motif N-terminal to the previously predicted start. Seeking additional rare and overlooked selenoproteins, tricky because of their rarity, could help reveal more candidate cryptic biochemical processes.

[1]  William W. Van Treuren,et al.  Clostridium sporogenes uses reductive Stickland metabolism in the gut to generate ATP and produce circulating metabolites , 2022, Nature Microbiology.

[2]  Gabriele H. Marchler,et al.  iCn3D: From Web-Based 3D Viewer to Structural Analysis Tool in Batch Mode , 2021, bioRxiv.

[3]  Narmada Thanki,et al.  RefSeq: expanding the Prokaryotic Genome Annotation Pipeline reach with protein family model curation , 2020, Nucleic Acids Res..

[4]  Conrad C. Huang,et al.  UCSF ChimeraX: Structure visualization for researchers, educators, and developers , 2020, Protein science : a publication of the Protein Society.

[5]  D. Grahame,et al.  Diverse Energy-Conserving Pathways in Clostridium difficile: Growth in the Absence of Amino Acid Stickland Acceptors and the Role of the Wood-Ljungdahl Pathway , 2020, Journal of Bacteriology.

[6]  H. Dobbek,et al.  Double‐Cubane [8Fe9S] Clusters: A Novel Nitrogenase‐Related Cofactor in Biology , 2020, Chembiochem : a European journal of chemical biology.

[7]  C. D. de Koster,et al.  Vegetative Cell and Spore Proteomes of Clostridioides difficile Show Finite Differences and Reveal Potential Protein Markers , 2019, bioRxiv.

[8]  D. Jahn,et al.  Metabolism the Difficile Way: The Key to the Success of the Pathogen Clostridioides difficile , 2019, Front. Microbiol..

[9]  H. Dobbek,et al.  ATP-dependent substrate reduction at an [Fe , 2018 .

[10]  Wen J. Li,et al.  RefSeq: an update on prokaryotic genome annotation and curation , 2017, Nucleic Acids Res..

[11]  Adam P Arkin,et al.  PaperBLAST: Text Mining Papers for Information about Homologs , 2017, mSystems.

[12]  J. Sonnenburg,et al.  The emerging metabolic view of Clostridium difficile pathogenesis. , 2017, Current opinion in microbiology.

[13]  J. Omichinski,et al.  Structural and Biochemical Characterization of a Copper-Binding Mutant of the Organomercurial Lyase MerB: Insight into the Key Role of the Active Site Aspartic Acid in Hg-Carbon Bond Cleavage and Metal Binding Specificity. , 2016, Biochemistry.

[14]  Yan Zhang,et al.  Comparative genomics reveals new evolutionary and ecological patterns of selenium utilization in bacteria , 2016, The ISME Journal.

[15]  Edward W. Tate,et al.  Quantitative Lipoproteomics in Clostridium difficile Reveals a Role for Lipoproteins in Sporulation. , 2015, Chemistry & biology.

[16]  R. Percudani,et al.  Evolution of the Selenoproteome in Helicobacter pylori and Epsilonproteobacteria , 2015, Genome biology and evolution.

[17]  Jerry M. Parks,et al.  Site-Directed Mutagenesis of HgcA and HgcB Reveals Amino Acid Residues Important for Mercury Methylation , 2015, Applied and Environmental Microbiology.

[18]  D. Haft,et al.  Using comparative genomics to drive new discoveries in microbiology. , 2015, Current opinion in microbiology.

[19]  Dachuan Zhang,et al.  MMDB and VAST+: tracking structural similarities between macromolecular complexes , 2013, Nucleic Acids Res..

[20]  R. Guigó,et al.  SECISearch3 and Seblastian: new tools for prediction of SECIS elements and selenoproteins , 2013, Nucleic acids research.

[21]  Jerry M. Parks,et al.  The Genetic Basis for Bacterial Mercury Methylation , 2013, Science.

[22]  Malay Kumar Basu,et al.  ProPhylo: partial phylogenetic profiling to guide protein family construction and assignment of biological process , 2011, BMC Bioinformatics.

[23]  L. Hancock,et al.  A Selenium-Dependent Xanthine Dehydrogenase Triggers Biofilm Proliferation in Enterococcus faecalis through Oxidant Production , 2011, Journal of bacteriology.

[24]  Sean R Eddy,et al.  A new generation of homology search tools based on probabilistic inference. , 2009, Genome informatics. International Conference on Genome Informatics.

[25]  V. Gladyshev,et al.  The selenoproteome of Clostridium sp. OhILAs: Characterization of anaerobic bacterial selenoprotein methionine sulfoxide reductase A , 2009, Proteins.

[26]  V. Gladyshev,et al.  In silico identification of genes involved in selenium metabolism: evidence for a third selenium utilization trait , 2008, BMC Genomics.

[27]  D. Haft,et al.  Orphan SelD proteins and selenium-dependent molybdenum hydroxylases , 2008, Biology Direct.

[28]  W. Buckel,et al.  Spectroscopic evidence for an all-ferrous [4Fe–4S]0 cluster in the superreduced activator of 2-hydroxyglutaryl-CoA dehydratase from Acidaminococcus fermentans , 2008, JBIC Journal of Biological Inorganic Chemistry.

[29]  Yan Zhang,et al.  Dynamic evolution of selenocysteine utilization in bacteria: a balance between selenoprotein loss and evolution of selenocysteine from redox active cysteine residues , 2006, Genome Biology.

[30]  William T. Self,et al.  Analysis of Proline Reduction in the Nosocomial Pathogen Clostridium difficile , 2006, Journal of bacteriology.

[31]  Naomi Ward,et al.  Exopolysaccharide-associated protein sorting in environmental organisms: the PEP-CTERM/EpsH system. Application of a novel phylogenetic profiling heuristic , 2006, BMC Biology.

[32]  Volker Hollich,et al.  Scoredist: A simple and robust protein sequence distance estimator , 2005, BMC Bioinformatics.

[33]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[34]  T. Stadtman,et al.  Cofactor determination and spectroscopic characterization of the selenium-dependent purine hydroxylase from Clostridium purinolyticum. , 2003, Biochemistry.

[35]  D E Wemmer,et al.  Solution nuclear magnetic resonance structure of a protein disulfide oxidoreductase from Methanococcus jannaschii , 2001, Protein science : a publication of the Protein Society.

[36]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[37]  J F Gibrat,et al.  Surprising similarities in structure comparison. , 1996, Current opinion in structural biology.

[38]  V. Gladyshev,et al.  Properties of the selenium- and molybdenum-containing nicotinic acid hydroxylase from Clostridium barkeri. , 1996, Biochemistry.

[39]  R. Durbin,et al.  A dot-matrix program with dynamic threshold control suited for genomic DNA and protein sequence analysis. , 1995, Gene.

[40]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.