Unusual Metabolism and Hypervariation in the Genome of a Gracilibacterium (BD1-5) from an Oil-Degrading Community

CPR bacteria are generally predicted to be symbionts due to their extensive biosynthetic deficits. Although monophyletic, they are not monolithic in terms of their lifestyles. The organism described here appears to have evolved an unusual metabolic platform not reliant on glucose or pentose sugars. Its biology appears to be centered around bacterial host-derived compounds and/or cell detritus. Amino acids likely provide building blocks for nucleic acids, peptidoglycan, and protein synthesis. We resolved an unusual repeat region that would be invisible without genome curation. The nucleotide sequence is apparently under strong diversifying selection, but the amino acid sequence is under stabilizing selection. The amino acid repeat also occurs in a surface protein of a coexisting bacterium, suggesting colocation and possibly interdependence. ABSTRACT The candidate phyla radiation (CPR) comprises a large monophyletic group of bacterial lineages known almost exclusively based on genomes obtained using cultivation-independent methods. Within the CPR, Gracilibacteria (BD1-5) are particularly poorly understood due to undersampling and the inherent fragmented nature of available genomes. Here, we report the first closed, curated genome of a gracilibacterium from an enrichment experiment inoculated from the Gulf of Mexico and designed to investigate hydrocarbon degradation. The gracilibacterium rose in abundance after the community switched to dominance by Colwellia. Notably, we predict that this gracilibacterium completely lacks glycolysis, the pentose phosphate and Entner-Doudoroff pathways. It appears to acquire pyruvate, acetyl coenzyme A (acetyl-CoA), and oxaloacetate via degradation of externally derived citrate, malate, and amino acids and may use compound interconversion and oxidoreductases to generate and recycle reductive power. The initial genome assembly was fragmented in an unusual gene that is hypervariable within a repeat region. Such extreme local variation is rare but characteristic of genes that confer traits under pressure to diversify within a population. Notably, the four major repeated 9-mer nucleotide sequences all generate a proline-threonine-aspartic acid (PTD) repeat. The genome of an abundant Colwellia psychrerythraea population has a large extracellular protein that also contains the repeated PTD motif. Although we do not know the host for the BD1-5 cell, the high relative abundance of the C. psychrerythraea population and the shared surface protein repeat may indicate an association between these bacteria. IMPORTANCE CPR bacteria are generally predicted to be symbionts due to their extensive biosynthetic deficits. Although monophyletic, they are not monolithic in terms of their lifestyles. The organism described here appears to have evolved an unusual metabolic platform not reliant on glucose or pentose sugars. Its biology appears to be centered around bacterial host-derived compounds and/or cell detritus. Amino acids likely provide building blocks for nucleic acids, peptidoglycan, and protein synthesis. We resolved an unusual repeat region that would be invisible without genome curation. The nucleotide sequence is apparently under strong diversifying selection, but the amino acid sequence is under stabilizing selection. The amino acid repeat also occurs in a surface protein of a coexisting bacterium, suggesting colocation and possibly interdependence.

[1]  Konstantinos D. Tsirigos,et al.  SignalP 5.0 improves signal peptide predictions using deep neural networks , 2019, Nature Biotechnology.

[2]  Alexander J. Probst,et al.  Biosynthetic capacity, metabolic variety and unusual biology in the CPR and DPANN radiations , 2018, Nature Reviews Microbiology.

[3]  Cindy J. Castelle,et al.  Major New Microbial Groups Expand Diversity and Alter our Understanding of the Tree of Life , 2018, Cell.

[4]  Alexander J. Probst,et al.  Simulation of Deepwater Horizon oil plume reveals substrate specialization within a complex community of hydrocarbon degraders , 2017, Proceedings of the National Academy of Sciences.

[5]  Torsten Schwede,et al.  The SWISS-MODEL Repository—new features and functionality , 2016, Nucleic Acids Res..

[6]  Brian C. Thomas,et al.  Measurement of bacterial replication rates in microbial communities , 2016, Nature Biotechnology.

[7]  Brian C. Thomas,et al.  A new view of the tree of life , 2016, Nature Microbiology.

[8]  Christine L. Sun,et al.  Major bacterial lineages are essentially devoid of CRISPR-Cas viral defence systems , 2016, Nature Communications.

[9]  Brian C. Thomas,et al.  Unusual biology across a group comprising more than 15% of domain Bacteria , 2015, Nature.

[10]  Yang Zhang,et al.  The I-TASSER Suite: protein structure and function prediction , 2014, Nature Methods.

[11]  S. Yooseph,et al.  Cultivation of a human-associated TM7 phylotype reveals a reduced genome and epibiotic parasitic lifestyle , 2014, Proceedings of the National Academy of Sciences.

[12]  The Uniprot Consortium,et al.  UniProt: a hub for protein information , 2014, Nucleic Acids Res..

[13]  Patrick X. Zhao,et al.  Prediction of Membrane Transport Proteins and Their Substrate Specificities Using Primary Sequence Information , 2014, PloS one.

[14]  Axel Visel,et al.  Stop codon reassignments in the wild , 2014, Science.

[15]  J. Banfield,et al.  Recoding of the stop codon UGA to glycine by a BD1-5/SN-2 bacterium and niche partitioning between Alpha- and Gammaproteobacteria in a tidal sediment microbial community naturally selected in a laboratory chemostat , 2014, Front. Microbiol..

[16]  Matthew Fraser,et al.  InterProScan 5: genome-scale protein function classification , 2014, Bioinform..

[17]  Alexandros Stamatakis,et al.  RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies , 2014, Bioinform..

[18]  C. Gee,et al.  The antigen 43 structure reveals a molecular Velcro-like mechanism of autotransporter-mediated bacterial clumping , 2013, Proceedings of the National Academy of Sciences.

[19]  Brian C. Thomas,et al.  Small Genomes and Sparse Metabolisms of Sediment-Associated Bacteria from Four Candidate Phyla , 2013, mBio.

[20]  Natalia N. Ivanova,et al.  Insights into the phylogeny and coding potential of microbial dark matter , 2013, Nature.

[21]  Brian C. Thomas,et al.  Fermentation, Hydrogen, and Sulfur Metabolism in Multiple Uncultivated Bacterial Phyla , 2012, Science.

[22]  Edward C. Uberbacher,et al.  Gene and translation initiation site prediction in metagenomic sequences , 2012, Bioinform..

[23]  Shane S. Sturrock,et al.  Geneious Basic: An integrated and extendable desktop software platform for the organization and analysis of sequence data , 2012, Bioinform..

[24]  Gene-Wei Li,et al.  The anti-Shine-Dalgarno sequence drives translational pausing and codon choice in bacteria , 2012, Nature.

[25]  A. Biegert,et al.  HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment , 2011, Nature Methods.

[26]  Sean R. Eddy,et al.  Accelerated Profile HMM Searches , 2011, PLoS Comput. Biol..

[27]  M. W. van der Woude Phase variation: how to create and coordinate population diversity. , 2011, Current opinion in microbiology.

[28]  Lisa D. Muiznieks,et al.  Proline Periodicity Modulates the Self-Assembly Properties of Elastin-Like Polypeptides , 2011 .

[29]  Mark A. Miller,et al.  Creating the CIPRES Science Gateway for inference of large phylogenetic trees , 2010, 2010 Gateway Computing Environments Workshop (GCE).

[30]  Lisa D. Muiznieks,et al.  Proline Periodicity Modulates the Self-assembly Properties of Elastin-like Polypeptides* , 2010, The Journal of Biological Chemistry.

[31]  N. Moran,et al.  Functional Convergence in Reduced Genomes of Bacterial Symbionts Spanning 200 My of Evolution , 2010, Genome biology and evolution.

[32]  Martin Ester,et al.  PSORTb 3.0: improved protein subcellular localization prediction with refined localization subcategories and predictive capabilities for all prokaryotes , 2010, Bioinform..

[33]  G. Weinstock,et al.  VarScan: variant detection in massively parallel sequencing of individual and pooled samples , 2009, Bioinform..

[34]  J. Banfield,et al.  Community-wide analysis of microbial genome sequence signatures , 2009, Genome Biology.

[35]  N. Moran,et al.  Origin of an Alternative Genetic Code in the Extremely Small and GC–Rich Genome of a Bacterial Symbiont , 2009, PLoS genetics.

[36]  R. Nussinov,et al.  Synonymous mutations and ribosome stalling can lead to altered folding pathways and distinct minima. , 2008, Journal of molecular biology.

[37]  Torsten Schwede,et al.  The SWISS-MODEL Repository: new features and functionalities , 2005, Nucleic Acids Res..

[38]  Kuang Lin,et al.  A simple and fast secondary structure prediction method using hidden neural networks , 2005, Bioinform..

[39]  Thomas Dandekar,et al.  Metabolic Interdependence of Obligate Intracellular Bacteria and Their Insect Hosts , 2004, Microbiology and Molecular Biology Reviews.

[40]  Dieter Jahn,et al.  PrediSi: prediction of signal peptides and their cleavage positions , 2004, Nucleic Acids Res..

[41]  E. Nevo,et al.  Microsatellites within genes: structure, function, and evolution. , 2004, Molecular biology and evolution.

[42]  N. Blom,et al.  Feature-based prediction of non-classical and leaderless protein secretion. , 2004, Protein engineering, design & selection : PEDS.

[43]  Michael Zuker,et al.  Mfold web server for nucleic acid folding and hybridization prediction , 2003, Nucleic Acids Res..

[44]  Hans Ellegren,et al.  Mismatch repair and mutational bias in microsatellite DNA. , 2002, Trends in genetics : TIG.

[45]  T. Fukui,et al.  ATP-citrate lyase from the green sulfur bacterium Chlorobium limicola is a heteromeric enzyme composed of two distinct gene products. , 2001, European journal of biochemistry.

[46]  A. Krogh,et al.  Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. , 2001, Journal of molecular biology.

[47]  J. Schwarzbauer,et al.  Fibronectin self-association is mediated by complementary sites within the amino-terminal one-third of the molecule. , 1994, The Journal of biological chemistry.

[48]  G. Gutman,et al.  Slipped-strand mispairing: a major mechanism for DNA sequence evolution. , 1987, Molecular biology and evolution.

[49]  M. W. van der Woude,et al.  Phase variation : how to create and coordinate population diversity , 2011 .

[50]  Eric P. Nawrocki,et al.  Structural rna homology search and alignment using covariance models , 2009 .

[51]  Johannes Söding,et al.  Fast and accurate automatic structure prediction with HHpred , 2009, Proteins.

[52]  Baris E. Suzek,et al.  Databases and ontologies UniRef : comprehensive and non-redundant UniProt reference clusters , 2007 .

[53]  Hiroyuki Ogata,et al.  KEGG: Kyoto Encyclopedia of Genes and Genomes , 1999, Nucleic Acids Res..