Long-Read Metagenomics of Marine Microbes Reveals Diversely Expressed Secondary Metabolites

Genome mining of metagenomic data has become the preferred method for the bioprospecting of novel compounds by cataloguing secondary metabolite potential. However, the accurate detection of BGCs requires unfragmented genomic assemblies, which have been technically difficult to obtain from metagenomes until recently with new long-read technologies. ABSTRACT Microbial secondary metabolites play crucial roles in microbial competition, communication, resource acquisition, antibiotic production, and a variety of other biotechnological processes. The retrieval of full-length BGC (biosynthetic gene cluster) sequences from uncultivated bacteria is difficult due to the technical constraints of short-read sequencing, making it impossible to determine BGC diversity. Using long-read sequencing and genome mining, 339 mainly full-length BGCs were recovered in this study, illuminating the wide range of BGCs from uncultivated lineages discovered in seawater from Aoshan Bay, Yellow Sea, China. Many extremely diverse BGCs were discovered in bacterial phyla such as Proteobacteria, Bacteroidota, Acidobacteriota, and Verrucomicrobiota as well as the previously uncultured archaeal phylum “Candidatus Thermoplasmatota.” The data from metatranscriptomics showed that 30.1% of secondary metabolic genes were being expressed, and they also revealed the expression pattern of BGC core biosynthetic genes and tailoring enzymes. Taken together, our results demonstrate that long-read metagenomic sequencing combined with metatranscriptomic analysis provides a direct view into the functional expression of BGCs in environmental processes. IMPORTANCE Genome mining of metagenomic data has become the preferred method for the bioprospecting of novel compounds by cataloguing secondary metabolite potential. However, the accurate detection of BGCs requires unfragmented genomic assemblies, which have been technically difficult to obtain from metagenomes until recently with new long-read technologies. We used high-quality metagenome-assembled genomes generated from long-read data to determine the biosynthetic potential of microbes found in the surface water of the Yellow Sea. We recovered 339 highly diverse and mostly full-length BGCs from largely uncultured and underexplored bacterial and archaeal phyla. Additionally, we present long-read metagenomic sequencing combined with metatranscriptomic analysis as a potential method for gaining access to the largely underutilized genetic reservoir of specialized metabolite gene clusters in the majority of microbes that are not cultured. The combination of long-read metagenomic and metatranscriptomic analyses is significant because it can more accurately assess the mechanisms of microbial adaptation to the environment through BGC expression based on metatranscriptomic data.

[1]  Madhurima Dutta,et al.  Squalene synthase in plants – Functional intricacy and evolutionary divergence while retaining a core catalytic structure , 2023, Plant Gene.

[2]  Elizabeth A. Suter,et al.  Diverse secondary metabolites are expressed in particle-associated and free-living microorganisms of the permanently anoxic Cariaco Basin , 2022, bioRxiv.

[3]  T. Weber,et al.  Long-Read Metagenome-Assembled Genomes Improve Identification of Novel Complete Biosynthetic Gene Clusters in a Complex Microbial Activated Sludge Ecosystem , 2022, mSystems.

[4]  Tom O. Delmont,et al.  Biosynthetic potential of the global ocean microbiome , 2022, Nature.

[5]  M. Arumugam,et al.  MIntO: A Modular and Scalable Pipeline For Microbiome Metagenomic and Metatranscriptomic Data Integration , 2022, Frontiers in Bioinformatics.

[6]  Victòria Pascal Andreu,et al.  Generating lineage-resolved, complete metagenome-assembled genomes from complex microbial communities , 2022, Nature Biotechnology.

[7]  F. Rodríguez-Valera,et al.  Enhanced Recovery of Microbial Genes and Genomes From a Marine Water Column Using Long-Read Metagenomics , 2021, Frontiers in Microbiology.

[8]  C. Corre,et al.  Biosynthetic potential of uncultured Antarctic soil bacteria revealed through long-read metagenomic sequencing , 2021, The ISME Journal.

[9]  C. Corre,et al.  Biosynthetic potential of uncultured Antarctic soil bacteria revealed through long-read metagenomic sequencing , 2021, The ISME Journal.

[10]  C. Hertweck,et al.  Mining and unearthing hidden biosynthetic potential , 2021, Nature Communications.

[11]  Alexander M. Kloosterman,et al.  antiSMASH 6.0: improving cluster detection and comparison capabilities , 2021, Nucleic Acids Res..

[12]  P. Nielsen,et al.  Connecting structure to function with the recovery of over 1000 high-quality metagenome-assembled genomes from activated sludge using long-read sequencing , 2021, Nature Communications.

[13]  A. Visel,et al.  Long-read metagenomics of soil communities reveals phylum-specific secondary metabolite dynamics , 2021, Communications Biology.

[14]  Kai Blin,et al.  BiG-FAM: the biosynthetic gene cluster families database , 2020, Nucleic Acids Res..

[15]  Justin J. J. van der Hooft,et al.  BiG-SLiCE: A highly scalable tool maps the diversity of 1.2 million biosynthetic gene clusters , 2020, bioRxiv.

[16]  S. Blanco,et al.  The cosmetic application of cyanobacterial secondary metabolites , 2020 .

[17]  Douglas A. Mitchell,et al.  RRE-Finder: a Genome-Mining Tool for Class-Independent RiPP Discovery , 2020, mSystems.

[18]  A. Bhatt,et al.  Complete, closed bacterial genomes from microbiomes using nanopore sequencing , 2020, Nature Biotechnology.

[19]  J. Banfield,et al.  Bacterial Secondary Metabolite Biosynthetic Potential in Soil Varies with Phylum, Depth, and Vegetation Type , 2019, mBio.

[20]  Marnix H. Medema,et al.  A computational framework to explore large-scale biosynthetic diversity , 2019, Nature Chemical Biology.

[21]  B. Wilkinson,et al.  Antibiotics: past, present and future. , 2019, Current opinion in microbiology.

[22]  Paula Y. Calle,et al.  Uncovering the biosynthetic potential of rare metagenomic DNA using co-occurrence network analysis of targeted sequences , 2019, Nature Communications.

[23]  Hosein Mohimani,et al.  BiosyntheticSPAdes: reconstructing biosynthetic gene clusters from assembly graphs , 2019, Genome research.

[24]  P. Pevzner,et al.  metaFlye: scalable long-read metagenome assembly using repeat graphs , 2019, Nature Methods.

[25]  Feng Li,et al.  MetaBAT 2: an adaptive binning algorithm for robust and efficient genome reconstruction from metagenome assemblies , 2019, PeerJ.

[26]  Zhihong Zheng,et al.  Characterization of the secondary metabolite biosynthetic gene clusters in archaea , 2019, Comput. Biol. Chem..

[27]  Bas E. Dutilh,et al.  Robust taxonomic classification of uncharted microbial sequences and bins with CAT and BAT , 2019, Genome Biology.

[28]  E. Marcellin,et al.  Inter-Kingdom beach warfare: Microbial chemical communication activates natural chemical defences , 2018, The ISME Journal.

[29]  J. Sohng,et al.  Metabolic engineering of glycosylated polyketide biosynthesis. , 2018, Emerging topics in life sciences.

[30]  Brian C. Thomas,et al.  Novel soil bacteria possess diverse genes for secondary metabolite biosynthesis , 2018, Nature.

[31]  James R Cole,et al.  Nonpareil 3: Fast Estimation of Metagenomic Coverage and Sequence Diversity , 2018, mSystems.

[32]  J. DiRuggiero,et al.  MetaWRAP—a flexible pipeline for genome-resolved metagenomic data analysis , 2018, Microbiome.

[33]  William Fenical,et al.  Comparative transcriptomics as a guide to natural product discovery and biosynthetic gene cluster functionality , 2017, Proceedings of the National Academy of Sciences.

[34]  A. Benjdia,et al.  Radical SAM Enzymes in the Biosynthesis of Ribosomally Synthesized and Post-translationally Modified Peptides (RiPPs) , 2017, Front. Chem..

[35]  Donovan H. Parks,et al.  Recovery of nearly 8,000 metagenome-assembled genomes substantially expands the tree of life , 2017, Nature Microbiology.

[36]  M. Kumar,et al.  Microbial metabolites in nutrition, healthcare and agriculture , 2017, 3 Biotech.

[37]  Niranjan Nagarajan,et al.  Fast and accurate de novo genome assembly from long uncorrected reads. , 2017, Genome research.

[38]  Blake A. Simmons,et al.  MaxBin 2.0: an automated binning algorithm to recover genomes from multiple metagenomic datasets , 2016, Bioinform..

[39]  G. Challis,et al.  Discovery of microbial natural products by activation of silent biosynthetic gene clusters , 2015, Nature Reviews Microbiology.

[40]  T. L. Schneider,et al.  Unusual non‐enzymatic flavin catalysis enhances understanding of flavoenzymes , 2015, FEBS letters.

[41]  Christina A. Cuomo,et al.  Pilon: An Integrated Tool for Comprehensive Microbial Variant Detection and Genome Assembly Improvement , 2014, PloS one.

[42]  Chao Xie,et al.  Fast and sensitive protein alignment using DIAMOND , 2014, Nature Methods.

[43]  Kunihiko Sadakane,et al.  MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph , 2014, Bioinform..

[44]  S. Brady,et al.  Mining the metabiome: identifying novel natural products from microbial communities. , 2014, Chemistry & biology.

[45]  Anders F. Andersson,et al.  Binning metagenomic contigs by coverage and composition , 2014, Nature Methods.

[46]  G. Challis,et al.  Mechanism and Catalytic Diversity of Rieske Non-Heme Iron-Dependent Oxygenases. , 2013, ACS catalysis.

[47]  Günter P. Wagner,et al.  Measurement of mRNA abundance using RNA-seq data: RPKM measure is inconsistent among samples , 2012, Theory in Biosciences.

[48]  Helga Thorvaldsdóttir,et al.  Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration , 2012, Briefings Bioinform..

[49]  Steven L Salzberg,et al.  Fast gapped-read alignment with Bowtie 2 , 2012, Nature Methods.

[50]  M. Donia,et al.  Accessing the Hidden Majority of Marine Natural Products through Metagenomics , 2011, Chembiochem : a European journal of chemical biology.

[51]  Miriam L. Land,et al.  Trace: Tennessee Research and Creative Exchange Prodigal: Prokaryotic Gene Recognition and Translation Initiation Site Identification Recommended Citation Prodigal: Prokaryotic Gene Recognition and Translation Initiation Site Identification , 2022 .

[52]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[53]  Ahmed I. S. Ahmed,et al.  A survey of the biosynthetic potential and specialized metabolites of archaea and understudied bacteria , 2023, Current Research in Biotechnology.

[54]  J. Bohlmann,et al.  Terpenoid biomaterials. , 2008, The Plant journal : for cell and molecular biology.

[55]  S. Kauppinen,et al.  β-ketoacyl-ACP synthase I of Escherichia coli: Nucleotide sequence of thefabB gene and identification of the cerulenin binding residue , 1988, Carlsberg research communications.