An Integrated Metabolomic and Genomic Mining Workflow To Uncover the Biosynthetic Potential of Bacteria

ABSTRACT Microorganisms are a rich source of bioactive compounds; however, chemical identification is a major bottleneck. Strategies that can prioritize the most prolific microbial strains and novel compounds are therefore of great interest. Here, we present an integrated approach to evaluate the biosynthetic richness in bacteria and mine the associated chemical diversity. Thirteen strains closely related to Pseudoalteromonas luteoviolacea, isolated from around the globe, were analyzed using an untargeted metabolomics strategy, and the metabolomic profiles were correlated with whole-genome sequences of the strains. We found considerable diversity: only 2% of the chemical features and 7% of the biosynthetic genes were common to all strains, while 30% of all features and 24% of the genes were unique to single strains. The list of chemical features was reduced to 50 discriminating features using a genetic algorithm and support vector machines. Features were dereplicated by tandem mass spectrometry (MS/MS) networking to identify molecular families of the same biosynthetic origin, and the associated pathways were probed using comparative genomics. Most of the discriminating features were related to antibacterial compounds, including the thiomarinols, which are reported here from P. luteoviolacea for the first time. By comparative genomics, we identified the biosynthetic cluster responsible for the production of the antibiotic indolmycin, which could not be predicted with standard methods. In conclusion, we present an efficient, integrative strategy for elucidating the chemical richness of a given set of bacteria and linking the chemistry to biosynthetic genes.

IMPORTANCE Here we combine chemical analysis and genomics to probe for new bioactive secondary metabolites based on their pattern of distribution within bacterial species. We demonstrate the usefulness of this combined approach in a group of marine Gram-negative bacteria closely related to Pseudoalteromonas luteoviolacea, a species known to produce a broad spectrum of chemicals. The approach allowed us to identify new antibiotics and their associated biosynthetic pathways. Combining chemical analysis and genetics is an efficient “mining” workflow for identifying diverse pharmaceutical candidates in a broad range of microorganisms and is therefore of great use in bioprospecting.
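The shared and strain-specific percentages quoted in the abstract can be read as simple counts over a strain-by-feature presence/absence table. The following is a minimal sketch of that arithmetic, not the authors' pipeline: the function name `core_and_unique_fractions`, the strain labels, and the feature labels are all hypothetical, and the toy data are invented purely for illustration.

```python
from typing import Dict, Set, Tuple


def core_and_unique_fractions(presence: Dict[str, Set[str]]) -> Tuple[float, float]:
    """Return (fraction of features found in every strain,
    fraction of features found in exactly one strain).

    `presence` maps a strain label to the set of features (chemical
    features or genes) detected in that strain. Hypothetical helper.
    """
    if not presence:
        raise ValueError("at least one strain is required")

    # Pool every feature seen in any strain.
    all_features = set().union(*presence.values())
    if not all_features:
        raise ValueError("no features detected in any strain")

    n_strains = len(presence)
    # Count in how many strains each feature occurs.
    occurrences = {
        feature: sum(feature in detected for detected in presence.values())
        for feature in all_features
    }

    core = sum(1 for count in occurrences.values() if count == n_strains)
    unique = sum(1 for count in occurrences.values() if count == 1)
    total = len(all_features)
    return core / total, unique / total


# Toy example with three invented strains and five invented features.
toy = {
    "strain_A": {"f1", "f2", "f3"},
    "strain_B": {"f1", "f2", "f4"},
    "strain_C": {"f1", "f5"},
}
core_fraction, unique_fraction = core_and_unique_fractions(toy)
print(f"core: {core_fraction:.0%}, unique: {unique_fraction:.0%}")
```

In the toy table, one of five features occurs in all three strains and three occur in a single strain. The same counting applies whether the columns are mass spectrometric features or biosynthetic genes, provided presence and absence have already been called.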
