Mining Bacterial Genomes for Secondary Metabolite Gene Clusters.

With the emergence of bacterial resistance against frequently used antibiotics, novel antibacterial compounds are urgently needed. Traditional bioactivity-guided drug discovery strategies involve laborious screening efforts and display high rediscovery rates. With the progress in next generation sequencing methods and the knowledge that the majority of antibiotics in clinical use are produced as secondary metabolites by bacteria, mining bacterial genomes for secondary metabolites with antimicrobial activity is a promising approach, which can guide a more time and cost-effective identification of novel compounds. However, what sounds easy to accomplish, comes with several challenges. To date, several tools for the prediction of secondary metabolite gene clusters are available, some of which are based on the detection of signature genes, while others are searching for specific patterns in gene content or regulation.Apart from the mere identification of gene clusters, several other factors such as determining cluster boundaries and assessing the novelty of the detected cluster are important. For this purpose, comparison of the predicted secondary metabolite genes with different cluster and compound databases is necessary. Furthermore, it is advisable to classify detected clusters into gene cluster families. So far, there is no standardized procedure for genome mining; however, different approaches to overcome all of these challenges exist and are addressed in this chapter. We give practical guidance on the workflow for secondary metabolite gene cluster identification, which includes the determination of gene cluster boundaries, addresses problems occurring with the use of draft genomes, and gives an outlook on the different methods for gene cluster classification. Based on comprehensible examples a protocol is set, which should enable the readers to mine their own genome data for interesting secondary metabolites.

[1]  H. Vogel,et al.  Current understanding of fatty acid biosynthesis and the acyl carrier protein. , 2010, The Biochemical journal.

[2]  S. Brady,et al.  eSNaPD: a versatile, web-based bioinformatics platform for surveying and mining natural product biosynthetic diversity from metagenomes. , 2014, Chemistry & biology.

[3]  Michael A. Skinnider,et al.  An automated Genomes-to-Natural Products platform (GNP) for the discovery of modular natural products , 2015, Nature Communications.

[4]  M. Marahiel,et al.  Impact of Epimerization Domains on the Intermodular Transfer of Enzyme‐Bound Intermediates in Nonribosomal Peptide Synthesis , 2006, Chembiochem : a European journal of chemical biology.

[5]  Edgar Wingender,et al.  PRODORIC: prokaryotic database of gene regulation , 2003, Nucleic Acids Res..

[6]  Marco Fondi,et al.  Methods for assembling reads and producing contigs. , 2015, Methods in molecular biology.

[7]  L. Heide The aminocoumarins: biosynthesis and biology. , 2009, Natural product reports.

[8]  William Stafford Noble,et al.  FIMO: scanning for occurrences of a given motif , 2011, Bioinform..

[9]  Z. Deng,et al.  Natural and engineered biosynthesis of nucleoside antibiotics in Actinomycetes , 2016, Journal of Industrial Microbiology & Biotechnology.

[10]  D. Herschlag,et al.  Assembly Line Polyketide Synthases: Mechanistic Insights and Unsolved Problems , 2014, Biochemistry.

[11]  David J Newman,et al.  Natural products as sources of new drugs over the 30 years from 1981 to 2010. , 2012, Journal of natural products.

[12]  Koichiro Tamura,et al.  MEGA6: Molecular Evolutionary Genetics Analysis version 6.0. , 2013, Molecular biology and evolution.

[13]  Neil L Kelleher,et al.  A Roadmap for Natural Product Discovery Based on Large-Scale Genomics and Metabolomics , 2014, Nature chemical biology.

[14]  J. Bartlett,et al.  The epidemic of antibiotic-resistant infections: a call to action for the medical community from the Infectious Diseases Society of America. , 2008, Clinical infectious diseases : an official publication of the Infectious Diseases Society of America.

[15]  W. Wohlleben,et al.  Elucidation of the zinc-dependent regulation in Amycolatopsis japonicum enabled the identification of the ethylenediamine-disuccinate ([S,S]-EDDS) genes. , 2016, Environmental microbiology.

[16]  Hein J. Wijma,et al.  Aminoacyl‐coenzyme A synthesis catalyzed by a CoA ligase from Penicillium chrysogenum , 2011, FEBS letters.

[17]  Gitanjali Yadav,et al.  SBSPKS: structure based sequence analysis of polyketide synthases , 2010, Nucleic Acids Res..

[18]  Victor M. Markowitz,et al.  IMG-ABC: A Knowledge Base To Fuel Discovery of Biosynthetic Gene Clusters and Novel Secondary Metabolites , 2015, mBio.

[19]  D. Newman,et al.  Natural products as sources of new drugs over the last 25 years. , 2007, Journal of natural products.

[20]  M. Fillat The FUR (ferric uptake regulator) superfamily: diversity and versatility of key transcriptional regulators. , 2014, Archives of biochemistry and biophysics.

[21]  Peter Williams,et al.  IMG: the integrated microbial genomes database and comparative analysis system , 2011, Nucleic Acids Res..

[22]  Krystle L. Chavarria,et al.  Diversity and evolution of secondary metabolism in the marine actinomycete genus Salinispora , 2014, Proceedings of the National Academy of Sciences.

[23]  Andriy Luzhetskyy,et al.  Type II polyketide synthases: gaining a deeper insight into enzymatic teamwork. , 2007, Natural product reports.

[24]  F. Peláez The historical delivery of antibiotics from microbial natural products--can history repeat? , 2006, Biochemical pharmacology.

[25]  Kira J Weissman,et al.  The structural biology of biosynthetic megaenzymes. , 2015, Nature chemical biology.

[26]  Kai Blin,et al.  antiSMASH 2.0—a versatile platform for genome mining of secondary metabolite producers , 2013, Nucleic Acids Res..

[27]  Kai Blin,et al.  antiSMASH: rapid identification, annotation and analysis of secondary metabolite biosynthesis gene clusters in bacterial and fungal genome sequences , 2011, Nucleic Acids Res..

[28]  Sylvie Lautru,et al.  Discovery of a new peptide natural product by Streptomyces coelicolor genome mining , 2005, Nature chemical biology.

[29]  Ray Luo,et al.  Comprehensive Structural and Biochemical Analysis of the Terminal Myxalamid Reductase Domain for the Engineered Production of Primary Alcohols. , 2015, Chemistry & biology.

[30]  M. Cryle,et al.  X-domain of peptide synthetases recruits oxygenases crucial for glycopeptide biosynthesis , 2015, Nature.

[31]  Michael A Fischbach,et al.  Computational approaches to natural product discovery. , 2015, Nature chemical biology.

[32]  D. Kallifidas,et al.  Zinc-Responsive Regulation of Alternative Ribosomal Protein Genes in Streptomyces coelicolor Involves Zur and σR , 2007, Journal of bacteriology.

[33]  Nobuyuki Fujita,et al.  DoBISCUIT: a database of secondary metabolite biosynthetic gene clusters , 2012, Nucleic Acids Res..

[34]  James H Naismith,et al.  Structural aspects of non-ribosomal peptide biosynthesis. , 2004, Current opinion in structural biology.

[35]  J. Badger,et al.  The Natural Product Domain Seeker NaPDoS: A Phylogeny Based Bioinformatic Tool to Classify Secondary Metabolite Gene Diversity , 2012, PloS one.

[36]  T. Eguchi,et al.  Biosynthetic genes for aminoglycoside antibiotics , 2009, The Journal of Antibiotics.

[37]  Michael G Thomas,et al.  Biosynthesis of polyketide synthase extender units. , 2009, Natural product reports.

[38]  Haruo Ikeda,et al.  Exploration and mining of the bacterial terpenome. , 2012, Accounts of chemical research.

[39]  M. Burkart,et al.  Explorations of catalytic domains in non-ribosomal peptide synthetase enzymology. , 2012, Natural product reports.

[40]  M. Marahiel,et al.  Nonribosomal peptide synthetases: structures and dynamics. , 2010, Current opinion in structural biology.

[41]  Debasisa Mohanty,et al.  Versatility of polyketide synthases in generating metabolic diversity. , 2007, Current opinion in structural biology.

[42]  Mikael Bodén,et al.  MEME Suite: tools for motif discovery and searching , 2009, Nucleic Acids Res..

[43]  P. G. Arnison,et al.  Ribosomally synthesized and post-translationally modified peptide natural products: overview and recommendations for a universal nomenclature. , 2013, Natural product reports.

[44]  G. Niu,et al.  Nucleoside antibiotics: biosynthesis, regulation, and biotechnology. , 2015, Trends in microbiology.

[45]  Michael A McDonough,et al.  The enzymes of β-lactam biosynthesis. , 2013, Natural product reports.

[46]  Trey Ideker,et al.  Cytoscape 2.8: new features for data integration and network visualization , 2010, Bioinform..

[47]  Kai Blin,et al.  antiSMASH 3.0—a comprehensive resource for the genome mining of biosynthetic gene clusters , 2015, Nucleic Acids Res..

[48]  Jörn Piel,et al.  Biosynthesis of polyketides by trans-AT polyketide synthases. , 2016, Natural product reports.

[49]  Torsten Seemann,et al.  Prokka: rapid prokaryotic genome annotation , 2014, Bioinform..

[50]  J. Zucko,et al.  ClustScan: an integrated program package for the semi-automatic annotation of modular biosynthetic gene clusters and in silico prediction of novel chemical structures , 2008, Nucleic acids research.

[51]  Oscar P. Kuipers,et al.  BAGEL: a web-based bacteriocin genome mining tool , 2006, Nucleic Acids Res..

[52]  B. Shen,et al.  Biosynthesis of hybrid peptide-polyketide natural products. , 2001, Current opinion in drug discovery & development.

[53]  Kira J Weissman,et al.  Genetic engineering of modular PKSs: from combinatorial biosynthesis to synthetic biology. , 2016, Natural product reports.

[54]  Hung‐wen Liu,et al.  Enoyl-CoA hydratase. reaction, mechanism, and inhibition. , 2003, Bioorganic & medicinal chemistry.

[55]  Peter Man-Un Ung,et al.  Automated genome mining for natural products , 2009, BMC Bioinformatics.

[56]  Ivan Erill,et al.  CollecTF: a database of experimentally validated transcription factor-binding sites in Bacteria , 2013, Nucleic Acids Res..

[57]  S. Salzberg,et al.  Improved microbial gene identification with GLIMMER. , 1999, Nucleic acids research.

[58]  R. Overbeek,et al.  Searching for patterns in genomic data. , 1997, Trends in genetics : TIG.

[59]  S. Lee,et al.  Metabolic engineering of antibiotic factories: new tools for antibiotic production in actinomycetes. , 2015, Trends in biotechnology.

[60]  Jon Clardy,et al.  Variable genetic architectures produce virtually identical molecules in bacterial symbionts of fungus-growing ants , 2015, Proceedings of the National Academy of Sciences.

[61]  Roger G. Linington,et al.  Insights into Secondary Metabolism from a Global Analysis of Prokaryotic Biosynthetic Gene Clusters , 2014, Cell.

[62]  Alessio Mengoni,et al.  CONTIGuator: a bacterial genomes finishing tool for structural insights on draft genomes , 2011, Source Code for Biology and Medicine.

[63]  Fuchao Xu,et al.  Type III polyketide synthases in natural product biosynthesis , 2012, IUBMB life.

[64]  C. Walsh,et al.  Tailoring enzymes that modify nonribosomal peptides during and after chain elongation on NRPS assembly lines. , 2001, Current opinion in chemical biology.

[65]  Kyle R. Conway,et al.  ClusterMine360: a database of microbial PKS/NRPS biosynthesis , 2012, Nucleic Acids Res..

[66]  Carla S. Jones,et al.  Minimum Information about a Biosynthetic Gene cluster. , 2015, Nature chemical biology.

[67]  Matthew Berriman,et al.  Artemis: an integrated platform for visualization and analysis of high-throughput sequence-based experimental data , 2011, Bioinform..

[68]  Rick L. Stevens,et al.  The RAST Server: Rapid Annotations using Subsystems Technology , 2008, BMC Genomics.

[69]  Michael A. Skinnider,et al.  Genomes to natural products PRediction Informatics for Secondary Metabolomes (PRISM) , 2015, Nucleic acids research.

[70]  R. Breitling,et al.  Detecting Sequence Homology at the Gene Cluster Level with MultiGeneBlast , 2013, Molecular biology and evolution.

[71]  T. Weber Antibiotics: Biosynthesis, Generation of Novel Compounds , 2010 .

[72]  P. Dorrestein,et al.  Biosynthetic multitasking facilitates thalassospiramide structural diversity in marine bacteria. , 2013, Journal of the American Chemical Society.

[73]  B. Wilkinson,et al.  Genes and enzymes involved in bacterial isoprenoid biosynthesis. , 2009, Current opinion in chemical biology.

[74]  V. Nizet,et al.  Salinipyrone and Pacificanone Are Biosynthetic By‐products of the Rosamicin Polyketide Synthase , 2015, Chembiochem : a European journal of chemical biology.

[75]  Raw sequence data and quality control. , 2015, Methods in molecular biology.

[76]  R. Giegerich,et al.  GenDB--an open source genome annotation system for prokaryote genomes. , 2003, Nucleic acids research.