antiSMASH 7.0: new and improved predictions for detection, regulation, chemical structures and visualisation

Abstract Microorganisms produce small bioactive compounds as part of their secondary or specialised metabolism. Such metabolites often have antimicrobial, anticancer, antifungal, antiviral or other bioactivities and thus play an important role in medical and agricultural applications. In the past decade, genome mining has become a widely used method to explore, access, and analyse the available biodiversity of these compounds. Since 2011, the ‘antibiotics and secondary metabolite analysis shell’, antiSMASH (https://antismash.secondarymetabolites.org/), has supported researchers in their microbial genome mining tasks, both as a free-to-use web server and as a standalone tool under an OSI-approved open source licence. It is currently the most widely used tool for detecting and characterising biosynthetic gene clusters (BGCs) in archaea, bacteria, and fungi. Here, we present the updated version 7 of antiSMASH. antiSMASH 7 increases the number of supported cluster types from 71 to 81 and brings improvements in chemical structure prediction, enzymatic assembly-line visualisation and gene cluster regulation.
