NRPS-PKS: a knowledge-based resource for analysis of NRPS/PKS megasynthases

NRPS-PKS is web-based software for analysing large multi-enzymatic, multi-domain megasynthases that are involved in the biosynthesis of pharmaceutically important natural products such as cyclosporin, rifamycin and erythromycin. NRPS-PKS has been developed based on a comprehensive analysis of the sequence and structural features of several experimentally characterized biosynthetic gene clusters. The results of these analyses have been organized as four integrated searchable databases for elucidating domain organization and substrate specificity of nonribosomal peptide synthetases and three types of polyketide synthases. These databases work as the backend of NRPS-PKS and provide the knowledge base for predicting domain organization and substrate specificity of uncharacterized NRPS/PKS clusters. Benchmarking on a large set of biosynthetic gene clusters has demonstrated that, apart from correct identification of NRPS and PKS domains, NRPS-PKS can also predict specificities of adenylation and acyltransferase domains with reasonably high accuracy. These features of NRPS-PKS make it a valuable resource for identification of natural products biosynthesized by NRPS/PKS gene clusters found in newly sequenced genomes. The training and test sets of gene clusters included in NRPS-PKS correlate information on 307 open reading frames, 2223 functional protein domains, 68 starter/extender precursors and their specific recognition motifs, and also the chemical structure of 101 natural products from four different families. NRPS-PKS is a unique resource which provides a user-friendly interface for correlating chemical structures of natural products with the domains and modules in the corresponding nonribosomal peptide synthetases or polyketide synthases. It also provides guidelines for domain/module swapping as well as site-directed mutagenesis experiments to engineer biosynthesis of novel natural products. NRPS-PKS can be accessed at http://www.nii.res.in/nrps-pks.html.

[1]  P. Brick,et al.  Structural basis for the activation of phenylalanine in the non‐ribosomal biosynthesis of gramicidin S , 1997, The EMBO journal.

[2]  M. Marahiel,et al.  The bacitracin biosynthesis operon of Bacillus licheniformis ATCC 10716: molecular characterization of three multi-modular peptide synthetases. , 1997, Chemistry & biology.

[3]  Christopher T. Walsh,et al.  The structure of VibH represents nonribosomal peptide synthetase condensation, cyclization and epimerization domains , 2002, Nature Structural Biology.

[4]  C. Walsh,et al.  The parallel and convergent universes of polyketide synthases and nonribosomal peptide synthetases. , 1999, Chemistry & biology.

[5]  J R Jacobsen,et al.  Tolerance and specificity of polyketide synthases. , 1999, Annual review of biochemistry.

[6]  M. Marahiel,et al.  Ways of Assembling Complex Natural Products on Modular Nonribosomal Peptide Synthetases , 2002, Chembiochem : a European journal of chemical biology.

[7]  Rajesh S. Gokhale,et al.  Biochemistry of Polyketide Synthases , 2001 .

[8]  Gitanjali Yadav,et al.  SEARCHPKS: a program for detection and analysis of polyketide synthase domains , 2003, Nucleic Acids Res..

[9]  D. Hopwood,et al.  Genetic Contributions to Understanding Polyketide Synthases. , 1997, Chemical reviews.

[10]  Mohamed A. Marahiel,et al.  Modular Peptide Synthetases Involved in Nonribosomal Peptide Synthesis. , 1997, Chemical reviews.

[11]  Richard A. Dixon,et al.  Structure of chalcone synthase and the molecular basis of plant polyketide biosynthesis , 1999, Nature Structural Biology.

[12]  David C. Jones,et al.  GenTHREADER: an efficient and reliable protein fold recognition method for genomic sequences. , 1999, Journal of molecular biology.

[13]  B. Shen,et al.  Biosynthesis of hybrid peptide-polyketide natural products. , 2001, Current opinion in drug discovery & development.

[14]  T. Stachelhaus,et al.  The specificity-conferring code of adenylation domains in nonribosomal peptide synthetases. , 1999, Chemistry & biology.

[15]  Alex Bateman,et al.  The InterPro Database, 2003 brings increased coverage and new features , 2003, Nucleic Acids Res..

[16]  Joseph P Noel,et al.  The chalcone synthase superfamily of type III polyketide synthases. , 2003, Natural product reports.

[17]  David L. Wheeler,et al.  GenBank , 2015, Nucleic Acids Res..

[18]  G. Challis,et al.  Predictive, structure-based model of amino acid recognition by nonribosomal peptide synthetase adenylation domains. , 2000, Chemistry & biology.

[19]  Gitanjali Yadav,et al.  Computational approach for prediction of domain organization and substrate specificity of modular polyketide synthases. , 2003, Journal of molecular biology.

[20]  J. Staunton,et al.  Polyketide biosynthesis: a millennium review. , 2001, Natural product reports.

[21]  Mohamed A. Marahiel,et al.  Modular Peptide Synthetases Involved in Nonribosomal Peptide Synthesis , 1998 .

[22]  Gitanjali Yadav,et al.  A New Family of Type III Polyketide Synthases in Mycobacterium tuberculosis* , 2003, Journal of Biological Chemistry.

[23]  C. Khosla,et al.  Role of linkers in communication between protein modules. , 2000, Current opinion in chemical biology.

[24]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[25]  L. Katz,et al.  Novel macrolides through genetic engineering , 1999, Medicinal research reviews.

[26]  Benjamin A. Shoemaker,et al.  CDD: a database of conserved domain alignments with links to domain three-dimensional structure , 2002, Nucleic Acids Res..

[27]  C. Walsh,et al.  Harnessing the biosynthetic code: combinations, permutations, and mutations. , 1998, Science.

[28]  Z Dauter,et al.  The Escherichia coli Malonyl-CoA:Acyl Carrier Protein Transacylase at 1.5-Å Resolution. , 1995, The Journal of Biological Chemistry.

[29]  R. Kneusel,et al.  Molecular cloning and heterologous expression of acridone synthase from elicited Ruta graveolens L. cell suspension cultures , 1995, Plant Molecular Biology.