Phylogenetic Analysis of Polyketide Synthase I Domains from Soil Metagenomic Libraries Allows Selection of Promising Clones

ABSTRACT The metagenomic approach provides direct access to diverse, unexplored genomes, especially those of uncultivated bacteria in a given environment. This diversity may conceal many new biosynthetic pathways. Type I polyketide synthases (PKSI) are modular enzymes involved in the biosynthesis of many natural products of industrial interest. Among the PKSI domains, the ketosynthase (KS) domain was used to screen a large soil metagenomic library containing more than 100,000 clones in order to detect clones carrying PKS genes. Over 60,000 clones were screened, and 139 clones containing KS domains were detected. A 700-bp fragment of the KS domain was sequenced for 40 clones chosen at random from these 139. None of the 40 protein sequences was identical to any sequence found in public databases, and the nucleic acid sequences were not redundant. Phylogenetic analyses were performed on the protein sequences of three metagenomic clones to select those that can be predicted to produce new compounds. Two PKS-positive clones do not belong to any of the 23 published PKSI included in the analysis, which encourages further analysis of these two clones identified by the selection process.
