Investigation of terpene diversification across multiple sequenced plant genomes

Significance The terpenes are the largest class of plant natural products. This major class of compounds represents tremendous chemical diversity of which only a relatively small fraction has so far been accessed and used by industry. The primary drivers of terpene diversification are terpenoid synthases and cytochromes P450, which synthesize and modify terpene scaffolds. Here, focusing on these two gene families, we investigate terpene synthesis and evolution across 17 sequenced plant genomes. Our analyses shed light on the roots of terpene biosynthesis and diversification in plants. They also reveal that different genomic mechanisms of pathway assembly predominate in eudicots and monocots. Plants produce an array of specialized metabolites, including chemicals that are important as medicines, flavors, fragrances, pigments and insecticides. The vast majority of this metabolic diversity is untapped. Here we take a systematic approach toward dissecting genetic components of plant specialized metabolism. Focusing on the terpenes, the largest class of plant natural products, we investigate the basis of terpene diversity through analysis of multiple sequenced plant genomes. The primary drivers of terpene diversification are terpenoid synthase (TS) “signature” enzymes (which generate scaffold diversity), and cytochromes P450 (CYPs), which modify and further diversify these scaffolds, so paving the way for further downstream modifications. Our systematic search of sequenced plant genomes for all TS and CYP genes reveals that distinct TS/CYP gene pairs are found together far more commonly than would be expected by chance, and that certain TS/CYP pairings predominate, providing signals for key events that are likely to have shaped terpene diversity. We recover TS/CYP gene pairs for previously characterized terpene metabolic gene clusters and demonstrate new functional pairing of TSs and CYPs within previously uncharacterized clusters. Unexpectedly, we find evidence for different mechanisms of pathway assembly in eudicots and monocots; in the former, microsyntenic blocks of TS/CYP gene pairs duplicate and provide templates for the evolution of new pathways, whereas in the latter, new pathways arise by mixing and matching of individual TS and CYP genes through dynamic genome rearrangements. This is, to our knowledge, the first documented observation of the unique pattern of TS and CYP assembly in eudicots and monocots.

[1]  Manuela Herman,et al.  Biochemistry And Molecular Biology Of Plants , 2016 .

[2]  R. Peters,et al.  The application of synthetic biology to elucidation of plant mono-, sesqui-, and diterpenoid metabolism. , 2014, Molecular plant.

[3]  Jiachen Zi,et al.  Biosynthesis of Lycosantalonol, a cis-Prenyl Derived Diterpenoid , 2014, Journal of the American Chemical Society.

[4]  I. Graham,et al.  Production of Bioactive Diterpenoids in the Euphorbiaceae Depends on Evolutionarily Conserved Gene Clusters[C][W][OPEN] , 2014, Plant Cell.

[5]  A. Osbourn,et al.  Gene clustering in plant specialized metabolism. , 2014, Current opinion in biotechnology.

[6]  M. Van Montagu,et al.  Combinatorial biosynthesis of sapogenins and saponins in Saccharomyces cerevisiae using a C-16α hydroxylase from Bupleurum falcatum , 2014, Proceedings of the National Academy of Sciences.

[7]  T. Larson,et al.  Production of Bioactive Diterpenoids in the Euphorbiaceae Depends on Evolutionarily Conserved Gene ClustersC , 2014 .

[8]  A. Warzybok,et al.  Reliable Reference Genes for Normalization of Gene Expression in Cucumber Grown under Different Nitrogen Nutrition , 2013, PloS one.

[9]  A. Aharoni,et al.  Biosynthesis of Antinutritional Alkaloids in Solanaceous Crops Is Mediated by Clustered Genes , 2013, Science.

[10]  Anthony M. Bolger,et al.  Evolution of a Complex Locus for Terpene Biosynthesis in Solanum[W][OPEN] , 2013, Plant Cell.

[11]  Douglas G. Scofield,et al.  The Norway spruce genome sequence and conifer genome evolution , 2013, Nature.

[12]  J. Bohlmann,et al.  Gene Discovery of Modular Diterpene Metabolism in Nonmodel Systems1[W][OA] , 2013, Plant Physiology.

[13]  M. Kolesnikova,et al.  An effective strategy for exploring unknown metabolic pathways by genome mining. , 2013, Journal of the American Chemical Society.

[14]  A. Osbourn,et al.  Modularity of Plant Metabolic Gene Clusters: A Trio of Linked Genes That Are Collectively Required for Acylation of Triterpenes in Oat[W][OA] , 2013, Plant Cell.

[15]  B. Hamberger,et al.  Plant P450s as versatile drivers for evolution of species-specific chemical diversity , 2013, Philosophical Transactions of the Royal Society B: Biological Sciences.

[16]  J. Bohlmann,et al.  Evolution of Conifer Diterpene Synthases: Diterpene Resin Acid Biosynthesis in Lodgepole Pine and Jack Pine Involves Monofunctional and Bifunctional Diterpene Synthases1[W][OA] , 2012, Plant Physiology.

[17]  R. Peters,et al.  Terpenoid synthase structures: a so far incomplete view of complex catalysis. , 2012, Natural product reports.

[18]  A. Takos,et al.  Why biosynthetic genes for chemical defense compounds cluster. , 2012, Trends in plant science.

[19]  J. Noel,et al.  The Rise of Chemodiversity in Plants , 2012, Science.

[20]  V. De Luca,et al.  Mining the Biodiversity of Plants: A Revolution in the Making , 2012, Science.

[21]  M. Mizutani Impacts of diversification of cytochrome P450 on plant metabolism. , 2012, Biological & pharmaceutical bulletin.

[22]  Jian Wang,et al.  Genome sequence of foxtail millet (Setaria italica) provides insights into grass evolution and biofuel potential , 2012, Nature Biotechnology.

[23]  Daniel W. A. Buchan,et al.  The tomato genome sequence provides insights into fleshy fruit evolution , 2012, Nature.

[24]  J. Silberg,et al.  A transposase strategy for creating libraries of circularly permuted proteins , 2012, Nucleic acids research.

[25]  Yisheng Wu,et al.  CYP701A8: A Rice ent-Kaurene Oxidase Paralog Diverted to More Specialized Diterpenoid Metabolism1[W][OA] , 2012, Plant Physiology.

[26]  The UniProt Consortium,et al.  Reorganizing the protein space at the Universal Protein Resource (UniProt) , 2011, Nucleic Acids Res..

[27]  Alvaro J. González,et al.  The Medicago Genome Provides Insight into the Evolution of Rhizobial Symbioses , 2011, Nature.

[28]  J. Poulain,et al.  The genome of the mesopolyploid crop species Brassica rapa , 2011, Nature Genetics.

[29]  Hadi Quesneville,et al.  Formation of plant metabolic gene clusters within dynamic chromosomal regions , 2011, Proceedings of the National Academy of Sciences.

[30]  David M. A. Martin,et al.  Genome sequence and analysis of the tuber crop potato , 2011, Nature.

[31]  Graham J. Etherington,et al.  Investigation of the potential for triterpene synthesis in rice through genome mining and metabolic engineering. , 2011, The New phytologist.

[32]  Michael S. Barker,et al.  The Selaginella Genome Identifies Genetic Changes Associated with the Evolution of Vascular Plants , 2011, Science.

[33]  D. Nelson,et al.  A P450-centric view of plant evolution. , 2011, The Plant journal : for cell and molecular biology.

[34]  J. Bohlmann,et al.  The family of terpene synthases in plants: a mid-size family of genes for specialized metabolism that is highly diversified throughout the kingdom. , 2011, The Plant journal : for cell and molecular biology.

[35]  Anne Osbourn,et al.  From hormones to secondary metabolism: the emergence of metabolic gene clusters in plants. , 2011, The Plant journal : for cell and molecular biology.

[36]  C. Gatz,et al.  Herbivore-induced and floral homoterpene volatiles are biosynthesized by a single P450 enzyme (CYP82G1) in Arabidopsis , 2010, Proceedings of the National Academy of Sciences.

[37]  Anne Osbourn,et al.  Secondary metabolic gene clusters: evolutionary toolkits for chemical innovation. , 2010, Trends in genetics : TIG.

[38]  B. Haas,et al.  Draft genome sequence of the oilseed species Ricinus communis , 2010, Nature Biotechnology.

[39]  J. Keasling,et al.  Cloning of casbene and neocembrene synthases from Euphorbiaceae plants and expression in Saccharomyces cerevisiae. , 2010, Phytochemistry.

[40]  D. Ober,et al.  Gene duplications and the time thereafter - examples from plant secondary metabolism. , 2010, Plant biology.

[41]  Sai Guna Ranjan Gurazada,et al.  Genome sequencing and analysis of the model grass Brachypodium distachyon , 2010, Nature.

[42]  T. Sakurai,et al.  Genome sequence of the palaeopolyploid soybean , 2010, Nature.

[43]  P. Shaw,et al.  Cell Type–Specific Chromatin Decondensation of a Metabolic Gene Cluster in Oats C W OA , 2010 .

[44]  Xiao-Dong Luo,et al.  Three Terpenoids and a Tocopherol-Related Compound from Ricinus communis , 2009 .

[45]  Asan,et al.  The genome of the cucumber, Cucumis sativus L. , 2009, Nature Genetics.

[46]  Dawn H. Nagel,et al.  The B73 Maize Genome: Complexity, Diversity, and Dynamics , 2009, Science.

[47]  Kazuki Saito,et al.  Functional genomics for plant natural product biosynthesis. , 2009, Natural product reports.

[48]  Dana J Morrone,et al.  CYP76M7 Is an ent-Cassadiene C11α-Hydroxylase Defining a Second Multifunctional Diterpenoid Biosynthetic Gene Cluster in Rice[W][OA] , 2009, The Plant Cell Online.

[49]  D. Nelson The Cytochrome P450 Homepage , 2009, Human Genomics.

[50]  Dana J Morrone,et al.  Increasing diterpene yield with a modular metabolic engineering system in E. coli: comparison of MEV and MEP isoprenoid precursor pathway engineering , 2009, Applied Microbiology and Biotechnology.

[51]  W. K. Wilson,et al.  Product profile of PEN3: the last unexamined oxidosqualene cyclase in Arabidopsis thaliana. , 2009, Organic letters.

[52]  Mihaela M. Martis,et al.  The Sorghum bicolor genome and the diversification of grasses , 2009, Nature.

[53]  A. Osbourn,et al.  Metabolic Diversification—Independent Assembly of Operon-Like Gene Clusters in Different Plants , 2008, Science.

[54]  N. Provart,et al.  An extensive (co-)expression analysis tool for the cytochrome P450 superfamily in Arabidopsis thaliana , 2008, BMC Plant Biology.

[55]  J. Gershenzon,et al.  Identification and Regulation of TPS04/GES, an Arabidopsis Geranyllinalool Synthase Catalyzing the First Step in the Formation of the Insect-Induced Volatile C16-Homoterpene TMTT[W] , 2008, The Plant Cell Online.

[56]  D. Christianson Unearthing the roots of the terpenome. , 2008, Current opinion in chemical biology.

[57]  J. Bohlmann,et al.  Terpenoid biomaterials. , 2008, The Plant journal : for cell and molecular biology.

[58]  Kazunori Okada,et al.  Identification of a Biosynthetic Gene Cluster in Rice for Momilactones* , 2007, Journal of Biological Chemistry.

[59]  J. Poulain,et al.  The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla , 2007, Nature.

[60]  J. Gershenzon,et al.  The function of terpene natural products in the natural world. , 2007, Nature chemical biology.

[61]  D. Hoffmeister,et al.  Natural products of filamentous fungi: enzymes, genes, and their regulation. , 2007, Natural product reports.

[62]  A. Osbourn,et al.  A different function for a member of an ancient and highly conserved cytochrome P450 family: From essential sterols to plant defense , 2006, Proceedings of the National Academy of Sciences.

[63]  M. Gribskov,et al.  The Genome of Black Cottonwood, Populus trichocarpa (Torr. & Gray) , 2006, Science.

[64]  J. Bohlmann,et al.  Genes, enzymes and chemicals of terpenoid diversity in the constitutive and induced defence of conifers against insects and pathogens. , 2006, The New phytologist.

[65]  M. Stitt,et al.  Genome-Wide Identification and Testing of Superior Reference Genes for Transcript Normalization in Arabidopsis1[w] , 2005, Plant Physiology.

[66]  Takuji Sasaki,et al.  The map-based sequence of the rice genome , 2005, Nature.

[67]  李佩芳 International Rice Genome Sequencing Project. 2005. The map-based sequence of the rice genome. , 2005 .

[68]  Y. Ebizuka,et al.  Cucurbitadienol synthase, the first committed enzyme for cucurbitacin biosynthesis, is a distinct enzyme from cycloartenol synthase for phytosterol biosynthesis , 2004 .

[69]  Reuben J. Peters,et al.  Identification of Syn-Pimara-7,15-Diene Synthase Reveals Functional Clustering of Terpene Synthases Involved in Rice Phytoalexin/Allelochemical Biosynthesis1 , 2004, Plant Physiology.

[70]  A. Osbourn,et al.  A gene cluster for secondary metabolism in oat: implications for the evolution of metabolic diversity in plants. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[71]  S. Aubourg,et al.  Genomic analysis of the terpenoid synthase (AtTPS) gene family of Arabidopsis thaliana , 2002, Molecular Genetics and Genomics.

[72]  J. Chappell The genetics and molecular genetics of terpene and sterol origami. , 2002, Current opinion in plant biology.

[73]  J. Memelink,et al.  Geraniol 10‐hydroxylase1, a cytochrome P450 enzyme involved in terpenoid indole alkaloid biosynthesis , 2001, FEBS letters.

[74]  J. Chappell,et al.  Cloning, heterologous expression, and functional characterization of 5-epi-aristolochene-1,3-dihydroxylase from tobacco (Nicotiana tabacum). , 2001, Archives of biochemistry and biophysics.

[75]  R. Croteau,et al.  Genomic organization of plant terpene synthases and molecular evolutionary implications. , 2001, Genetics.

[76]  John H. Loughrin,et al.  Suppression of a P450 hydroxylase gene in plant trichome glands enhances natural-product-based aphid resistance , 2001, Nature Biotechnology.

[77]  The Arabidopsis Genome Initiative Analysis of the genome sequence of the flowering plant Arabidopsis thaliana , 2000, Nature.

[78]  R. Croteau,et al.  Regiospecific cytochrome P450 limonene hydroxylases from mint (Mentha) species: cDNA isolation, characterization, and functional expression of (-)-4S-limonene-3-hydroxylase and (-)-4S-limonene-6-hydroxylase. , 1999, Archives of biochemistry and biophysics.

[79]  Benveniste,et al.  Cytochrome P450 , 1993, Handbook of Experimental Pharmacology.

[80]  Donald Ervin Knuth,et al.  The Art of Computer Programming, Volume II: Seminumerical Algorithms , 1970 .

[81]  T. Halsall,et al.  Natural Products , 1960, Nature.