Identification of enzymatic and regulatory genes of plant metabolism through QTL analysis in Arabidopsis.

The biochemical diversity in the plant kingdom is estimated to well exceed 100,000 distinct compounds (Weckwerth, 2003) and 4000 to 20,000 metabolites per species seem likely (Fernie et al., 2004). In recent years extensive progress has been made towards the identification of enzymes and regulatory genes working in a complex network to generate this large arsenal of metabolites. Genetic loci influencing quantitative traits, e.g. metabolites or biomass, may be mapped to associated molecular markers, a method called quantitative trait locus mapping (QTL mapping), which may facilitate the identification of novel genes in biochemical pathways. Arabidopsis thaliana, as a model organism for seed plants, is a suitable target for metabolic QTL (mQTL) studies due to the availability of highly developed molecular and genetic tools, and the extensive knowledge accumulated on the metabolite profile. While intensely studied, in particular since the availability of its complete sequence, the genome of Arabidopsis still comprises a large proportion of genes with only tentative function based on sequence homology. From a total number of 33,518 genes currently listed (TAIR 9, http://www.arabidopsis.org), only about 25% have direct experimental evidence for their molecular function and biological process, while for more than 30% no biological data are available. Modern metabolomics approaches together with continually extended genomic resources will facilitate the task of assigning functions to those genes. In our previous study we reported on the identification of mQTL (Lisec et al., 2008). In this paper, we summarize the current status of mQTL analyses and causal gene identification in Arabidopsis and present evidence that a candidate gene located within the confidence interval of a fumarate mQTL (AT5G50950) encoding a putative fumarase is likely to be the causal gene of this QTL. The total number of genes molecularly identified based on mQTL studies is still limited, but the advent of multi-parallel analysis techniques for measurement of gene expression, as well as protein and metabolite abundances and for rapid gene identification will assist in the important task of assigning enzymes and regulatory genes to the growing network of known metabolic reactions.

[1]  D. Oliver,et al.  Biochemical and molecular characterization of fumarase from plants: purification and characterization of the enzyme--cloning, sequencing, and expression of the gene. , 1997, Archives of biochemistry and biophysics.

[2]  Members of the Complex Trait Consortium The nature and identification of quantitative trait loci: a community's view , 2003 .

[3]  Wenxu Zhou,et al.  Arabidopsis has a cytosolic fumarase required for the massive allocation of photosynthate into fumaric acid and for rapid plant growth on high nitrogen. , 2010, The Plant journal : for cell and molecular biology.

[4]  R W Doerge,et al.  Genomic Survey of Gene Expression Diversity in Arabidopsis thaliana , 2006, Genetics.

[5]  E. Buckler,et al.  Genetic association mapping and genome organization of maize. , 2006, Current opinion in biotechnology.

[6]  Jingyuan Fu,et al.  System-wide molecular evidence for phenotypic buffering in Arabidopsis , 2009, Nature Genetics.

[7]  R. Mott,et al.  The 1001 Genomes Project for Arabidopsis thaliana , 2009, Genome Biology.

[8]  R. Last,et al.  Arabidopsis Map-Based Cloning in the Post-Genome Era , 2002, Plant Physiology.

[9]  Detlef Weigel,et al.  Genome-wide patterns of single-feature polymorphism in Arabidopsis thaliana , 2007, Proceedings of the National Academy of Sciences.

[10]  Jingyuan Fu,et al.  The genetics of plant metabolism , 2006, Nature Genetics.

[11]  Mark P. Styczynski,et al.  Systematic identification of conserved metabolites in GC/MS data for metabolomics and biomarker discovery. , 2007, Analytical chemistry.

[12]  Lieven Sterck,et al.  Genetical metabolomics of flavonoid biosynthesis in Populus: a case study. , 2006, The Plant journal : for cell and molecular biology.

[13]  J. Selbig,et al.  Metabolite profile analysis: from raw data to regression and classification. , 2007, Physiologia plantarum.

[14]  Jingyuan Fu,et al.  Regulatory network construction in Arabidopsis by using genome-wide gene expression quantitative trait loci , 2007, Proceedings of the National Academy of Sciences.

[15]  R. Lister,et al.  Next is now: new technologies for sequencing of genomes, transcriptomes, and beyond. , 2009, Current opinion in plant biology.

[16]  Mattias Jakobsson,et al.  The Pattern of Polymorphism in Arabidopsis thaliana , 2005, PLoS biology.

[17]  Ulrich Broeckel,et al.  Genotyping platforms for mass-throughput genotyping with SNPs, including human genome-wide scans. , 2008, Advances in genetics.

[18]  J. Gershenzon,et al.  Comparative quantitative trait loci mapping of aliphatic, indolic and benzylic glucosinolate production in Arabidopsis thaliana leaves and seeds. , 2001, Genetics.

[19]  Detlef Weigel,et al.  SHOREmap: simultaneous mapping and mutation identification by deep sequencing , 2009, Nature Methods.

[20]  J. Witte,et al.  Genetic dissection of complex traits. , 1994, Nature genetics.

[21]  Bjarne Gram Hansen,et al.  Biochemical Networks and Epistasis Shape the Arabidopsis thaliana Metabolome[W] , 2008, The Plant Cell Online.

[22]  U. Roessner,et al.  Comprehensive metabolic profiling and phenotyping of interspecific introgression lines for tomato improvement , 2006, Nature Biotechnology.

[23]  Joachim Selbig,et al.  The metabolic signature related to high plant growth rate in Arabidopsis thaliana , 2007, Proceedings of the National Academy of Sciences.

[24]  Thomas Altmann,et al.  SNP identification in crop plants. , 2009, Current opinion in plant biology.

[25]  J. Nap,et al.  Genetical genomics : the added value from segregation , 2001 .

[26]  Alexander Erban,et al.  TagFinder for the quantitative analysis of gas chromatography - mass spectrometry (GC-MS)-based metabolite profiling experiments , 2008, Bioinform..

[27]  S. Salvi,et al.  To clone or not to clone plant QTLs: present and future challenges. , 2005, Trends in plant science.

[28]  J. Keurentjes,et al.  Untargeted large-scale plant metabolomics using liquid chromatography coupled to mass spectrometry , 2007, Nature Protocols.

[29]  J. Witte,et al.  Genetic dissection of complex traits , 1996, Nature Genetics.

[30]  Bjarne Gram Hansen,et al.  Subclade of Flavin-Monooxygenases Involved in Aliphatic Glucosinolate Biosynthesis1[W] , 2008, Plant Physiology.

[31]  G. Coupland,et al.  Analysis of natural allelic variation at flowering time loci in the Landsberg erecta and Cape Verde Islands ecotypes of Arabidopsis thaliana. , 1998, Genetics.

[32]  R. Amasino,et al.  Molecular analysis of FRIGIDA, a major determinant of natural variation in Arabidopsis flowering time. , 2000, Science.

[33]  Robert W. Williams,et al.  The nature and identification of quantitative trait loci: a community's view , 2003, Nature Reviews Genetics.

[34]  Muhammad Ali Amer,et al.  Genome-wide association study of 107 phenotypes in a common set of Arabidopsis thaliana inbred lines , 2010, Nature.

[35]  Thomas Mitchell-Olds,et al.  Evolutionary dynamics of an Arabidopsis insect resistance quantitative trait locus , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[36]  Joachim Selbig,et al.  Identification of heterotic metabolite QTL in Arabidopsis thaliana RIL and IL populations. , 2009, The Plant journal : for cell and molecular biology.

[37]  Martin Scholz,et al.  Setup and Annotation of Metabolomic Experiments by Integrating Biological and Mass Spectrometric Metadata , 2005, DILS.

[38]  D. Kliebenstein A quantitative genetics and ecological model system: understanding the aliphatic glucosinolate biosynthetic network via QTLs , 2008, Phytochemistry Reviews.

[39]  B. Burr,et al.  Gene mapping with recombinant inbreds in maize. , 1988, Genetics.

[40]  D. Kliebenstein,et al.  Identifying the molecular basis of QTLs: eQTLs add a new dimension. , 2008, Trends in plant science.

[41]  Richard M. Clark,et al.  Common Sequence Polymorphisms Shaping Genetic Diversity in Arabidopsis thaliana , 2007, Science.

[42]  C. Nusbaum,et al.  Large-scale identification, mapping, and genotyping of single-nucleotide polymorphisms in the human genome. , 1998, Science.

[43]  J. Keurentjes Genetical metabolomics: closing in on phenotypes. , 2009, Current opinion in plant biology.

[44]  D. Kliebenstein Advancing Genetic Theory and Application by Metabolic Quantitative Trait Loci Analysis , 2009, The Plant Cell Online.

[45]  Z. Lippman,et al.  An integrated view of quantitative trait variation using tomato interspecific introgression lines. , 2007, Current opinion in genetics & development.

[46]  B. Mueller‐Roeber,et al.  A quantitative RT-PCR platform for high-throughput expression profiling of 2500 rice transcription factors , 2007, Plant Methods.

[47]  J. Tokuhisa,et al.  Benzoic acid glucosinolate esters and other glucosinolates from Arabidopsis thaliana. , 2002, Phytochemistry.

[48]  Henning Redestig,et al.  TargetSearch - a Bioconductor package for the efficient preprocessing of GC-MS metabolite profiling data , 2009, BMC Bioinformatics.

[49]  S. Pinson,et al.  Identification of quantitative trait loci (QTLs) for heading date and plant height in cultivated rice (Oryza sativa L.) , 1995, Theoretical and Applied Genetics.

[50]  Thomas Mitchell-Olds,et al.  Comparative analysis of quantitative trait loci controlling glucosinolates, myrosinase and insect resistance in Arabidopsis thaliana. , 2002, Genetics.

[51]  B. Berger,et al.  The R2R3-MYB transcription factor HAG1/MYB28 is a regulator of methionine-derived glucosinolate biosynthesis in Arabidopsis thaliana. , 2007, The Plant journal : for cell and molecular biology.

[52]  Richard M. Clark,et al.  Sequencing of natural strains of Arabidopsis thaliana with short reads. , 2008, Genome research.

[53]  P. Oefner,et al.  The extent of linkage disequilibrium in Arabidopsis thaliana , 2002, Nature Genetics.

[54]  A. Fernie,et al.  Gas chromatography mass spectrometry–based metabolite profiling in plants , 2006, Nature Protocols.

[55]  D. Kliebenstein,et al.  Understanding the Evolution of Defense Metabolites in Arabidopsis thaliana Using Genome-wide Association Mapping , 2010, Genetics.

[56]  M. Koornneef,et al.  A mixed model QTL analysis for a complex cross population consisting of a half diallel of two-way hybrids in Arabidopsis thaliana: analysis of simulated data , 2008, Euphytica.

[57]  A. Fernie,et al.  Metabolomics-assisted breeding: a viable option for crop improvement? , 2009, Trends in genetics : TIG.

[58]  M. Koornneef,et al.  Cloning of DOG1, a quantitative trait locus controlling seed dormancy in Arabidopsis , 2006, Proceedings of the National Academy of Sciences.

[59]  R. Meyer,et al.  Construction and analysis of 2 reciprocal Arabidopsis introgression line populations. , 2008, The Journal of heredity.

[60]  W. Weckwerth Metabolomics in systems biology. , 2003, Annual review of plant biology.

[61]  W. Weckwerth,et al.  Metabolite profiling in plant biology: platforms and destinations , 2004, Genome Biology.

[62]  H. Piepho,et al.  Genetic Basis of Heterosis for Growth-Related Traits in Arabidopsis Investigated by Testcross Progenies of Near-Isogenic Lines Reveals a Significant Role of Epistasis , 2007, Genetics.

[63]  R. Meyer,et al.  Segregation distortion in Arabidopsis C24/Col-0 and Col-0/C24 recombinant inbred line populations is due to reduced fertility caused by epistatic interaction of two loci , 2006, Theoretical and Applied Genetics.

[64]  D. Kliebenstein,et al.  A Systems Biology Approach Identifies a R2R3 MYB Gene Subfamily with Distinct and Overlapping Functions in Regulation of Aliphatic Glucosinolates , 2007, PloS one.

[65]  Detlef Weigel,et al.  Recombination and linkage disequilibrium in Arabidopsis thaliana , 2007, Nature Genetics.

[66]  Joachim Selbig,et al.  Identification of metabolic and biomass QTL in Arabidopsis thaliana in a parallel analysis of RIL and IL populations , 2007, The Plant journal : for cell and molecular biology.

[67]  R. Stoughton,et al.  Genetics of gene expression surveyed in maize, mouse and man , 2003, Nature.

[68]  D. Weigel,et al.  Identification of a Spontaneous Frame Shift Mutation in a Nonreference Arabidopsis Accession Using Whole Genome Sequencing1 , 2010, Plant Physiology.

[69]  A. Fernie,et al.  Metabolite profiling: from diagnostics to systems biology , 2004, Nature Reviews Molecular Cell Biology.

[70]  Bjarni J. Vilhjálmsson,et al.  Genome-wide association study of 107 phenotypes in Arabidopsis thaliana inbred lines , 2010 .

[71]  Daniel J. Kliebenstein,et al.  Linking Metabolic QTLs with Network and cis-eQTLs Controlling Biosynthetic Pathways , 2007, PLoS genetics.

[72]  D. Zamir,et al.  An introgression line population of Lycopersicon pennellii in the cultivated tomato enables the identification and fine mapping of yield-associated QTL. , 1995, Genetics.

[73]  Leonid Kruglyak,et al.  The road to genome-wide association studies , 2008, Nature Reviews Genetics.

[74]  E. Fridman,et al.  A recombination hotspot delimits a wild-species quantitative trait locus for tomato sugar content to 484 bp within an invertase gene. , 2000, Proceedings of the National Academy of Sciences of the United States of America.