Transcriptome Analysis of Yellow Horn (Xanthoceras sorbifolia Bunge): A Potential Oil-Rich Seed Tree for Biodiesel in China

Background Yellow horn (Xanthoceras sorbifolia Bunge) is an oil-rich seed shrub that grows well in cold, barren environments and has great potential for biodiesel production in China. However, the limited genetic data means that little information about the key genes involved in oil biosynthesis is available, which limits further improvement of this species. In this study, we describe sequencing and de novo transcriptome assembly to produce the first comprehensive and integrated genomic resource for yellow horn and identify the pathways and key genes related to oil accumulation. In addition, potential molecular markers were identified and compiled. Methodology/Principal Findings Total RNA was isolated from 30 plants from two regions, including buds, leaves, flowers and seeds. Equal quantities of RNA from these tissues were pooled to construct a cDNA library for 454 pyrosequencing. A total of 1,147,624 high-quality reads with total and average lengths of 530.6 Mb and 462 bp, respectively, were generated. These reads were assembled into 51,867 unigenes, corresponding to a total of 36.1 Mb with a mean length, N50 and median of 696, 928 and 570 bp, respectively. Of the unigenes, 17,541 (33.82%) were unmatched in any public protein databases. We identified 281 unigenes that may be involved in de novo fatty acid (FA) and triacylglycerol (TAG) biosynthesis and metabolism. Furthermore, 6,707 SSRs, 16,925 SNPs and 6,201 InDels with high-confidence were also identified in this study. Conclusions This transcriptome represents a new functional genomics resource and a foundation for further studies on the metabolic engineering of yellow horn to increase oil content and modify oil composition. The potential molecular markers identified in this study provide a basis for polymorphism analysis of Xanthoceras, and even Sapindaceae; they will also accelerate the process of breeding new varieties with better agronomic characteristics.

[1]  Zeng-Yu Yao,et al.  Biodiesel production from Xanthoceras sorbifolia in China: Opportunities and challenges , 2013 .

[2]  J. Ohlrogge,et al.  Biochemical pathways in seed oil synthesis. , 2013, Current opinion in plant biology.

[3]  Jingwei Jiang,et al.  De novo Sequence Assembly and Characterization of Lycoris aurea Transcriptome Using GS FLX Titanium Platform of 454 Pyrosequencing , 2013, PloS one.

[4]  I. Nookaew,et al.  A comprehensive comparison of RNA-Seq-based transcriptome analysis from reads to differential gene expression and cross-comparison with microarrays: a case study in Saccharomyces cerevisiae , 2012, Nucleic acids research.

[5]  R. Togawa,et al.  Global transcriptome analysis of two wild relatives of peanut under drought and fungi infection , 2012, BMC Genomics.

[6]  Y. Zhang,et al.  De Novo Sequencing of Hypericum perforatum Transcriptome to Identify Potential Genes Involved in the Biosynthesis of Active Metabolites , 2012, PloS one.

[7]  Qi-xiang Sun,et al.  Forest biomass resources and utilization in China , 2012 .

[8]  X. Tao,et al.  Digital Gene Expression Analysis Based on Integrated De Novo Transcriptome Assembly of Sweet Potato [Ipomoea batatas (L.) Lam.] , 2012, PloS one.

[9]  Jin Wang,et al.  De novo assembly and Characterisation of the Transcriptome during seed development, and generation of genic-SSR markers in Peanut (Arachis hypogaea L.) , 2012, BMC Genomics.

[10]  John F. Karpovich,et al.  De novo transcriptome assembly and polymorphism detection in the flowering plant Silene vulgaris (Caryophyllaceae) , 2012, Molecular ecology resources.

[11]  Meng Luo,et al.  Biodiesel production from yellow horn (Xanthoceras sorbifolia Bunge.) seed oil using ion exchange resin as heterogeneous catalyst. , 2012, Bioresource technology.

[12]  E. Bornberg-Bauer,et al.  Evaluating Characteristics of De Novo Assembly Software on 454 Transcriptome Data: A Simulation Approach , 2012, PloS one.

[13]  N. Wang,et al.  De Novo Transcriptome of Safflower and the Identification of Putative Genes for Oleosin and the Biosynthesis of Flavonoids , 2012, PloS one.

[14]  C. Weekley,et al.  Assembly, Gene Annotation and Marker Development Using 454 Floral Transcriptome Sequences in Ziziphus Celata (Rhamnaceae), a Highly Endangered, Florida Endemic Plant , 2011, DNA research : an international journal for rapid publication of reports on genes and genomes.

[15]  Guodong Wang,et al.  Enhanced Seed Oil Production in Canola by Conditional Expression of Brassica napus LEAFY COTYLEDON1 and LEC1-LIKE in Developing Seeds1[W][OA] , 2011, Plant Physiology.

[16]  M. Parani,et al.  De novo assembly and transcriptome analysis of five major tissues of Jatropha curcas L. using GS FLX titanium platform of 454 pyrosequencing , 2011, BMC Genomics.

[17]  J. Ohlrogge,et al.  The seeds of green energy: Expanding the contribution of plant oils as biofuels , 2011 .

[18]  M. Chye,et al.  New roles for acyl-CoA-binding proteins (ACBPs) in plant development, stress responses and lipid metabolism. , 2011, Progress in lipid research.

[19]  Kyle Bibby,et al.  Transcriptome sequencing and annotation of the microalgae Dunaliella tertiolecta: Pathway description and gene discovery for production of next-generation biofuels , 2011, BMC Genomics.

[20]  J. Cañizares,et al.  Transcriptome characterization and high throughput SSRs and SNPs discovery in Cucurbita pepo (Cucurbitaceae) , 2011, BMC Genomics.

[21]  D. Neale,et al.  Forest tree genomics: growing resources and applications , 2011, Nature Reviews Genetics.

[22]  J. Tregear,et al.  Transcriptome analysis reveals differentially expressed genes associated with the mantled homeotic flowering abnormality in oil palm (Elaeis guineensis) , 2011, Tree Genetics & Genomes.

[23]  T. Efferth,et al.  Supercritical carbon dioxide extraction of seed oil from yellow horn (Xanthoceras sorbifolia Bunge.) and its anti-oxidant activity. , 2010, Bioresource technology.

[24]  I. Hara-Nishimura,et al.  Oil-body-membrane proteins and their physiological functions in plants. , 2010, Biological & pharmaceutical bulletin.

[25]  D. Dou,et al.  A new angeloylated triterpenoid saponin from the husks of Xanthoceras sorbifolia Bunge , 2010, Journal of Natural Medicines.

[26]  D. Qi,et al.  Major Energy Plants and Their Potential for Bioenergy Development in China , 2010, Environmental management.

[27]  T. Efferth,et al.  Rapid microwave-assisted transesterification of yellow horn oil to biodiesel using a heteropolyacid solid catalyst. , 2010, Bioresource technology.

[28]  Ying Gao,et al.  Bioinformatics Applications Note Sequence Analysis Cd-hit Suite: a Web Server for Clustering and Comparing Biological Sequences , 2022 .

[29]  Mark Johnston,et al.  Benchmarking next-generation transcriptome sequencing for functional and evolutionary genomics. , 2009, Molecular biology and evolution.

[30]  Alexandra To,et al.  Role of WRINKLED1 in the transcriptional regulation of glycolytic and fatty acid biosynthetic genes in Arabidopsis. , 2009, The Plant journal : for cell and molecular biology.

[31]  Z. Yao,et al.  Biosorption of Methylene Blue from Aqueous Solution Using a Bioenergy Forest Waste: Xanthoceras sorbifolia Seed Coat , 2009 .

[32]  Yan Long,et al.  Single nucleotide polymorphism (SNP) discovery in the polyploid Brassica napus using Solexa transcriptome sequencing. , 2009, Plant biotechnology journal.

[33]  M. P. Dorado,et al.  The Ideal Vegetable Oil-based Biodiesel Composition: A Review of Social, Economical and Technical Implications , 2009 .

[34]  Meng Luo,et al.  Rapid microwave-assisted transesterification for the preparation of fatty acid methyl esters from the oil of yellow horn (Xanthoceras sorbifolia Bunge.) , 2009 .

[35]  K. Gruys,et al.  Expression of Umbelopsis ramanniana DGAT2A in Seed Increases Oil in Soybean1[OA] , 2008, Plant Physiology.

[36]  Christoph Benning,et al.  Plant triacylglycerols as feedstocks for the production of biofuels. , 2008, The Plant journal : for cell and molecular biology.

[37]  Jung-Hua Wu,et al.  Analysis of biodiesel promotion in Taiwan , 2008 .

[38]  Shirong Zhang,et al.  A phenylalanine in DGAT is a key determinant of oil content and composition in maize , 2008, Nature Genetics.

[39]  Daemon Fairless Biofuel: The little shrub that could - maybe , 2007, Nature.

[40]  M. K. Maiti,et al.  Functional expression of an acyl carrier protein (ACP) from Azospirillum brasilense alters fatty acid profiles in Escherichia coli and Brassica juncea. , 2007, Plant physiology and biochemistry : PPB.

[41]  K. Lambert,et al.  Developmental control of Arabidopsis seed oil biosynthesis , 2007, Planta.

[42]  J. Shanklin,et al.  Modulating seed β-ketoacyl-acyl carrier protein synthase II level converts the composition of a temperate seed oil to that of a palm-like tropical oil , 2007, Proceedings of the National Academy of Sciences.

[43]  Matthew D. Smith,et al.  Characterization of a Plastid Triacylglycerol Lipase from Arabidopsis1[OA] , 2007, Plant Physiology.

[44]  M. P. Dorado,et al.  An approach to the economics of two vegetable oil-based biofuels in Spain , 2006 .

[45]  Manju M. Gupta,et al.  Mapping of the loci controlling oleic and linolenic acid contents and development of fad2 and fad3 allele-specific markers in canola (Brassica napus L.) , 2006, Theoretical and Applied Genetics.

[46]  Saranyan K. Palaniswamy,et al.  AGRIS and AtRegNet. A Platform to Link cis-Regulatory Elements and Transcription Factors into Regulatory Networks1[W][OA] , 2006, Plant Physiology.

[47]  Song Li,et al.  LUCY2: an interactive DNA sequence quality trimming and vector removal tool , 2004, Bioinform..

[48]  M. Hills Control of storage-product synthesis in seeds. , 2004, Current opinion in plant biology.

[49]  D. Pental,et al.  Molecular tagging of erucic acid trait in oilseed mustard (Brassica juncea) by QTL mapping and single nucleotide polymorphisms in FAE1 gene , 2004, Theoretical and Applied Genetics.

[50]  Y. Madoka,et al.  Chloroplast transformation with modified accD operon increases acetyl-CoA carboxylase and causes extension of leaf longevity and increase in seed yield in tobacco. , 2002, Plant & cell physiology.

[51]  J. Tzen,et al.  Steroleosin, a sterol-binding dehydrogenase in seed oil bodies. , 2002, Plant physiology.

[52]  J. Mullikin,et al.  SSAHA: a fast search method for large DNA databases. , 2001, Genome research.

[53]  J. Mundy,et al.  Oil bodies and their associated proteins, oleosin and caleosin. , 2001, Physiologia plantarum.

[54]  P. Covello,et al.  Seed-specific over-expression of an Arabidopsis cDNA encoding a diacylglycerol acyltransferase enhances seed oil content and seed weight. , 2001, Plant physiology.

[55]  A. Kinney,et al.  VARIATIONS IN THE BIOSYNTHESIS OF SEED-STORAGE LIPIDS. , 2001, Annual review of plant physiology and plant molecular biology.

[56]  P. Edwards,et al.  Overexpression of 3-ketoacyl-acyl-carrier protein synthase IIIs in plants reduces the rate of lipid synthesis. , 2001, Plant physiology.

[57]  A. Kumar,et al.  Enhancement of seed oil content by expression of glycerol-3-phosphate acyltransferase genes. , 2000, Biochemical Society transactions.

[58]  H. Nielsen,et al.  Caleosins: Ca2+-binding proteins associated with lipid bodies , 2000, Plant Molecular Biology.

[59]  E. Krebbers,et al.  Gene discovery and product development for grain quality traits. , 1999, Science.

[60]  D. Taylor,et al.  Modification of seed oil content and acyl composition in the brassicaceae by expression of a yeast sn-2 acyltransferase gene. , 1997, The Plant cell.

[61]  N. Færgeman,et al.  Role of long-chain fatty acyl-CoA esters in the regulation of metabolism and in cell signalling. , 1997, The Biochemical journal.

[62]  J. Ohlrogge,et al.  Targeting of the Arabidopsis Homomeric Acetyl-Coenzyme A Carboxylase to Plastids of Rapeseeds , 1997, Plant physiology.

[63]  N. Murata,et al.  Targeted mutagenesis of acyl‐lipid desaturases in Synechocystis: evidence for the important roles of polyunsaturated membrane lipids in growth, respiration and photosynthesis. , 1996, The EMBO journal.

[64]  N. Murata,et al.  Modes of Fatty-Acid Desaturation in Cyanobacteria , 1992 .

[65]  A. Slabas,et al.  The biochemistry and molecular biology of plant lipid biosynthesis , 1992, Plant Molecular Biology.

[66]  J. Jaworski,et al.  A Cerulenin Insensitive Short Chain 3-Ketoacyl-Acyl Carrier Protein Synthase in Spinacia oleracea Leaves. , 1989, Plant physiology.

[67]  Z. Ji Research advance of Xanthoceras sorbifolia Bunge oil , 2011 .

[68]  M. Metzker Sequencing technologies — the next generation , 2010, Nature Reviews Genetics.

[69]  Zhou Shao-ji Preparation of biodiesel from Xanthoceras sorblfolia Bunge seed oil , 2009 .

[70]  Chunzhao Liu,et al.  Biofuels in China: opportunities and challenges , 2009, In Vitro Cellular & Developmental Biology - Plant.

[71]  Y. Zu,et al.  Determination of Fatty Acid Methyl Esters in Biodiesel Produced from Yellow Horn Oil by LC , 2008 .

[72]  A. Huang,et al.  Oil bodies and oleosins in seeds , 1992 .

[73]  K. J. Harrington,et al.  Chemical and physical properties of vegetable oil esters and their effect on diesel fuel performance , 1986 .

[74]  P. Stumpf,et al.  Purification and characterization of beta-ketoacyl-ACP synthetase I from Spinacia oleracea leaves. , 1983, Archives of biochemistry and biophysics.

[75]  Linhai Wang,et al.  Characterization of the sesame (Sesamum indicum L.) global transcriptome using Illumina paired-end sequencing and development of EST-SSR markers , 2011, BMC genomics.