De novo assembly of the pennycress (Thlaspi arvense) transcriptome provides tools for the development of a winter cover crop and biodiesel feedstock

Field pennycress (Thlaspi arvense L.) has potential as an oilseed crop that may be grown during fall (autumn) and winter months in the Midwestern United States and harvested in the early spring as a biodiesel feedstock. There has been little agronomic improvement in pennycress through traditional breeding. Recent advances in genomic technologies allow for the development of genomic tools to enable rapid improvements to be made through genomic assisted breeding. Here we report an annotated transcriptome assembly for pennycress. RNA was isolated from representative plant tissues, and 203 million unique Illumina RNA-seq reads were produced and used in the transcriptome assembly. The draft transcriptome assembly consists of 33 873 contigs with a mean length of 1242 bp. A global comparison of homology between the pennycress and Arabidopsis transcriptomes, along with four other Brassicaceae species, revealed a high level of global sequence conservation within the family. The final assembly was functionally annotated, allowing for the identification of putative genes controlling important agronomic traits such as flowering and glucosinolate metabolism. Identification of these genes leads to testable hypotheses concerning their conserved function and to rational strategies to improve agronomic properties in pennycress. Future work to characterize isoform variation between diverse pennycress lines and develop a draft genome sequence for pennycress will further direct trait improvement.

[1]  G. Mcintyre,et al.  THE BIOLOGY OF CANADIAN WEEDS: 9. Thlaspi arvense L. , 1975 .

[2]  K. Best Studies on the Flowering of Thlaspi arvense L. III. The Influence of Vernalization Under Natural and Controlled Conditions , 1976, Botanical Gazette.

[3]  K. Best Studies on the Flowering of Thlaspi arvense L. IV. Genetic and Ecological Differences between Early- and Late-Flowering Strains , 1978, Botanical Gazette.

[4]  R. Turkington,et al.  The biology of Canadian weeds , 1978 .

[5]  G. A. Mulligan The biology of Canadian weeds , 1979 .

[6]  M. Devine,et al.  The Influence of Temperature Upon Physiological Processes in Early-Flowering and Late-Flowering Strains of Thlaspi arvense L. , 1995, International Journal of Plant Sciences.

[7]  L. Mitich Field Pennycress (Thlaspi arvense L.)—The Stinkweed , 1996, Weed Technology.

[8]  Atle M. Bones,et al.  THE MYROSINASE-GLUCOSINOLATE SYSTEM, ITS ORGANISATION AND BIOCHEMISTRY , 1996 .

[9]  The Arabidopsis Genome Initiative Analysis of the genome sequence of the flowering plant Arabidopsis thaliana , 2000, Nature.

[10]  R. Amasino,et al.  Molecular analysis of FRIGIDA, a major determinant of natural variation in Arabidopsis flowering time. , 2000, Science.

[11]  E. Finnegan,et al.  The molecular basis of vernalization: the central role of FLOWERING LOCUS C (FLC). , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[12]  S. Henikoff,et al.  Targeting induced local lesions IN genomes (TILLING) for plant functional genomics. , 2000, Plant physiology.

[13]  D. W. Reeves,et al.  USING WINTER COVER CROPS TO IMPROVE SOIL AND WATER QUALITY , 2001 .

[14]  R. Amasino,et al.  Loss of FLOWERING LOCUS C Activity Eliminates the Late-Flowering Phenotype of FRIGIDA and Autonomous Pathway Mutations but Not Responsiveness to Vernalization , 2001, Plant Cell.

[15]  C. Helliwell,et al.  Control of flowering time by FLC orthologues in Brassica napus. , 2001, The Plant journal : for cell and molecular biology.

[16]  S. Warwick,et al.  The biology of Canadian weeds. 9. Thlaspi arvense L. (updated) , 2002 .

[17]  R. Amasino,et al.  Characterization and effects of the replicated flowering time gene FLC in Brassica rapa. , 2002, Genetics.

[18]  C. Dean,et al.  Arabidopsis, the Rosetta stone of flowering time? , 2002, Science.

[19]  C. Lister,et al.  Analysis of the Molecular Basis of Flowering Time Variation in Arabidopsis Accessions1[w] , 2003, Plant Physiology.

[20]  S. Vaughn,et al.  BIOFUMIGANT COMPOUNDS RELEASED BY FIELD PENNYCRESS (Thlaspi arvense) SEEDMEAL , 2005, Journal of Chemical Ecology.

[21]  Z. Chen,et al.  Evolution of genome size in Brassicaceae. , 2005, Annals of botany.

[22]  Juan Miguel García-Gómez,et al.  BIOINFORMATICS APPLICATIONS NOTE Sequence analysis Manipulation of FASTQ data with Galaxy , 2005 .

[23]  R. Amasino,et al.  Vernalization and flowering time. , 2005, Current opinion in biotechnology.

[24]  Scott M. Swinton,et al.  Evaluating Cover Crops for Benefits, Costs and Performance within Cropping System Niches , 2005 .

[25]  Helen C Bowen,et al.  A comparison of the Thlaspi caerulescens and Thlaspi arvense shoot transcriptomes. , 2006, The New phytologist.

[26]  S. Polasky,et al.  Environmental, economic, and energetic costs and benefits of biodiesel and ethanol biofuels. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[27]  I. Parkin,et al.  Exploiting the wild crucifer Thlaspi arvense to identify conserved and novel genes expressed during a plant’s response to cold stress , 2006, Plant Molecular Biology.

[28]  S. Polasky,et al.  Land Clearing and the Biofuel Carbon Debt , 2008, Science.

[29]  L. Kochian,et al.  Investigating heavy-metal hyperaccumulation using Thlaspi caerulescens as a model system. , 2008, Annals of botany.

[30]  G. Bonnema,et al.  A naturally occurring splicing site mutation in the Brassica rapa FLC1 gene is associated with variation in flowering time , 2009, Journal of experimental botany.

[31]  L. Lynd,et al.  Beneficial Biofuels—The Food, Energy, and Environment Trilemma , 2009, Science.

[32]  Christian Jung,et al.  Flowering time control and applications in plant breeding. , 2009, Trends in plant science.

[33]  Gerhard Knothe,et al.  Production and Evaluation of Biodiesel from Field Pennycress (Thlaspi arvense L.) Oil , 2009 .

[34]  S. Jackson,et al.  Next-generation sequencing technologies and their implications for crop genetics and breeding. , 2009, Trends in biotechnology.

[35]  Shailesh N. Shah,et al.  Composition and physical properties of cress (Lepidium sativum L.) and field pennycress (Thlaspi arvense L.) oils , 2009 .

[36]  R. Amasino,et al.  Vernalization: winter and the timing of flowering in plants. , 2009, Annual review of cell and developmental biology.

[37]  M. Burow,et al.  Glucosinolate Breakdown in Arabidopsis: Mechanism, Regulation and Biological Significance , 2010, The arabidopsis book.

[38]  M. Clements,et al.  Dated molecular phylogenies indicate a Miocene origin for Arabidopsis thaliana , 2010, Proceedings of the National Academy of Sciences.

[39]  Charles A. Mullen,et al.  Producing Stable Pyrolysis Liquids from the Oil-Seed Presscakes of Mustard Family Plants: Pennycress (Thlaspi arvense L.) and Camelina (Camelina sativa) , 2010 .

[40]  Cutoffs and k-mers: implications from a transcriptome study in allopolyploid plants , 2012, BMC Genomics.

[41]  Ilha Lee,et al.  The FRIGIDA Complex Activates Transcription of FLC, a Strong Flowering Repressor in Arabidopsis, by Recruiting Chromatin Modification Factors[C][W] , 2011, Plant Cell.

[42]  M. Burow,et al.  A thiocyanate-forming protein generates multiple products upon allylglucosinolate breakdown in Thlaspi arvense. , 2011, Phytochemistry.

[43]  C. Jung,et al.  Flowering time variation in oilseed rape (Brassica napus L.) is associated with allelic variation in the FRIGIDA homologue BnaA.FRI.a , 2011, Journal of experimental botany.

[44]  Richard M. Clark,et al.  The Arabidopsis lyrata genome sequence and the basis of rapid genome size change , 2011, Nature Genetics.

[45]  I. Szarejko,et al.  TILLING - a shortcut in functional genomics , 2011, Journal of Applied Genetics.

[46]  I. Al‐Shehbaz,et al.  Cabbage family affairs: the evolutionary history of Brassicaceae. , 2011, Trends in plant science.

[47]  J. Poulain,et al.  The genome of the mesopolyploid crop species Brassica rapa , 2011, Nature Genetics.

[48]  Xuan Li,et al.  Optimizing de novo transcriptome assembly from short-read RNA-Seq data: a comparative study , 2011, BMC Bioinformatics.

[49]  C. Lister,et al.  Functional alleles of the flowering time regulator FRIGIDA in the Brassica oleracea genome , 2012, BMC Plant Biology.

[50]  Tanya Z. Berardini,et al.  The Arabidopsis Information Resource (TAIR): improved gene annotation and new tools , 2011, Nucleic Acids Res..

[51]  R. Hedrich,et al.  NRT/PTR transporters are essential for translocation of glucosinolate defence compounds to seeds , 2012, Nature.

[52]  W. Phippen,et al.  Soybean Seed Yield and Quality as a Response to Field Pennycress Residue , 2012 .

[53]  T. Isbell,et al.  Extraction of pennycress (Thlaspi arvense L.) seed oil by full pressing. , 2012 .

[54]  Tom N. Kalnes,et al.  A life cycle assessment of pennycress (Thlaspi arvense L.) -derived jet fuel and diesel , 2013 .

[55]  Mathieu Blanchette,et al.  The Capsella rubella genome and the genomic consequences of rapid mating system evolution , 2013, Nature Genetics.

[56]  Ian A. Waitz,et al.  Market Cost of Renewable Jet Fuel Adoption in the United States , 2013 .

[57]  C. Buell,et al.  Bioinformatics challenges in de novo transcriptome assembly using short read sequences in the absence of a reference genome sequence. , 2013, Natural product reports.

[58]  T. Isbell,et al.  Extraction of proteins from pennycress seeds and press cake , 2013 .

[59]  T. Isbell,et al.  Effects of cold-pressing and seed cooking on functional properties of protein in pennycress (Thlaspi arvense L.) seed and press cakes. , 2013 .

[60]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .