A draft genome of field pennycress (Thlaspi arvense) provides tools for the domestication of a new winter biofuel crop

Field pennycress (Thlaspi arvense L.) is being domesticated as a new winter cover crop and biofuel species for the Midwestern United States that can be double-cropped between corn and soybeans. A genome sequence will enable the use of new technologies to make improvements in pennycress. To generate a draft genome, a hybrid sequencing approach was used to generate 47 Gb of DNA sequencing reads from both the Illumina and PacBio platforms. These reads were used to assemble 6,768 genomic scaffolds. The draft genome was annotated using the MAKER pipeline, which identified 27,390 predicted protein-coding genes, with almost all of these predicted peptides having significant sequence similarity to Arabidopsis proteins. A comprehensive analysis of pennycress gene homologues involved in glucosinolate biosynthesis, metabolism, and transport pathways revealed high sequence conservation compared with other Brassicaceae species, and helps validate the assembly of the pennycress gene space in this draft genome. Additional comparative genomic analyses indicate that the knowledge gained from years of basic Brassicaceae research will serve as a powerful tool for identifying gene targets whose manipulation can be predicted to result in improvements for pennycress.

[1]  A. Virtanen,et al.  A New Type of Enzymatic Cleavage of Mustard Oil Glucosides. Formation of Allylthiocyanate in Thlaspi arvense L. and Benzylthiocyanate in Lepidium ruderale L. and Lepidium Sativum L. , 1959 .

[2]  G. Mcintyre,et al.  THE BIOLOGY OF CANADIAN WEEDS: 9. Thlaspi arvense L. , 1975 .

[3]  R. Turkington,et al.  The biology of Canadian weeds , 1978 .

[4]  G. A. Mulligan The biology of Canadian weeds , 1979 .

[5]  H. Saini,et al.  Breakage of Seed Dormancy of Field Pennycress (Thlaspi arvense) by Growth Regulators, Nitrate, and Environmental Factors , 1987, Weed Science.

[6]  F. Ausubel,et al.  Specialized binary vector for plant transformation: expression of the Arabidopsis thaliana AHAS gene in Nicotiana tabacum. , 1988, Nucleic acids research.

[7]  F. Ausubel,et al.  A procedure for mapping Arabidopsis mutations using co-dominant ecotype-specific PCR-based markers. , 1993, The Plant journal : for cell and molecular biology.

[8]  M. Koch,et al.  Molecular data reveal convergence in fruit characters used in the classification of Thlaspi s. l. (Brassicaceae) , 1997 .

[9]  The Arabidopsis Genome Initiative Analysis of the genome sequence of the flowering plant Arabidopsis thaliana , 2000, Nature.

[10]  L. Kochian,et al.  The molecular physiology of heavy metal transport in the Zn/Cd hyperaccumulator Thlaspi caerulescens. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[11]  Thlaspi arvense L. , 2000 .

[12]  S. Henikoff,et al.  Targeting induced local lesions IN genomes (TILLING) for plant functional genomics. , 2000, Plant physiology.

[13]  S. Warwick,et al.  The biology of Canadian weeds. 9. Thlaspi arvense L. (updated) , 2002 .

[14]  S. Polasky,et al.  Agricultural sustainability and intensive production practices , 2002, Nature.

[15]  I. Al‐Shehbaz,et al.  Taxonomic and Phylogenetic Evaluation of the American “Thlaspi” Species: Identity and Relationship to the Eurasian Genus Noccaea (Brassicaceae) , 2004 .

[16]  Z. Chen,et al.  Evolution of genome size in Brassicaceae. , 2005, Annals of botany.

[17]  Barbara Ann Halkier,et al.  Biology and biochemistry of glucosinolates. , 2006, Annual review of plant biology.

[18]  S. Abel,et al.  Glucosinolate metabolism and its control. , 2006, Trends in plant science.

[19]  M. Koornneef,et al.  Cloning of DOG1, a quantitative trait locus controlling seed dormancy in Arabidopsis , 2006, Proceedings of the National Academy of Sciences.

[20]  D. Kliebenstein,et al.  The Gene Controlling the Quantitative Trait Locus EPITHIOSPECIFIER MODIFIER1 Alters Glucosinolate Hydrolysis and Insect Resistance in Arabidopsis[W] , 2006, The Plant Cell Online.

[21]  Michael J Holdsworth,et al.  Molecular networks regulating Arabidopsis seed maturation, after-ripening, dormancy and germination. , 2008, The New phytologist.

[22]  Sofia M. C. Robb,et al.  MAKER: an easy-to-use annotation pipeline designed for emerging model organism genomes. , 2007, Genome research.

[23]  S. Baud,et al.  Regulation of de novo fatty acid synthesis in maturing oilseeds of Arabidopsis. , 2009, Plant physiology and biochemistry : PPB.

[24]  Adam M. Wentzell,et al.  MODIFIED VACUOLE PHENOTYPE1 Is an Arabidopsis Myrosinase-Associated Protein Involved in Endomembrane Protein Trafficking1[W][OA] , 2009, Plant Physiology.

[25]  Christian Jung,et al.  Flowering time control and applications in plant breeding. , 2009, Trends in plant science.

[26]  C. Rogers,et al.  Deletion-Based Reverse Genetics in Medicago truncatula1[W][OA] , 2009, Plant Physiology.

[27]  Gerhard Knothe,et al.  Production and Evaluation of Biodiesel from Field Pennycress (Thlaspi arvense L.) Oil , 2009 .

[28]  Shailesh N. Shah,et al.  Composition and physical properties of cress (Lepidium sativum L.) and field pennycress (Thlaspi arvense L.) oils , 2009 .

[29]  M. Koornneef,et al.  The development of Arabidopsis as a model plant. , 2010, The Plant journal : for cell and molecular biology.

[30]  M. Burow,et al.  A thiocyanate-forming protein generates multiple products upon allylglucosinolate breakdown in Thlaspi arvense. , 2011, Phytochemistry.

[31]  Richard M. Clark,et al.  The Arabidopsis lyrata genome sequence and the basis of rapid genome size change , 2011, Nature Genetics.

[32]  I. Al‐Shehbaz,et al.  Cabbage family affairs: the evolutionary history of Brassicaceae. , 2011, Trends in plant science.

[33]  J. Poulain,et al.  The genome of the mesopolyploid crop species Brassica rapa , 2011, Nature Genetics.

[34]  Walter Pirovano,et al.  BIOINFORMATICS APPLICATIONS , 2022 .

[35]  M. Seo,et al.  The Time Required for Dormancy Release in Arabidopsis Is Determined by DELAY OF GERMINATION1 Protein Levels in Freshly Harvested Seeds[OA] , 2012, Plant Cell.

[36]  R. Hedrich,et al.  NRT/PTR transporters are essential for translocation of glucosinolate defence compounds to seeds , 2012, Nature.

[37]  H. Hoekstra,et al.  Double Digest RADseq: An Inexpensive Method for De Novo SNP Discovery and Genotyping in Model and Non-Model Species , 2012, PloS one.

[38]  W. Phippen,et al.  Soybean Seed Yield and Quality as a Response to Field Pennycress Residue , 2012 .

[39]  Mihaela M. Martis,et al.  A physical, genetic and functional sequence assembly of the barley genome. , 2022 .

[40]  W. Pirovano,et al.  Toward almost closed genomes with GapFiller , 2012, Genome Biology.

[41]  M. Burow,et al.  Evolution of specifier proteins in glucosinolate-containing plants , 2012, BMC Evolutionary Biology.

[42]  Fui Ling Ng,et al.  Draft genome sequence of the rubber tree Hevea brasiliensis , 2013, BMC Genomics.

[43]  Trevor W. Rife,et al.  Genotyping‐by‐Sequencing for Plant Breeding and Genetics , 2012 .

[44]  Jun Wang,et al.  Insights into salt tolerance from the genome of Thellungiella salsuginea , 2012, Proceedings of the National Academy of Sciences.

[45]  T. Isbell,et al.  Extraction of pennycress (Thlaspi arvense L.) seed oil by full pressing. , 2012 .

[46]  M. Koch,et al.  Taxonomy and systematics are key to biological information: Arabidopsis, Eutrema (Thellungiella), Noccaea and Schrenkiella (Brassicaceae) as examples , 2013, Front. Plant Sci..

[47]  B. Halkier,et al.  Integration of Biosynthesis and Long-Distance Transport Establish Organ-Specific Glucosinolate Profiles in Vegetative Arabidopsis[W] , 2013, Plant Cell.

[48]  Tom N. Kalnes,et al.  A life cycle assessment of pennycress (Thlaspi arvense L.) -derived jet fuel and diesel , 2013 .

[49]  James K. Hane,et al.  Draft genome sequence of chickpea (Cicer arietinum) provides a resource for trait improvement , 2013, Nature Biotechnology.

[50]  K. Dorn,et al.  De novo assembly of the pennycress (Thlaspi arvense) transcriptome provides tools for the development of a winter cover crop and biodiesel feedstock , 2013, The Plant journal : for cell and molecular biology.

[51]  Meng-Han Yang,et al.  Identification of cucurbitacins and assembly of a draft genome for Aquilaria agallocha , 2013, BMC Genomics.

[52]  Mathieu Blanchette,et al.  The Capsella rubella genome and the genomic consequences of rapid mating system evolution , 2013, Nature Genetics.

[53]  Oscar Westesson,et al.  Visualizing next-generation sequencing data with JBrowse , 2013, Briefings Bioinform..

[54]  Jun Wang,et al.  Genome sequencing of the high oil crop sesame provides insight into oil biosynthesis , 2014, Genome Biology.

[55]  Ian A. Waitz,et al.  Market Cost of Renewable Jet Fuel Adoption in the United States , 2013 .

[56]  T. Isbell,et al.  Extraction of proteins from pennycress seeds and press cake , 2013 .

[57]  W. J. Lucas,et al.  The draft genome of watermelon (Citrullus lanatus) and resequencing of 20 diverse accessions , 2012, Nature Genetics.

[58]  Vladimir Nekrasov,et al.  Plant genome editing made easy: targeted mutagenesis in model and crop plants using the CRISPR/Cas system , 2013, Plant Methods.

[59]  Simon Prochnik,et al.  The Reference Genome of the Halophytic Plant Eutrema salsugineum , 2013, Front. Plant Sci..

[60]  T. Isbell,et al.  Effects of cold-pressing and seed cooking on functional properties of protein in pennycress (Thlaspi arvense L.) seed and press cakes. , 2013 .

[61]  E. Lyons,et al.  Whole Genome and Tandem Duplicate Retention Facilitated Glucosinolate Pathway Diversification in the Mustard Family , 2013, Genome biology and evolution.

[62]  Aaron M. Newman,et al.  The genome sequence of the colonial chordate, Botryllus schlosseri , 2013, eLife.

[63]  Z. Fei,et al.  Root and shoot transcriptome analysis of two ecotypes of Noccaea caerulescens uncovers the role of NcNramp1 in Cd hyperaccumulation. , 2014, The Plant journal : for cell and molecular biology.

[64]  Andrew G. Sharpe,et al.  The emerging biofuel crop Camelina sativa retains a highly undifferentiated hexaploid genome structure , 2014, Nature Communications.

[65]  Kun Lu,et al.  The Brassica oleracea genome reveals the asymmetrical evolution of polyploid genomes , 2014, Nature Communications.

[66]  H. Karimmojeni,et al.  Dormancy breaking and seed germination of the annual weeds Thlaspi arvense, Descurainia sophia and Malcolmia africana (Brassicaceae) , 2014 .

[67]  R. Terauchi,et al.  Harvesting the Promising Fruits of Genomics: Applying Genome Sequencing Technologies to Crop Breeding , 2014, PLoS biology.

[68]  P. Satya,et al.  Next generation sequencing technologies for next generation plant breeding , 2014, Front. Plant Sci..

[69]  W. Phippen,et al.  New approaches to facilitate rapid domestication of a wild plant to an oilseed crop: example pennycress (Thlaspi arvense L.). , 2014, Plant science : an international journal of experimental plant biology.