The genome of woodland strawberry (Fragaria vesca)

The woodland strawberry, Fragaria vesca (2n = 2x = 14), is a versatile experimental plant system. This diminutive herbaceous perennial has a small genome (240 Mb), is amenable to genetic transformation and shares substantial sequence identity with the cultivated strawberry (Fragaria × ananassa) and other economically important rosaceous plants. Here we report the draft F. vesca genome, which was sequenced to ×39 coverage using second-generation technology, assembled de novo and then anchored to the genetic linkage map into seven pseudochromosomes. This diploid strawberry sequence lacks the large genome duplications seen in other rosids. Gene prediction modeling identified 34,809 genes, with most being supported by transcriptome mapping. Genes critical to valuable horticultural traits including flavor, nutritional value and flowering time were identified. Macrosyntenic relationships between Fragaria and Prunus predict a hypothetical ancestral Rosaceae genome that had nine chromosomes. New phylogenetic analysis of 154 protein-coding genes suggests that assignment of Populus to Malvidae, rather than Fabidae, is warranted.
