Gene Content and Distribution in the Nuclear Genome of Fragaria vesca

Thirty fosmids were randomly selected from a library of Fragaria vesca subsp. americana (cv. Pawtuckaway) DNA. These fosmid clones were individually sheared, and ∼4‐ to 5‐kb fragments were subcloned. Subclones on a single 384‐well plate were sequenced bidirectionally for each fosmid. Assembly of these data yielded 12 fosmid inserts completely sequenced, 14 inserts as 2 to 3 contiguous sequences (contigs), and 4 inserts with 5 to 9 contigs. In most cases, a single unambiguous contig order and orientation was determined, so no further finishing was required to identify genes and their relative arrangement. One hundred fifty‐eight genes were identified in the ∼1.0 Mb of nuclear genomic DNA that was assembled. Because these fosmids were randomly chosen, this allowed prediction of the genetic content of the entire ∼200 Mb F. vesca genome as about 30,500 protein‐encoding genes, plus >4700 truncated gene fragments. The genes are mostly arranged in gene‐rich regions, to a variable degree intermixed with transposable elements (TEs). The most abundant TEs in F. vesca were found to be long terminal repeat (LTR) retrotransposons, and these comprised about 13% of the DNA analyzed. Over 30 new repeat families were discovered, mostly TEs, and the total TE content of F. vesca is predicted to be at least 16%.

[1]  David Simpson,et al.  Comparative Genetic Mapping Between Octoploid and Diploid Fragaria Species Reveals a High Level of Colinearity Between Their Genomes and the Essentially Disomic Behavior of the Cultivated Octoploid Strawberry , 2008, Genetics.

[2]  J. Bennetzen,et al.  Enchilada redux: how complete is your genome sequence? , 2008, The New phytologist.

[3]  Andrew H. Paterson,et al.  Synteny and Collinearity in Plant Genomes , 2008, Science.

[4]  Stephen M. Mount,et al.  The draft genome of the transgenic tropical fruit tree papaya (Carica papaya Linnaeus) , 2008, Nature.

[5]  E. Mardis The impact of next-generation sequencing technology on genetics. , 2008, Trends in genetics : TIG.

[6]  Ncbi National Center for Biotechnology Information , 2008 .

[7]  M. Freeling,et al.  The evolutionary position of subfunctionalization, downgraded. , 2008, Genome dynamics.

[8]  D. Sargent,et al.  Synteny conservation between two distantly-related Rosaceae genomes: Prunus (the stone fruits) and Fragaria (the strawberry) , 2008, BMC Plant Biology.

[9]  Renyi Liu,et al.  Discovery and assembly of repeat family pseudomolecules from sparse genomic sequence data using the Assisted Automated Assembler of Repeat Families (AAARF) algorithm , 2008, BMC Bioinformatics.

[10]  J. Poulain,et al.  The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla , 2007, Nature.

[11]  J. Bennetzen,et al.  A GeneTrek analysis of the maize genome , 2007, Proceedings of the National Academy of Sciences.

[12]  Z. Chen,et al.  Genetic and epigenetic mechanisms for gene expression and phenotypic variation in plant polyploids. , 2007, Annual review of plant biology.

[13]  J. Bennetzen,et al.  Patterns in grass genome evolution. , 2007, Current opinion in plant biology.

[14]  Kanako O. Koyanagi,et al.  Curated genome annotation of Oryza sativa ssp. japonica and comparative genome analysis with Arabidopsis thaliana. , 2007, Genome research.

[15]  J. Bennetzen,et al.  Analysis of retrotransposon structural diversity uncovers properties and propensities in angiosperm genome evolution , 2006, Proceedings of the National Academy of Sciences.

[16]  S. Wessler,et al.  Transposable elements and the evolution of eukaryotic genomes , 2006, Proceedings of the National Academy of Sciences.

[17]  T. Davis,et al.  Strawberry Genes and Genomics , 2006 .

[18]  Rod A Wing,et al.  Differential lineage-specific amplification of transposable elements is responsible for genome size variation in Gossypium. , 2006, Genome research.

[19]  S. Jackson,et al.  Doubling genome size without polyploidization: dynamics of retrotransposition-driven genomic expansions in Oryza australiensis, a wild relative of rice. , 2006, Genome research.

[20]  M. Gribskov,et al.  The Genome of Black Cottonwood, Populus trichocarpa (Torr. & Gray) , 2006, Science.

[21]  Richard M. Clark,et al.  A distant upstream enhancer at the maize domestication gene tb1 has pleiotropic effects on plant and inflorescent architecture , 2006, Nature Genetics.

[22]  Jianxin Ma,et al.  Analysis and mapping of randomly chosen bacterial artificial chromosome clones from hexaploid bread wheat. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[23]  J. Bennetzen,et al.  Gene enrichment in maize with hypomethylated partial restriction (HMPR) libraries. , 2005, Genome research.

[24]  Gene A. Brewer,et al.  Comparative physical mapping links conservation of microsynteny to chromosome structure and recombination in grasses. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[25]  M. Morgante,et al.  Gene duplication and exon shuffling by helitron-like transposons generate intraspecies diversity in maize , 2005, Nature Genetics.

[26]  D. Leister,et al.  Generation and evolutionary fate of insertions of organelle DNA in the nuclear genomes of flowering plants. , 2005, Genome research.

[27]  Jianxin Ma,et al.  Consistent over-estimation of gene number in complex plant genomes. , 2004, Current opinion in plant biology.

[28]  J. Bennetzen,et al.  Gene loss and movement in the maize genome. , 2004, Genome research.

[29]  Sean R. Eddy,et al.  Pack-MULE transposable elements mediate gene evolution in plants , 2004, Nature.

[30]  Jianxin Ma,et al.  Rapid recent growth and divergence of rice nuclear genomes. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[31]  Jianxin Ma,et al.  Analyses of LTR-retrotransposon structures reveal recent and rapid genomic DNA loss in rice. , 2004, Genome research.

[32]  Srinivas Aluru,et al.  A strategy for assembling the maize (Zea mays L.) genome , 2004, Bioinform..

[33]  J. Wendel,et al.  Feast and famine in plant genomes , 2002, Genetica.

[34]  J Quackenbush,et al.  Enrichment of Gene-Coding Sequences in Maize by Genome Filtration , 2003, Science.

[35]  J. Bennetzen,et al.  A complex history of rearrangement in an orthologous region of the maize, sorghum, and rice genomes , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[36]  Yinan Yuan,et al.  High-Cot sequence analysis of the maize genome. , 2003, The Plant journal : for cell and molecular biology.

[37]  O. Panaud,et al.  Formation of solo-LTRs through unequal homologous recombination counterbalances amplifications of LTR retrotransposons in rice Oryza sativa L. , 2003, Molecular biology and evolution.

[38]  John F. McDonald,et al.  LTR_STRUC: a novel search and identification program for LTR retrotransposons , 2003, Bioinform..

[39]  T. Wicker,et al.  Rapid Genome Divergence at Orthologous Low Molecular Weight Glutenin Loci of the A and A m Genomes of Wheat , 2003 .

[40]  D. Church,et al.  Cross-species sequence comparisons: a review of methods and available resources. , 2003, Genome research.

[41]  E. Birney,et al.  Apollo: a sequence annotation editor , 2002, Genome Biology.

[42]  J. Messing,et al.  Mosaic organization of orthologous sequences in grass genomes. , 2002, Genome research.

[43]  J. Bennetzen,et al.  The regulatory regions required for B' paramutation and expression are located far upstream of the maize b1 transcribed sequences. , 2002, Genetics.

[44]  J. Bennetzen,et al.  Methylation-spanning linker libraries link gene-rich regions and identify epigenetic boundaries in Zea mays. , 2002, Genome research.

[45]  James K. M. Brown,et al.  Genome size reduction through illegitimate recombination counteracts genome expansion in Arabidopsis. , 2002, Genome research.

[46]  H. Fu,et al.  Intraspecific violation of genetic colinearity and its implications in maize , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[47]  K. Fukui,et al.  Estimation of the nuclear DNA content of strawberries (Fragaria spp.) compared with Arabidopsis thaliana by using dual-step flow cytometry , 2001 .

[48]  J. Bennetzen,et al.  National Science Foundation-sponsored workshop report. Maize genome sequencing project. , 2001, Plant physiology.

[49]  K. Theres,et al.  Comparative sequence analysis reveals extensive microcolinearity in the lateral suppressor regions of the tomato, Arabidopsis, and Capsella genomes. , 2001, The Plant cell.

[50]  The Arabidopsis Genome Initiative Analysis of the genome sequence of the flowering plant Arabidopsis thaliana , 2000, Nature.

[51]  M. Lynch,et al.  The evolutionary fate and consequences of duplicate genes. , 2000, Science.

[52]  Robert A. Martienssen,et al.  Differential methylation of genes and retrotransposons facilitates shotgun sequencing of the maize genome , 1999, Nature Genetics.

[53]  J. Bennetzen,et al.  Colinearity and its exceptions in orthologous adh regions of maize and sorghum. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[54]  Phillip SanMiguel,et al.  Evidence that a Recent Increase in Maize Genome Size was Caused by the Massive Amplification of Intergene Retrotransposons , 1998 .

[55]  P. Green,et al.  Consed: a graphical tool for sequence finishing. , 1998, Genome research.

[56]  P. Green,et al.  Base-calling of automated sequencer traces using phred. I. Accuracy assessment. , 1998, Genome research.

[57]  J. Bennetzen,et al.  Sequence organization and conservation in sh2/a1-homologous regions of sorghum and rice. , 1998, Genetics.

[58]  J. Bennetzen,et al.  Integration and nonrandom mutation of a plasma membrane proton ATPase gene fragment within the Bs1 retroelement of maize. , 1994, The Plant cell.

[59]  G. Stauct THE SPECIES OF FRAGARIA, THEIR TAXONOMY AND GEOGRAPHICAL DISTRIBUTION , 1989 .

[60]  J. Sambrook,et al.  Molecular Cloning: A Laboratory Manual , 2001 .