The genome of the mesopolyploid crop species Brassica rapa

We report the annotation and analysis of the draft genome sequence of Brassica rapa accession Chiifu-401-42, a Chinese cabbage. We modeled 41,174 protein-coding genes in the B. rapa genome, which has undergone genome triplication. We used Arabidopsis thaliana as an outgroup to investigate the consequences of genome triplication for structural and functional evolution. The extent of gene loss (fractionation) varies among the triplicated genome segments, with one of the three copies consistently retaining a disproportionately large fraction of the genes expected to have been present in its ancestor. Variation in the number of members of gene families present in the genome may contribute to the remarkable morphological plasticity of Brassica species. The B. rapa genome sequence provides an important resource for studying the evolution of polyploid genomes and underpins the genetic improvement of Brassica oil and vegetable crops.
