Genome-Wide Analysis of Syntenic Gene Deletion in the Grasses

The grasses (Poaceae) are one of the largest and most successful angiosperm families. Like many radiations of flowering plants, the divergence of the major grass lineages was preceded by a whole-genome duplication (WGD), although such events are not rare in flowering plants. By combining identification of syntenic gene blocks with measures of gene pair divergence and differing frequencies of ancient gene loss, we have separated the two subgenomes present in modern grasses. Reciprocal loss of duplicated genes or genomic regions has been hypothesized to reproductively isolate populations and thus to drive speciation. However, in contrast to previous studies in yeast and teleost fishes, we found very little evidence of reciprocal loss of homeologous genes among the grasses, suggesting that post-WGD gene loss may not be the cause of the grass radiation. The sets of homeologous and orthologous genes and the predicted locations of deleted genes identified in this study, along with links to the CoGe comparative genomics web platform for analyzing pan-grass syntenic regions, are provided with this paper as a resource for the grass genetics community.
