Rapid diversification of five Oryza AA genomes associated with rice adaptation

Significance

Asian rice (Oryza sativa) is among the world's most important crops. The genus Oryza has become a model for the study of plant genome structure, function, and evolution. We have undertaken de novo, full-genome sequence analysis of five diploid AA-genome species that are closely related to O. sativa. These species are native to very different environments across four continents and therefore exhibit distinct adaptations. Our studies identify specific genetic changes, in both gene copy number and the degree of diversifying natural selection, that point to the genes responsible for these adaptations, particularly genes related to defense against pathogens and reproductive diversification. This genome discovery and comparative analysis provide a powerful tool for future Oryza study and rice improvement.

Comparative genomic analyses among closely related species can greatly enhance our understanding of plant gene and genome evolution. We report de novo-assembled AA-genome sequences for Oryza nivara, Oryza glaberrima, Oryza barthii, Oryza glumaepatula, and Oryza meridionalis. Our analyses reveal massive levels of genomic structural variation, including segmental duplication and rapid gene family turnover, with particularly high instability in defense-related genes. We show, on a genomic scale, how lineage-specific expansion or contraction of gene families has led to the morphological and reproductive diversification of these species, thus illuminating the evolutionary process of speciation and adaptation. Despite strong purifying selective pressures on most Oryza genes, we documented a large number of positively selected genes, especially genes involved in flower development, reproduction, and resistance-related processes. These diversifying genes are expected to have played key roles in adaptations to their ecological niches in Asia, South America, Africa, and Australia. Extensive variation in noncoding RNA gene numbers, functional enrichment, and rates of sequence divergence might also help account for the different genetic adaptations of these rice species. Collectively, these resources provide new opportunities for evolutionary genomics, numerous insights into recent speciation, a valuable database of functional variation for crop improvement, and tools for efficient conservation of wild rice germplasm.
