High Occurrence of Functional New Chimeric Genes in Survey of Rice Chromosome 3 Short Arm Genome Sequences

In an effort to identify newly evolved genes in rice, we searched the genomes of Asian-cultivated rice Oryza sativa ssp. japonica and its wild progenitors, looking for lineage-specific genes. Using genome pairwise comparison of approximately 20-Mb DNA sequences from the chromosome 3 short arm (Chr3s) in six rice species, O. sativa, O. nivara, O. rufipogon, O. glaberrima, O. barthii, and O. punctata, combined with synonymous substitution rate tests and other evidence, we were able to identify potential recently duplicated genes, which evolved within the last 1 Myr. We identified 28 functional O. sativa genes, which likely originated after O. sativa diverged from O. glaberrima. These genes account for around 1% (28/3,176) of all annotated genes on O. sativa’s Chr3s. Among the 28 new genes, two recently duplicated segments contained eight genes. Fourteen of the 28 new genes consist of chimeric gene structure derived from one or multiple parental genes and flanking targeting sequences. Although the majority of these 28 new genes were formed by single or segmental DNA-based gene duplication and recombination, we found two genes that were likely originated partially through exon shuffling. Sequence divergence tests between new genes and their putative progenitors indicated that new genes were most likely evolving under natural selection. We showed all 28 new genes appeared to be functional, as suggested by Ka/Ks analysis and the presence of RNA-seq, cDNA, expressed sequence tag, massively parallel signature sequencing, and/or small RNA data. The high rate of new gene origination and of chimeric gene formation in rice may demonstrate rice’s broad diversification, domestication, its environmental adaptation, and the role of new genes in rice speciation

[1]  Jun Wang,et al.  gKaKs: the pipeline for genome-level Ka/Ks calculation , 2013, Bioinform..

[2]  M. Long,et al.  New genes expressed in human brains: Implications for annotating evolving genomes , 2012, BioEssays : news and reviews in molecular, cellular and developmental biology.

[3]  S. Shiu,et al.  Comparative transcriptomics of three Poaceae species reveals patterns of gene expression evolution. , 2012, The Plant journal : for cell and molecular biology.

[4]  J. Parsch,et al.  Newly evolved genes: Moving from comparative genomics to functional studies in model systems , 2012, BioEssays : news and reviews in molecular, cellular and developmental biology.

[5]  Anirvan Ghosh,et al.  Inhibition of SRGAP2 Function by Its Human-Specific Paralogs Induces Neoteny during Spine Maturation , 2012, Cell.

[6]  M. Long,et al.  Retrogenes Moved Out of the Z Chromosome in the Silkworm , 2012, Journal of Molecular Evolution.

[7]  Albert J. Vilella,et al.  Insights into hominid evolution from the gorilla genome sequence , 2012, Nature.

[8]  D. Hartl,et al.  Functional evidence that a recently evolved Drosophila sperm-specific gene boosts sperm competition , 2012, Proceedings of the National Academy of Sciences.

[9]  M. Long,et al.  The origin and evolution of new genes. , 2012, Methods in molecular biology.

[10]  Hiroaki Sakai,et al.  Retrogenes in Rice (Oryza sativa L. ssp. japonica) Exhibit Correlated Expression with Their Source Genes , 2011, Genome biology and evolution.

[11]  L. Peshkin,et al.  Genome sequencing reveals insights into physiology and longevity of the naked mole rat , 2011, Nature.

[12]  M. Long,et al.  Accelerated Recruitment of New Brain Development Genes into the Human Genome , 2011, PLoS biology.

[13]  Vipin T. Sreedharan,et al.  Multiple reference genomes and transcriptomes for Arabidopsis thaliana , 2011, Nature.

[14]  Richard M. Clark,et al.  The Arabidopsis lyrata genome sequence and the basis of rapid genome size change , 2011, Nature Genetics.

[15]  Albert J. Vilella,et al.  Comparative and demographic analysis of orang-utan genomes , 2011, Nature.

[16]  Manyuan Long,et al.  New Genes in Drosophila Quickly Become Essential , 2010, Science.

[17]  Li Zhao,et al.  A Young Drosophila Duplicate Gene Plays Essential Roles in Spermatogenesis by Regulating Several Y-Linked Male Fertility Genes , 2010, PLoS genetics.

[18]  J. Jensen,et al.  Characterizing Recurrent Positive Selection at Fast-Evolving Genes in Drosophila miranda and Drosophila pseudoobscura , 2010, Genome biology and evolution.

[19]  D. Zilberman,et al.  Genome-Wide Evolutionary Analysis of Eukaryotic DNA Methylation , 2010, Science.

[20]  Philip L. F. Johnson,et al.  A Draft Sequence of the Neandertal Genome , 2010, Science.

[21]  Cole Trapnell,et al.  Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. , 2010, Nature biotechnology.

[22]  Y. Qi,et al.  Global Epigenetic and Transcriptional Trends among Two Rice Subspecies and Their Reciprocal Hybrids[W] , 2010, Plant Cell.

[23]  L. Tang,et al.  Phylogeny and biogeography of the rice tribe (Oryzeae): evidence from combined analysis of 20 chloroplast fragments. , 2010, Molecular phylogenetics and evolution.

[24]  N. Vinckenbosch,et al.  The emergence of new genes on the young therian X. , 2010, Trends in genetics : TIG.

[25]  M. Long,et al.  Extensive Structural Renovation of Retrogenes in the Evolution of the Populus Genome1[W][OA] , 2009, Plant Physiology.

[26]  D. Tautz,et al.  Emergence of a New Gene from an Intergenic Region , 2009, Current Biology.

[27]  Jin Jun,et al.  Duplication Mechanism and Disruptions in Flanking Regions Determine the Fate of Mammalian Gene Duplicates , 2009, J. Comput. Biol..

[28]  E. H. Margulies,et al.  An Expressed Fgf4 Retrogene Is Associated with Breed-Defining Chondrodysplasia in Domestic Dogs , 2009, Science.

[29]  Lior Pachter,et al.  Sequence Analysis , 2020, Definitions.

[30]  Robert P. Davey,et al.  Population genomics of domestic and wild yeasts , 2008, Nature.

[31]  E. Eichler,et al.  The evolution of human segmental duplications and the core duplicon hypothesis. , 2009, Cold Spring Harbor symposia on quantitative biology.

[32]  N. Vinckenbosch,et al.  RNA-based gene duplication: mechanistic and evolutionary insights , 2009, Nature Reviews Genetics.

[33]  Arcadi Navarro,et al.  A burst of segmental duplications in the genome of the African great ape ancestor , 2009, Nature.

[34]  J. Bennetzen,et al.  Dynamic Evolution of Oryza Genomes Is Revealed by Comparative Genomic Analysis of a Genus-Wide Vertical Data Set[W][OA] , 2008, The Plant Cell Online.

[35]  Yun Ding,et al.  On the origin of new genes in Drosophila. , 2008, Genome research.

[36]  M. Long,et al.  The subtelomere of Oryza sativa chromosome 3 short arm as a hot bed of new gene origination in rice. , 2008, Molecular plant.

[37]  Xing Wang Deng,et al.  Rice 2020: a call for an international coordinated effort in rice functional genomics. , 2008, Molecular plant.

[38]  Mark D. Rausher,et al.  Escape from adaptive conflict after duplication in an anthocyanin pathway gene , 2008, Nature.

[39]  Xin Li,et al.  Repetitive Element-Mediated Recombination as a Mechanism for New Gene Origination in Drosophila , 2007, PLoS genetics.

[40]  Melanie A. Huntley,et al.  Evolution of genes and genomes on the Drosophila phylogeny , 2007, Nature.

[41]  Rodrigo Lopez,et al.  Clustal W and Clustal X version 2.0 , 2007, Bioinform..

[42]  Ziheng Yang PAML 4: phylogenetic analysis by maximum likelihood. , 2007, Molecular biology and evolution.

[43]  Maria D. Vibranovski,et al.  A Microarray Based Genomic Hybridization Method for Identification of New Genes in Plants: Case Analyses of Arabidopsis and Oryza , 2007 .

[44]  B. Meyers,et al.  An expression atlas of rice mRNAs and small RNAs , 2007, Nature Biotechnology.

[45]  Jun Wang,et al.  High Rate of Chimeric Gene Origination by Retroposition in Plant Genomes[W] , 2006, The Plant Cell Online.

[46]  Kan Nobuta,et al.  Plant MPSS databases: signature-based transcriptional resources for analyses of mRNA and small RNA , 2005, Nucleic Acids Res..

[47]  K. Tamura,et al.  A Novel Chimeric Gene, siren, With Retroposed Promoter Sequence in the Drosophila bipectinata Complex , 2005, Genetics.

[48]  A. Reymond,et al.  Emergence of Young Human Genes after a Burst of Retroposition in Primates , 2005, PLoS biology.

[49]  Jean L. Chang,et al.  Initial sequence of the chimpanzee genome and comparison with the human genome , 2005, Nature.

[50]  Corbin D. Jones,et al.  Parallel evolution of chimeric fusion genes. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[51]  Qihui Zhu,et al.  Phylogenetic relationships among A-genome species of the genus Oryza revealed by intron sequences of four nuclear genes. , 2005, The New phytologist.

[52]  Yujun Zhang,et al.  Computational Identification of 69 Retroposons in Arabidopsis1[w] , 2005, Plant Physiology.

[53]  Dawei Li,et al.  The Genomes of Oryza sativa: A History of Duplications , 2005, PLoS biology.

[54]  K. Katoh,et al.  MAFFT version 5: improvement in accuracy of multiple sequence alignment , 2005, Nucleic acids research.

[55]  Sean R. Eddy,et al.  Pack-MULE transposable elements mediate gene evolution in plants , 2004, Nature.

[56]  J. Luban,et al.  Cyclophilin A retrotransposition into TRIM5 explains owl monkey resistance to HIV-1 , 2004, Nature.

[57]  Manyuan Long,et al.  Duplication-degeneration as a mechanism of gene fission and the origin of new genes in Drosophila species , 2004, Nature Genetics.

[58]  Kevin R. Thornton,et al.  The origin of new genes: glimpses from the young and old , 2003, Nature Reviews Genetics.

[59]  R. Durbin,et al.  The Genome Sequence of Caenorhabditis briggsae: A Platform for Comparative Genomics , 2003, PLoS biology.

[60]  B. Charlesworth,et al.  On the genomic location of the exuperantia1 gene in Drosophila miranda: the limits of in situ hybridization experiments. , 2003, Genetics.

[61]  K. Kadowaki,et al.  Diversity in the Oryza genus. , 2003, Current opinion in plant biology.

[62]  D. Haber,et al.  The Tre2 (USP6) oncogene is a hominoid-specific gene , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[63]  L. Wagner,et al.  21. UniGene: A Unified View of the Transcriptome , 2003 .

[64]  Kevin R. Thornton,et al.  Retroposed new genes out of the X in Drosophila. , 2002, Genome research.

[65]  W. J. Kent,et al.  BLAT--the BLAST-like alignment tool. , 2002, Genome research.

[66]  Jianzhi Zhang,et al.  Adaptive evolution of a duplicated pancreatic ribonuclease gene in a leaf-eating monkey , 2002, Nature Genetics.

[67]  M. Long,et al.  The origin of the Jingwei gene and the complex modular structure of its parental gene, yellow emperor, in Drosophila melanogaster. , 2000, Molecular biology and evolution.

[68]  Z. Yang,et al.  Estimating synonymous and nonsynonymous substitution rates under realistic evolutionary models. , 2000, Molecular biology and evolution.

[69]  R. Lewontin,et al.  Origin of new genes , 2000 .

[70]  T. Sang,et al.  Phylogeny of rice genomes with emphasis on origins of allotetraploid species. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[71]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[72]  W. Gilbert,et al.  Exon shuffling and the origin of the mitochondrial targeting function in plant cytochrome c1 precursor. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[73]  M. Long,et al.  Natural selection and the origin of jingwei, a chimeric processed functional gene in Drosophila. , 1993, Science.

[74]  D. Ord,et al.  PAUP:Phylogenetic analysis using parsi-mony , 1993 .