Evolution of Gene Structural Complexity: An Alternative-Splicing-Based Model Accounts for Intron-Containing Retrogenes1[W]

An alternative-splicing-based model explains the formation of retrogenes that retained the parental intron structure and indicates that plants have a much higher percentage of this kind of complex retrogene than animals. The structure of eukaryotic genes evolves extensively by intron loss or gain. Previous studies have revealed two models for gene structure evolution through the loss of introns: RNA-based gene conversion, dubbed the Fink model and retroposition model. However, retrogenes that experienced both intron loss and intron-retaining events have been ignored; evolutionary processes responsible for the variation in complex exon-intron structure were unknown. We detected hundreds of retroduplication-derived genes in human (Homo sapiens), fly (Drosophila melanogaster), rice (Oryza sativa), and Arabidopsis (Arabidopsis thaliana) and categorized them either as duplicated genes that have all introns lost or as duplicated genes that have at least lost one and retained one intron compared with the parental copy (intron-retaining [IR] type). Our new model attributes intron retention alternative splicing to the generation of these IR-type gene pairs. We presented 25 parental genes that have an intron retention isoform and have retained introns in the same locations in the IR-type duplicate genes, which directly support our hypothesis. Our alternative-splicing-based model in conjunction with the retroposition and Fink models can explain the IR-type gene observed. We discovered a greater percentage of IR-type genes in plants than in animals, which may be due to the abundance of intron retention cases in plants. Given the prevalence of intron retention in plants, this new model gives a support that plant genomes have very complex gene structures.

[1]  M. Long,et al.  Extensive Structural Renovation of Retrogenes in the Evolution of the Populus Genome1[W][OA] , 2009, Plant Physiology.

[2]  M. Long,et al.  Intron presence–absence polymorphism in Drosophila driven by positive Darwinian selection , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Xuehui Huang,et al.  Function annotation of the rice transcriptome at single-nucleotide resolution by RNA-seq. , 2010, Genome research.

[4]  Walter Gilbert,et al.  The evolution of spliceosomal introns: patterns, puzzles and progress , 2006, Nature Reviews Genetics.

[5]  G. Ast,et al.  Alternative splicing and evolution: diversification, exon definition and function , 2010, Nature Reviews Genetics.

[6]  Scott A. Rifkin,et al.  A Gene Expression Map for the Euchromatic Genome of Drosophila melanogaster , 2004, Science.

[7]  W. Barbazuk,et al.  Genome-wide analyses of alternative splicing in plants: opportunities and challenges. , 2008, Genome research.

[8]  J. Brosius,et al.  Retroposons--seeds of evolution. , 1991, Science.

[9]  M. Long,et al.  New genes as drivers of phenotypic evolution , 2013, Nature Reviews Genetics.

[10]  D. Garfinkel,et al.  RNA-mediated recombination in S. cerevisiae , 1991, Cell.

[11]  W. J. Kent,et al.  BLAT--the BLAST-like alignment tool. , 2002, Genome research.

[12]  Yujun Zhang,et al.  Computational Identification of 69 Retroposons in Arabidopsis1[w] , 2005, Plant Physiology.

[13]  Noboru Jo Sakabe,et al.  Detection and evaluation of intron retention events in the human transcriptome. , 2004, RNA.

[14]  Z. Yang,et al.  Estimating synonymous and nonsynonymous substitution rates under realistic evolutionary models. , 2000, Molecular biology and evolution.

[15]  O. Gotoh,et al.  Species-specific variation of alternative splicing and transcriptional initiation in six eukaryotes. , 2005, Gene.

[16]  C. Ben-Dov,et al.  Unconstrained mining of transcript data reveals increased alternative splicing complexity in the human transcriptome , 2010, Nucleic acids research.

[17]  Tobias Mourier,et al.  Eukaryotic Intron Loss , 2003, Science.

[18]  L K Derr,et al.  The involvement of cellular recombination and repair genes in RNA-mediated recombination in Saccharomyces cerevisiae. , 1998, Genetics.

[19]  Masakazu Satou,et al.  Genome-wide analysis of alternative pre-mRNA splicing in Arabidopsis thaliana based on full-length cDNA sequences. , 2004, Nucleic acids research.

[20]  Jun Wang,et al.  High Rate of Chimeric Gene Origination by Retroposition in Plant Genomes[W] , 2006, The Plant Cell Online.

[21]  M. Irimia,et al.  Mystery of intron gain: new data and new models. , 2009, Trends in genetics : TIG.

[22]  N. Vinckenbosch,et al.  RNA-based gene duplication: mechanistic and evolutionary insights , 2009, Nature Reviews Genetics.

[23]  Kevin R. Thornton,et al.  The origin of new genes: glimpses from the young and old , 2003, Nature Reviews Genetics.

[24]  V. Brendel,et al.  Refined Annotation of the Arabidopsis Genome by Complete Expressed Sequence Tag Mapping1 , 2003, Plant Physiology.

[25]  M. Nei,et al.  MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. , 2011, Molecular biology and evolution.

[26]  M. Long,et al.  The rapid generation of chimerical genes expanding protein diversity in zebrafish , 2010, BMC Genomics.

[27]  Yamile Marquez,et al.  Transcriptome survey reveals increased complexity of the alternative splicing landscape in Arabidopsis , 2012, Genome research.

[28]  Hiroaki Sakai,et al.  Retrogenes in Rice (Oryza sativa L. ssp. japonica) Exhibit Correlated Expression with Their Source Genes , 2011, Genome biology and evolution.

[29]  Jun Wang,et al.  gKaKs: the pipeline for genome-level Ka/Ks calculation , 2013, Bioinform..

[30]  W R Pearson,et al.  Flexible sequence similarity searching with the FASTA3 program package. , 2000, Methods in molecular biology.

[31]  W. Stiekema,et al.  Comparative analysis indicates that alternative splicing in plants has a limited role in functional expansion of the proteome , 2009, BMC Genomics.

[32]  V. Brendel,et al.  Genomewide comparative analysis of alternative splicing in plants. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[33]  B. Frey,et al.  Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing , 2008, Nature Genetics.

[34]  M. Long,et al.  Origins of new male germ-line functions from X-derived autosomal retrogenes in the mouse. , 2007, Molecular biology and evolution.

[35]  Henry D. Priest,et al.  Genome-wide mapping of alternative splicing in Arabidopsis thaliana. , 2010, Genome research.

[36]  E. Koonin,et al.  Conservation versus parallel gains in intron evolution , 2005, Nucleic acids research.

[37]  Liqing Zhang,et al.  Burst of Young Retrogenes and Independent Retrogene Formation in Mammals , 2009, PloS one.

[38]  Eric T. Wang,et al.  Alternative Isoform Regulation in Human Tissue Transcriptomes , 2008, Nature.

[39]  B. Blencowe Alternative Splicing: New Insights from Global Analyses , 2006, Cell.

[40]  N. Vinckenbosch,et al.  Evolutionary fate of retroposed gene copies in the human genome. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[41]  G. Ast,et al.  Different levels of alternative splicing among eukaryotes , 2006, Nucleic acids research.

[42]  R. Plasterk,et al.  Frequent germline mutations and somatic repeat instability in DNA mismatch-repair-deficient Caenorhabditis elegans. , 2002, Genetics.

[43]  David States,et al.  Selecting for functional alternative splices in ESTs. , 2002, Genome research.

[44]  S. Farrington,et al.  Hypermutability at a poly(A/T) tract in the human germline. , 2001, Nucleic acids research.

[45]  Tin Wee Tan,et al.  DEDB: a database of Drosophila melanogaster exons in splicing graph form , 2004, BMC Bioinformatics.

[46]  Paola Bonizzoni,et al.  ASPicDB: a database of annotated transcript and protein variants generated by alternative splicing , 2010, Nucleic Acids Res..

[47]  A. Reymond,et al.  Emergence of Young Human Genes after a Burst of Retroposition in Primates , 2005, PLoS biology.

[48]  F. B. Pickett,et al.  Splitting pairs: the diverging fates of duplicated genes , 2002, Nature Reviews Genetics.

[49]  B. Barrell,et al.  Life with 6000 Genes , 1996, Science.

[50]  A. Saito,et al.  Evolutionary relationship of plant catalase genes inferred from exon-intron structures: isozyme divergence after the separation of monocots and dicots , 1998, Theoretical and Applied Genetics.

[51]  X. Gu,et al.  Intron gain and loss in segmentally duplicated genes in rice , 2006, Genome Biology.

[52]  Manyuan Long,et al.  Duplication-degeneration as a mechanism of gene fission and the origin of new genes in Drosophila species , 2004, Nature Genetics.

[53]  Michael Lynch,et al.  The evolution of spliceosomal introns. , 2002, Current opinion in genetics & development.

[54]  Walter Gilbert,et al.  The pattern of intron loss. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[55]  B. Graveley The developmental transcriptome of Drosophila melanogaster , 2010, Nature.

[56]  R. Ophir,et al.  Intron retention is a major phenomenon in alternative splicing in Arabidopsis. , 2004, The Plant journal : for cell and molecular biology.

[57]  Douglas R Hoen,et al.  The evolutionary fate of MULE-mediated duplications of host gene fragments in rice. , 2005, Genome research.

[58]  Yong E. Zhang,et al.  A cautionary note for retrocopy identification: DNA-based duplication of intron-containing genes significantly contributes to the origination of single exon genes , 2011, Bioinform..

[59]  Kevin R. Thornton,et al.  Retroposed new genes out of the X in Drosophila. , 2002, Genome research.

[60]  G. Fink,et al.  Pseudogenes in yeast? , 1987, Cell.

[61]  Ning Ma,et al.  BLAST+: architecture and applications , 2009, BMC Bioinformatics.

[62]  Stephen M. Mount,et al.  Comprehensive analysis of alternative splicing in rice and comparative analyses with Arabidopsis , 2006, BMC Genomics.