Toward a General Model for the Evolutionary Dynamics of Gene Duplicates

Gene duplication is an important process in the functional divergence of genes and genomes. Several processes have been described that lead to duplicate gene retention over different timescales after both smaller-scale events and whole-genome duplication, including neofunctionalization, subfunctionalization, and dosage balance. Two common modes of duplicate gene loss include nonfunctionalization and loss due to population dynamics (failed fixation). Previous work has characterized expectations of duplicate gene retention under the neofunctionalization and subfunctionalization models. Here, that work is extended to dosage balance using simulations. A general model for duplicate gene loss/retention is then presented that is capable of fitting expectations under the different models, is defined at t = 0, and decays to an orthologous asymptotic rate rather than zero, based upon a modified Weibull hazard function. The model in a maximum likelihood framework shows the property of identifiability, recovering the evolutionary mechanism and parameters of simulation. This model is also capable of recovering the evolutionary mechanism of simulation from data generated using an unrelated network population genetic model. Lastly, the general model is applied as part of a mixture model to recent gene duplicates from the Oikopleura dioica genome, suggesting that neofunctionalization may be an important process leading to duplicate gene retention in that organism.

[1]  Johan A. Grahnen,et al.  Binding constraints on the evolution of enzymes and signalling proteins: the important role of negative pleiotropy , 2011, Proceedings of the Royal Society B: Biological Sciences.

[2]  Thomas J. Hardcastle,et al.  Getting a Full Dose? Reconsidering Sex Chromosome Dosage Compensation in the Silkworm, Bombyx mori , 2011, Genome biology and evolution.

[3]  Francesca D. Ciccarelli,et al.  Modification of Gene Duplicability during the Evolution of Protein Interaction Network , 2011, PLoS Comput. Biol..

[4]  Ariel Fernández,et al.  Nonadaptive origins of interactome complexity , 2011, Nature.

[5]  Dietlind L. Gerloff,et al.  BISC: Binary SubComplexes in proteins database , 2010, Nucleic Acids Res..

[6]  Manolis Kellis,et al.  A Bayesian Approach for Fast and Accurate Gene Tree Reconstruction , 2010, Molecular biology and evolution.

[7]  Frédéric Delsuc,et al.  Plasticity of Animal Genome Architecture Unmasked by Rapid Evolution of a Pelagic Tunicate , 2010, Science.

[8]  Shao‐Lun Liu,et al.  Dramatic change in function and expression pattern of a gene duplicated by polyploidy created a paternal effect gene in the Brassicaceae. , 2010, Molecular biology and evolution.

[9]  L. Rusche,et al.  Transcriptional silencing functions of the yeast protein Orc1/Sir3 subfunctionalized after gene duplication , 2010, Proceedings of the National Academy of Sciences.

[10]  D. Liberles,et al.  Evolution after Gene Duplication: Dittmar/Evolution After Gene Duplication , 2010 .

[11]  F. Kondrashov,et al.  The evolution of gene duplications: classifying and distinguishing between models , 2010, Nature Reviews Genetics.

[12]  D. Liberles,et al.  Evolution after gene duplication , 2010 .

[13]  Yang Liu,et al.  Divergence of exonic splicing elements after gene duplication and the impact on gene structures , 2009, Genome Biology.

[14]  J. Chris Pires,et al.  Gene and genome duplications: the impact of dosage-sensitivity on the fate of nuclear genes , 2009, Chromosome Research.

[15]  Michael Freeling,et al.  Bias in plant gene content following different sorts of duplication: tandem, whole-genome, segmental, or by transposition. , 2009, Annual review of plant biology.

[16]  Bengt Sennblad,et al.  The gene evolution model and computing its associated probabilities , 2009, JACM.

[17]  Kousha Etessami,et al.  Recursive Markov chains, stochastic grammars, and monotone systems of nonlinear equations , 2005, JACM.

[18]  D. Liberles,et al.  Whole-Genome Duplications in the Ancestral Vertebrate Are Detectable in the Distribution of Gene Family Sizes of Tetrapod Species , 2008, Journal of Molecular Evolution.

[19]  S. Bottani,et al.  Cellular reactions to gene dosage imbalance: genomic, transcriptomic and proteomic effects. , 2008, Trends in genetics : TIG.

[20]  David A. Liberles,et al.  The power-law distribution of gene family size is driven by the pseudogenisation rate's heterogeneity between gene families. , 2008, Gene.

[21]  Ariel Fernández,et al.  Protein Under-Wrapping Causes Dosage Sensitivity and Decreases Gene Duplicability , 2007, PLoS genetics.

[22]  M. Gerstein,et al.  Analysis of nuclear receptor pseudogenes in vertebrates: how the silent tell their stories. , 2007, Molecular biology and evolution.

[23]  Timothy Hughes,et al.  The Pattern of Evolution of Smaller-Scale Gene Duplicates in Mammalian Genomes is More Consistent with Neo- than Subfunctionalisation , 2007, Journal of Molecular Evolution.

[24]  D. Pearl,et al.  Species trees from gene trees: reconstructing Bayesian posterior distributions of a species phylogeny using estimated gene tree distributions. , 2007, Systematic biology.

[25]  Arne Elofsson,et al.  Evaluating dosage compensation as a cause of duplicate gene retention in Paramecium tetraurelia , 2007, Genome Biology.

[26]  Deyou Zheng,et al.  The ambiguous boundary between genes and pseudogenes: the dead rise up, or do they? , 2007, Trends in genetics : TIG.

[27]  Christian A. Grove,et al.  Insight into transcription factor gene duplication from Caenorhabditis elegans Promoterome-driven expression patterns , 2007, BMC Genomics.

[28]  Lars Arvestad,et al.  Evolution after gene duplication: models, mechanisms, sequences, systems, and organisms. , 2007, Journal of experimental zoology. Part B, Molecular and developmental evolution.

[29]  Philip M. Kim,et al.  Relating Three-Dimensional Structures to Protein Networks Provides Evolutionary Insights , 2006, Science.

[30]  R. Guigó,et al.  Global trends of whole-genome duplications revealed by the ciliate Paramecium tetraurelia , 2006, Nature.

[31]  A. Elofsson,et al.  What properties characterize the hub proteins of the protein-protein interaction network of Saccharomyces cerevisiae? , 2006, Genome Biology.

[32]  Steven Maere,et al.  The gain and loss of genes during 600 million years of vertebrate evolution , 2006, Genome Biology.

[33]  X. Gu,et al.  Intron gain and loss in segmentally duplicated genes in rice , 2006, Genome Biology.

[34]  M. Lynch The origins of eukaryotic gene structure. , 2006, Molecular biology and evolution.

[35]  Matthew J. Betts,et al.  Optimal Gene Trees from Sequences and Species Trees Using a Soft Interpretation of Parsimony , 2006, Journal of Molecular Evolution.

[36]  X. Gu,et al.  Expression divergence between duplicate genes. , 2005, Trends in genetics : TIG.

[37]  Andreas Wagner,et al.  Energy constraints on the evolution of gene expression. , 2005, Molecular biology and evolution.

[38]  D. Liberles,et al.  Subfunctionalization of duplicated genes as a transition state to neofunctionalization , 2005, BMC Evolutionary Biology.

[39]  J. Raes,et al.  Modeling gene and genome duplications in eukaryotes. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[40]  Jianzhi Zhang,et al.  Rapid Subfunctionalization Accompanied by Prolonged and Substantial Neofunctionalization in Duplicate Gene Evolution , 2005, Genetics.

[41]  Wen-Hsiung Li,et al.  Different age distribution patterns of human, nematode, and Arabidopsis duplicate genes. , 2004, Gene.

[42]  I. Craig,et al.  Functional effects of a tandem duplication polymorphism in the 5′flanking region of the DRD4 gene , 2004, Biological Psychiatry.

[43]  Matthew Hurles,et al.  Gene Duplication: The Genomic Trade in Spare Parts , 2004, PLoS biology.

[44]  P. Joyce,et al.  Association of a duplicated repeat polymorphism in the 5′‐untranslated region of the DRD4 gene with novelty seeking , 2004, American journal of medical genetics. Part B, Neuropsychiatric genetics : the official publication of the International Society of Psychiatric Genetics.

[45]  M. Nowak,et al.  Stochastic Tunnels in Evolutionary Dynamics , 2004, Genetics.

[46]  John S. Conery,et al.  The evolutionary demography of duplicate genes , 2004, Journal of Structural and Functional Genomics.

[47]  A. Wagner,et al.  Asymmetric sequence divergence of duplicate genes. , 2003, Genome research.

[48]  C. Pál,et al.  Dosage sensitivity and the evolution of gene families in yeast , 2003, Nature.

[49]  Wen-Hsiung Li,et al.  Divergence in the spatial pattern of gene expression between human duplicate genes. , 2003, Genome research.

[50]  Jianzhi Zhang Evolution by gene duplication: an update , 2003 .

[51]  T. Panavas,et al.  Enhancement of RNA synthesis by promoter duplication in tombusviruses. , 2003, Virology.

[52]  D. Nicolae,et al.  Rapid divergence in expression between duplicate genes inferred from microarray data. , 2002, Trends in genetics : TIG.

[53]  R. Nielsen Mapping mutations on phylogenies. , 2002, Systematic biology.

[54]  Peer Bork,et al.  Common exon duplication in animals and its role in alternative splicing. , 2002, Human molecular genetics.

[55]  R. Veitia,et al.  Exploring the etiology of haploinsufficiency. , 2002, BioEssays : news and reviews in molecular, cellular and developmental biology.

[56]  Erik L. L. Sonnhammer,et al.  Automated ortholog inference from phylogenetic trees and calculation of orthology reliability , 2002, Bioinform..

[57]  A. Force,et al.  The probability of preservation of a newly arisen gene duplicate. , 2001, Genetics.

[58]  E V Koonin,et al.  Origin of alternative splicing by tandem exon duplication. , 2001, Human molecular genetics.

[59]  T. Bardal,et al.  The two myostatin genes of Atlantic salmon (Salmo salar) are expressed in a variety of tissues. , 2001, European journal of biochemistry.

[60]  M. Lynch,et al.  The evolutionary fate and consequences of duplicate genes. , 2000, Science.

[61]  R. Young,et al.  Transcription of eukaryotic protein-coding genes. , 2000, Annual review of genetics.

[62]  A. Force,et al.  The probability of duplicate gene preservation by subfunctionalization. , 2000, Genetics.

[63]  Dannie Durand,et al.  NOTUNG: A Program for Dating Gene Duplications and Optimizing Gene Family Trees , 2000, J. Comput. Biol..

[64]  A. Force,et al.  Preservation of duplicate genes by complementary, degenerative mutations. , 1999, Genetics.

[65]  F J Ayala,et al.  New Drosophila introns originate by duplication. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[66]  G. S. Mudholkar,et al.  A Generalization of the Weibull Distribution with Application to the Analysis of Survival Data , 1996 .

[67]  Steven Henikoff,et al.  Expansions of transgene repeats cause heterochromatin formation and gene silencing in Drosophila , 1994, Cell.

[68]  A. Hughes The evolution of functionally novel proteins after gene duplication , 1994, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[69]  Robert Kay,et al.  Duplication of CaMV 35S Promoter Sequences Creates a Strong Enhancer for Plant Genes , 1987, Science.

[70]  B. Bollobás,et al.  Cliques in random graphs , 1976, Mathematical Proceedings of the Cambridge Philosophical Society.

[71]  Dr. Susumu Ohno Evolution by Gene Duplication , 1970, Springer Berlin Heidelberg.