BackgroundParalog reduction, the loss of duplicate genes after whole genome duplication (WGD) is a pervasive process. Whether this loss proceeds gene by gene or through deletion of multi-gene DNA segments is controversial, as is the question of fractionation bias, namely whether one homeologous chromosome is more vulnerable to gene deletion than the other.ResultsAs a null hypothesis, we first assume deletion events, on one homeolog only, excise a geometrically distributed number of genes with unknown mean µ, and these events combine to produce deleted runs of length l, distributed approximately as a negative binomial with unknown parameter r, itself a random variable with distribution π(·). A more realistic model requires deletion events on both homeologs distributed as a truncated geometric. We simulate the distribution of run lengths l in both models, as well as the underlying π(r), as a function of µ, and show how sampling l allows us to estimate µ. We apply this to data on a total of 15 genomes descended from 6 distinct WGD events and show how to correct the bias towards shorter runs caused by genome rearrangements. Because of the difficulty in deriving π(·) analytically, we develop a deterministic recurrence to calculate each π(r) as a function of µ and the proportion of unreduced paralog pairs.ConclusionsThe parameter µ can be estimated based on run lengths of single-copy regions. Estimates of µ in real data do not exclude the possibility that duplicate gene deletion is largely gene by gene, although it may sometimes involve longer segments.
[1]
Brian C. Thomas,et al.
Following tetraploidy in an Arabidopsis ancestor, genes were removed preferentially from one homeolog leaving clusters enriched in dose-sensitive genes.
,
2006,
Genome research.
[2]
David Sankoff,et al.
The Reconstruction of Doubled Genomes
,
2003,
SIAM J. Comput..
[3]
Kevin P. Byrne,et al.
The Yeast Gene Order Browser: combining curated homology and syntenic context reveals gene fate in polyploid species.
,
2005,
Genome research.
[4]
David Sankoff,et al.
The collapse of gene complement following whole genome duplication
,
2010,
BMC Genomics.
[5]
J. Chris Pires,et al.
Gene and genome duplications: the impact of dosage-sensitivity on the fate of nuclear genes
,
2009,
Chromosome Research.
[6]
Michael Freeling,et al.
Genomic duplication, fractionation and the origin of regulatory novelty.
,
2004,
Genetics.
[7]
Paulien Hogeweg,et al.
The role of mutational dynamics in genome shrinkage.
,
2007,
Molecular biology and evolution.
[8]
Jake K. Byrnes,et al.
Reorganization of adjacent gene relationships in yeast genomes by whole-genome duplication and gene deletion.
,
2006,
Molecular biology and evolution.