Minimum Recombination Histories by Branch and Bound

Recombination plays an important role in creating genetic diversity within species, and inferring past recombination events is central to many problems in genetics. Given a set M of sampled sequences, finding an evolutionary history for M with the minimum number of recombination events is a computationally very challenging problem. In this paper, we present a novel branch and bound algorithm for tackling that problem. Our method is shown to be far more efficient than the only preexisting exact method, described in [1]. Our software implementing the algorithm discussed in this paper is publicly available.

[1]  M. Kimura The number of heterozygous nucleotide sites maintained in a finite population due to steady flux of mutations. , 1969, Genetics.

[2]  M. Kreitman,et al.  Nucleotide polymorphism at the alcohol dehydrogenase locus of Drosophila melanogaster , 1983, Nature.

[3]  R. Hudson,et al.  Statistical properties of the number of recombination events in the history of a sample of DNA sequences. , 1985, Genetics.

[4]  Dan Gusfield,et al.  Efficient algorithms for inferring evolutionary trees , 1991, Networks.

[5]  P. Donnelly,et al.  Progress in population genetics and human evolution , 1997 .

[6]  R. Griffiths,et al.  An ancestral recombination graph , 1997 .

[7]  A. Chakravarti It's raining SNPs, hallelujah? , 1998, Nature Genetics.

[8]  E. Boerwinkle,et al.  DNA sequence diversity in a 9.7-kb region of the human lipoprotein lipase gene , 1998, Nature Genetics.

[9]  Kaizhong Zhang,et al.  Perfect Phylogenetic Networks with Recombination , 2001, J. Comput. Biol..

[10]  Yun S. Song,et al.  Parsimonious Reconstruction of Sequence Evolution and Haplotype Blocks , 2003, WABI.

[11]  R. Griffiths,et al.  Bounds on the minimum number of recombination events in a sample history. , 2003, Genetics.

[12]  Yun S. Song,et al.  On the minimum number of recombination events in the evolutionary history of DNA sequences , 2004, Journal of mathematical biology.

[13]  Peter Donnelly,et al.  Application of Coalescent Methods to Reveal Fine-Scale Rate Variation and Recombination Hotspots , 2004, Genetics.

[14]  Dan Gusfield,et al.  Optimal, Efficient Reconstruction of Phylogenetic Networks with Constrained Recombination , 2004, J. Bioinform. Comput. Biol..

[15]  Rita Casadio,et al.  Algorithms in Bioinformatics, 5th International Workshop, WABI 2005, Mallorca, Spain, October 3-6, 2005, Proceedings , 2005, WABI.

[16]  Vineet Bafna,et al.  Improved Recombination Lower Bounds for Haplotype Data , 2005, RECOMB.

[17]  Yun S. Song,et al.  Efficient computation of close lower and upper bounds on the minimum number of recombinations in biological sequence evolution , 2005, ISMB.