Isolating - a new resampling method for gene order data

The purpose of using resampling methods on phylogenetic data is to estimate the confidence value of branches. In recent years, bootstrapping and jackknifing are the two most popular resampling schemes which are widely used in biological reserach. However, for gene order data, traditional bootstrap procedures can not be applied because gene order data is viewed as one character with various states. Experience in the biological community has shown that jackknifing is a useful means of determining the confidence value of a gene order phylogeny. When genomes are distant, however, applying jackknifing tends to give low confidence values to many valid branches, causing them to be mistakenly removed. In this paper, we propose a new method that overcomes this disadvantage of jackknifing and achieves better accuracy and confidence values for gene order data. Compared to jackknifing, our experimental results show that the proposed method can produce phylogenies with lower error rates and much stronger support for good branches. We also establish a theoretic lower bound regarding how many genes should be isolated, which is confirmed empirically.

[1]  Pavel A. Pevzner,et al.  Transforming cabbage into turnip: polynomial algorithm for sorting signed permutations by reversals , 1995, JACM.

[2]  William Arndt,et al.  Gene Order Phylogeny of the Genus Prochlorococcus , 2008, PloS one.

[3]  Bernard M. E. Moret,et al.  Advances in phylogeny reconstruction from gene order and content data. , 2005, Methods in enzymology.

[4]  Andrés Moya,et al.  Genome Rearrangement Distances and Gene Order Phylogeny in γ-Proteobacteria , 2005 .

[5]  Richard Friedberg,et al.  Efficient sorting of genomic permutations by translocation, inversion and block interchange , 2005, Bioinform..

[6]  D. Robinson,et al.  Comparison of phylogenetic trees , 1981 .

[7]  Tandy J. Warnow,et al.  Distance-Based Genome Rearrangement Phylogeny , 2006, Journal of Molecular Evolution.

[8]  David A. Bader,et al.  A detailed study of breakpoint analysis , 2001 .

[9]  P. Pevzner,et al.  Genome-scale evolution: reconstructing gene orders in the ancestral species. , 2002, Genome research.

[10]  Bret Larget,et al.  A Bayesian approach to the estimation of ancestral genome arrangements. , 2005, Molecular phylogenetics and evolution.

[11]  Jian Shi,et al.  Using jackknife to assess the quality of gene order phylogenies , 2010, BMC Bioinformatics.

[12]  P. Holland,et al.  Rare genomic changes as a tool for phylogenetics. , 2000, Trends in ecology & evolution.

[13]  N. Saitou,et al.  The neighbor-joining method: a new method for reconstructing phylogenetic trees. , 1987, Molecular biology and evolution.

[14]  Yu Lin,et al.  Estimating true evolutionary distances under the DCJ model , 2008, ISMB.

[15]  J. Felsenstein CONFIDENCE LIMITS ON PHYLOGENIES: AN APPROACH USING THE BOOTSTRAP , 1985, Evolution; international journal of organic evolution.

[16]  Olivier Gascuel,et al.  Fast and Accurate Phylogeny Reconstruction Algorithms Based on the Minimum-Evolution Principle , 2002, WABI.