Using jackknife to assess the quality of gene order phylogenies

Background: In recent years, gene order data has attracted increasing attention from both biologists and computer scientists as a new type of data for phylogenetic analysis. If gene orders are viewed as one character with a large number of states, traditional bootstrap procedures cannot be applied. Researchers have therefore begun to use a jackknife resampling method to assess the quality of gene order phylogenies.

Results: In this paper, we design and conduct a set of experiments to validate the performance of this jackknife procedure and discuss how to conduct it properly. Our results show that jackknife is very useful for determining the confidence level of a phylogeny obtained from gene orders, and that a jackknife rate of 40% should be used. However, although a branch with a support value of 85% can be trusted, low-support branches require careful investigation before being discarded.

Conclusions: Our experiments show that jackknife is indeed necessary and useful for gene order data, yet some caution should be taken when the results are interpreted.
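To make the resampling step concrete, the following is a minimal Python sketch of one way a gene order jackknife could be set up. It is not the authors' implementation. It assumes that the jackknife rate denotes the fraction of genes removed, that each genome is a list of signed integers, and that the same genes are removed from every genome in a replicate. The names `jackknife_replicate` and `branch_support` are hypothetical, and tree reconstruction for each replicate is left to whatever method is in use.

```python
import random

def jackknife_replicate(genomes, rate=0.4, rng=None):
    """Return a copy of `genomes` with a random fraction `rate` of genes removed.

    `genomes` maps a genome name to a list of signed gene ids (assumed layout).
    The same genes are dropped from every genome; order and signs are kept.
    """
    rng = rng or random.Random()
    genes = sorted({abs(g) for order in genomes.values() for g in order})
    n_drop = int(round(rate * len(genes)))
    dropped = set(rng.sample(genes, n_drop))
    return {name: [g for g in order if abs(g) not in dropped]
            for name, order in genomes.items()}

def branch_support(replicate_splits, reference_splits):
    """Fraction of replicate trees that contain each reference bipartition.

    Each tree is represented as a set of bipartitions (frozensets of taxa).
    """
    n = len(replicate_splits)
    return {split: sum(split in rep for rep in replicate_splits) / n
            for split in reference_splits}
```

In use, one would draw many replicates with `jackknife_replicate`, reconstruct a tree from each, convert each tree to its set of bipartitions, and pass those sets to `branch_support` together with the bipartitions of the tree built from the full data.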
