BackgroundInferring gene orders of ancestral genomes has the potential to provide detailed information about the recent evolution of species descended from them. Current popular tools to infer ancestral genome data (such as GRAPPA and MGR) are all parsimony-based direct optimization methods with the aim to minimize the number of evolutionary events. Recently a new method based on the approach of maximum likelihood is proposed. The current implementation of these direct optimization methods are all based on solving the median problems and achieve more accurate results than the maximum likelihood method. However, both GRAPPA and MGR are extremely time consuming under high rearrangement rates. The maximum likelihood method, on the contrary, runs much faster with less accurate results.ResultsWe propose a mixture method to optimize the inference of ancestral gene orders. This method first uses the maximum likelihood approach to identify gene adjacencies that are likely to be present in the ancestral genomes, which are then fixed in the branch-and-bound search of median calculations. This hybrid approach not only greatly speeds up the direct optimization methods, but also retains high accuracy even when the genomes are evolutionary very distant.ConclusionsOur mixture method produces more accurate ancestral genomes compared with the maximum likelihood method while the computation time is far less than that of the parsimony-based direct optimization methods. It can effectively deal with genome data of relatively high rearrangement rates which is hard for the direct optimization methods to solve in a reasonable amount of time, thus extends the range of data that can be analyzed by the existing methods.
[1]
Costas S. Iliopoulos,et al.
An algorithm for mapping short reads to a dynamically changing genomic sequence
,
2010,
2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).
[2]
Jian Ma.
A probabilistic framework for inferring ancestral genomic orders
,
2010,
2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).
[3]
P. Holland,et al.
Rare genomic changes as a tool for phylogenetics.
,
2000,
Trends in ecology & evolution.
[4]
David A. Bader,et al.
A New Implmentation and Detailed Study of Breakpoint Analysis
,
2000,
Pacific Symposium on Biocomputing.
[5]
Yu Lin,et al.
Estimating true evolutionary distances under rearrangements, duplications, and losses
,
2010,
BMC Bioinformatics.
[6]
Pavel A. Pevzner,et al.
Transforming cabbage into turnip: polynomial algorithm for sorting signed permutations by reversals
,
1995,
JACM.
[7]
David Sankoff,et al.
Decompositions of Multiple Breakpoint Graphs and Rapid Exact Solutions to the Median Problem
,
2008,
WABI.
[8]
P. Pevzner,et al.
Genome-scale evolution: reconstructing gene orders in the ancestral species.
,
2002,
Genome research.
[9]
Alberto Caprara.
The Reversal Median Problem
,
2003,
INFORMS J. Comput..