论文信息 - Parallelizing the Phylogeny Problem

Parallelizing the Phylogeny Problem

The problem of determining the evolutionary history of species in the form of phylogenetic trees is known as the phylogeny problem. We present a parallelization of the character compatibility method for solving the phylogeny problem. Abstractly, the algorithm searches through all subsets of characters, which may be traits like opposable thumbs or DNA sequence values, looking for a maximal consistent subset. The notion of consistency in this case is the existence of a particular kind of phylogenetic tree called a perfect phylogeny tree. The two challenges to achieving an efficient implementation are load balancing and efficient sharing of information to enable pruning. In both cases, there is a trade-off between communication overhead and the quality of the solution. For load balancing we use a distributed task queue, which has imperfect load information but avoids centralization bottlenecks. For sharing pruning information, we use a distributed trie, which also avoids centralization but maintains incomplete information. We evaluate several implementations of the trie, the best of which achieves speedups of 50 on a 64-processor CM-5.

J.A. Jones | K.A. Yelick

[1] V. Rich. Personal communication , 1989, Nature.

[2] Katherine Yelick,et al. Data Structures for Irregular Applications , 1993 .

[3] Katherine Yelick,et al. Randomized load balancing for tree-structured computation , 1994, Proceedings of IEEE Scalable High Performance Computing Conference.

[4] Jean Vuillemin,et al. A data structure for manipulating priority queues , 1978, CACM.

[5] Katherine A. Yelick,et al. Parallelizing the Phylogeny Problem , 1995, SC.

[6] Eric A. Brewer,et al. How to get good performance from the CM-5 data network , 1994, Proceedings of 8th International Parallel Processing Symposium.

[7] Michael R. Fellows,et al. Two Strikes Against Perfect Phylogeny , 1992, ICALP.

[8] J. M. Hvllot. Associative commutative pattern matching , 1979, IJCAI 1979.

[9] David Fernández-Baca,et al. A Polynomial-Time Algorithm for the Perfect Phylogeny Problem when the Number of Character States is Fixed , 1993, FOCS.

[10] W. J. Quesne,et al. A Method of Selection of Characters in Numerical Taxonomy , 1969 .

[11] J. Felsenstein. Numerical Methods for Inferring Evolutionary Trees , 1982, The Quarterly Review of Biology.

[12] Jean-Marie Hullot,et al. Associative Commutative Pattern Matching , 1979, IJCAI.

[13] Fred R. McMorris,et al. Triangulating vertex colored graphs , 1994, SODA '93.