Efficient Parallel Execution of Sequence Similarity Analysis via Dynamic Load Balancing

We present a parallel approach to sequence similarity analysis across a set of genomes. The approach uses dynamic load balancing to address three sources of variation: the execution time required for genes of different lengths and complexity, the processing power of different nodes in the computer cluster, and the other load on those nodes. It runs on a cluster of computers using MPI. We provide experimental results that demonstrate its effectiveness in conjunction with NCBI BLAST.
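To illustrate why dynamic load balancing helps when task costs and node speeds vary, the following is a minimal single-process simulation. It is not the MPI implementation described here, and it does not invoke BLAST. The function names, task costs, and worker speeds are all hypothetical values chosen for illustration. It compares a static round-robin assignment against an on-demand scheme in which the next task always goes to whichever worker becomes idle first.

```python
import heapq


def static_makespan(task_costs, worker_speeds):
    """Pre-assign task i to worker i % n and return the finish time of the slowest worker."""
    n = len(worker_speeds)
    finish = [0.0] * n
    for i, cost in enumerate(task_costs):
        finish[i % n] += cost / worker_speeds[i % n]
    return max(finish)


def dynamic_makespan(task_costs, worker_speeds):
    """Hand each task, in order, to the worker that becomes idle first."""
    # Min-heap of (time_when_idle, worker_id); ties go to the lower worker id.
    idle = [(0.0, w) for w in range(len(worker_speeds))]
    heapq.heapify(idle)
    makespan = 0.0
    for cost in task_costs:
        idle_at, w = heapq.heappop(idle)
        done = idle_at + cost / worker_speeds[w]
        makespan = max(makespan, done)
        heapq.heappush(idle, (done, w))
    return makespan


if __name__ == "__main__":
    # Hypothetical workload: two expensive "genes" among many cheap ones,
    # on a cluster where two of the four nodes are twice as fast.
    costs = [8, 1, 1, 1, 8, 1, 1, 1]
    speeds = [1.0, 1.0, 2.0, 2.0]
    print("static :", static_makespan(costs, speeds))
    print("dynamic:", dynamic_makespan(costs, speeds))
```

With this invented workload, the static assignment places both expensive tasks on the same slow worker, while the on-demand scheme lets a fast, idle worker pick up the second one. In an MPI setting, the same idea is commonly realized with a coordinator process that sends out the next task each time a worker reports completion. That mapping is an assumption of this sketch, not a detail stated above.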