BLAST Distributed Execution on Partitioned Databases with Primary Fragments

BLAST is one of the most popular computational biology tools. Its execution cost depends heavily on database size, which has grown considerably with recent advances in sequencing methods. The execution of BLAST in distributed and parallel environments such as PC clusters and Grids has been widely investigated as a way to improve performance. This work evaluates a replicated allocation of the sequence database in which each copy is also physically fragmented. We investigate two dynamic workload balancing methods designed around this allocation strategy. Preliminary experimental results show that the approach achieves both a balanced workload and very good performance. We also briefly discuss ideas that would make the approach feasible in Grid computing environments.
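The abstract does not spell out the two balancing methods, so the following is only an illustrative sketch of one plausible scheme, not the authors' algorithm. It assumes that every node holds a full, fragmented replica of the database, that each fragment has one "primary" node that processes it by default, and that a node that runs out of primary fragments takes an unprocessed fragment from the most loaded node. The function name `simulate`, the round-robin primary assignment, and the per-fragment costs are all hypothetical.

```python
import heapq


def simulate(fragment_costs, num_nodes):
    """Simulate dynamic balancing over a replicated, fragmented database.

    Assumption: fragment i is the primary fragment of node i % num_nodes.
    Because every node stores all fragments, an idle node can process any
    fragment that has not been started yet.
    """
    # Per-node queues of primary fragments still waiting to be processed.
    queues = [[] for _ in range(num_nodes)]
    for frag in range(len(fragment_costs)):
        queues[frag % num_nodes].append(frag)

    processed = {node: [] for node in range(num_nodes)}
    finish = [0.0] * num_nodes

    # Min-heap of (time at which the node becomes idle, node id).
    idle = [(0.0, node) for node in range(num_nodes)]
    heapq.heapify(idle)

    def remaining(node):
        # Total cost of the fragments still queued on a node.
        return sum(fragment_costs[f] for f in queues[node])

    while idle:
        now, node = heapq.heappop(idle)
        if queues[node]:
            # Prefer the node's own primary fragments.
            frag = queues[node].pop(0)
        else:
            # Otherwise take work from the node with the most pending cost.
            donor = max(range(num_nodes), key=remaining)
            if not queues[donor]:
                finish[node] = now  # nothing left anywhere: node is done
                continue
            frag = queues[donor].pop()
        processed[node].append(frag)
        heapq.heappush(idle, (now + fragment_costs[frag], node))

    return processed, finish


if __name__ == "__main__":
    processed, finish = simulate([8, 1, 1, 1], num_nodes=2)
    print(processed, finish)
```

With the skewed costs in the usage example, a static primary-only assignment would leave node 0 with 9 cost units and node 1 with 2; in this sketch node 1 picks up node 0's pending fragment once it becomes idle, which shortens the overall completion time.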
