An agent-based solution for dynamic multi-node wavefront balancing in biological sequence comparison

Abstract Many parallel and distributed strategies were created to reduce the execution time of bioinformatics algorithms. One well-known bioinformatics algorithm is the Smith–Waterman, that may be parallelized using the wavefront method. When the wavefront is distributed across many heterogeneous nodes, it must be balanced to create a synchronous data flow. This is a very challenging problem if the nodes have variable computational power. This paper presents an agent-based solution for parallel biological sequence comparison applications that use the multi-node wavefront method. In our approach, autonomous agents are able to identify unbalanced computations and dynamically rebalance the load among the nodes. Two strategies were developed to the balancer agent in order to identify if the computations are balanced, one using global information and other using only local information. The global strategy demands a huge amount of data transfers, incurring in more communication, whereas the local strategy can decide about the balancing status using only local information. The results show that the balancing gains of strategies are very close. Thus, the local strategy is preferred, since it can be implemented in real wavefront balancers with almost the same benefits as the global strategy.

[1]  Qing Yang,et al.  A Collaborative Multiagent System for Mining Transcriptional Regulatory Elements , 2009, IEEE Intelligent Systems.

[2]  O. Gotoh An improved algorithm for matching biological sequences. , 1982, Journal of molecular biology.

[3]  Azzedine Boukerche,et al.  Exact Parallel Alignment of megabase genomic Sequences with Tunable Work Distribution , 2012, Int. J. Found. Comput. Sci..

[4]  Gonzalo Nápoles,et al.  Two-steps learning of Fuzzy Cognitive Maps for prediction and knowledge discovery on the HIV-1 drug resistance , 2014, Expert Syst. Appl..

[5]  Yong Dou,et al.  Optimization schemes and performance evaluation of Smith–Waterman algorithm on CPU, GPU and FPGA , 2012, Concurr. Comput. Pract. Exp..

[6]  Gregory Francis Pfister,et al.  In search of clusters: the coming battle in lowly parallel computing , 1995 .

[7]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[8]  Fumihiko Ino,et al.  Sequence Homology Search Using Fine Grained Cycle Sharing of Idle GPUs , 2012, IEEE Transactions on Parallel and Distributed Systems.

[9]  Michael E. Bratman,et al.  Intention, Plans, and Practical Reason , 1991 .

[10]  D. Mount Bioinformatics: Sequence and Genome Analysis , 2001 .

[11]  Eduard Ayguadé,et al.  Fine-grain parallel megabase sequence comparison with multiple heterogeneous GPUs , 2014, PPoPP '14.

[12]  Deepti Shrimankar,et al.  Parallelizing and Analyzing the Behavior of Sequence Alignment Algorithm on a Cluster of Workstations for Large Datasets , 2013 .

[13]  Wayne Luk,et al.  FPGA-Based Smith-Waterman Algorithm: Analysis and Novel Design , 2011, ARC.

[14]  Javier Bajo,et al.  Automatic knowledge extraction in sequencing analysis with multiagent system and grid computing , 2012, J. Integr. Bioinform..

[15]  Michael Wooldridge,et al.  An Introduction to MultiAgent Systems John Wiley & Sons , 2002 .

[16]  Zoheir Ezziane,et al.  Applications of artificial intelligence in bioinformatics: A review , 2006, Expert Syst. Appl..

[17]  Yi-Ping Phoebe Chen,et al.  Cell cycle phase detection with cell deformation analysis , 2014, Expert Syst. Appl..

[18]  Yongchao Liu,et al.  CUDASW++ 3.0: accelerating Smith-Waterman protein database search by coupling CPU and GPU SIMD instructions , 2013, BMC Bioinformatics.

[19]  Michael Wooldridge,et al.  Introduction to multiagent systems , 2001 .

[20]  Fabio Bellifemine,et al.  Developing Multi-Agent Systems with JADE (Wiley Series in Agent Technology) , 2007 .

[21]  Chien-Feng Huang,et al.  Combinatorial optimization in Biology using Probability Collectives Multi-agent Systems , 2012, Expert Syst. Appl..

[22]  Pierre-Emmanuel Leni,et al.  Case-Based Reasoning adaptation of numerical representations of human organs by interpolation , 2014, Expert Syst. Appl..

[23]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[24]  Agostino Poggi,et al.  Developing Multi-agent Systems with JADE , 2007, ATAL.