Parallel Pipelines for DNA Sequence Alignment on a Cluster of Multicores: A Comparison of Communication Models

HPC (high perfomance computing) based on clusters of multicores is one of the main research lines in parallel programming. It is important to study the impact of programming paradigms of shared memory, message passing or a combination of both on these architectures in order to efficiently exploit the power of these architectures. The Smith-Waterman algorithm is used as study case for the local alignment of DNA sequences, which allows establishing the similarity degree between two sequences. In this paper, the Smith-Waterman algorithm is parallelized by means of a pipeline scheme due to the data dependencies that are inherent to the problem, using the various communication/synchronization models mentioned above and then carrying out a comparative analysis. Finally, experimental results are presented, as well as future research lines.

[1]  M.D. McCool,et al.  Scalable Programming Models for Massively Multicore Processors , 2008, Proceedings of the IEEE.

[2]  Dieter Kranzlmüller,et al.  Distributed and Parallel Systems: Cluster and Grid Computing , 2002 .

[3]  Dhabaleswar K. Panda,et al.  Understanding the Impact of Multi-Core Architecture in Cluster Computing: A Case Study with Intel Dual-Core System , 2007, Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid '07).

[4]  Claudia Leopold,et al.  Parallel and Distributed Computing: A Survey of Models, Paradigms and Approaches , 2008 .

[5]  Suresh Siddha Process Scheduling Challenges in the Era of Multicore Processors , 2007 .

[6]  Claudia Leopold,et al.  Parallel and distributed computing , 2000 .

[7]  Mordechai Ben-Ari,et al.  Principles of concurrent and distributed programming , 2005, PHI Series in computer science.

[8]  Jack Dongarra,et al.  Sourcebook of parallel computing , 2003 .

[9]  George Karypis,et al.  Introduction to Parallel Computing , 1994 .

[10]  Emilio Luque,et al.  Impact of parallel programming models and CPUs clock frequency on energy consumption of HPC systems , 2011, 2011 9th IEEE/ACS International Conference on Computer Systems and Applications (AICCSA).

[11]  Marek Olszewski,et al.  Kendo: efficient deterministic multithreading in software , 2009, ASPLOS.

[12]  Teresa K. Attwood,et al.  Introduction to Bioinformatics , 2001 .

[13]  Emilio Luque,et al.  Dynamic on Demand Virtual Clusters in Grid , 2008, Euro-Par Workshops.

[14]  Michael Miller,et al.  Cloud Computing: Web-Based Applications That Change the Way You Work and Collaborate Online , 2008 .

[15]  Michael Allen,et al.  Parallel programming: techniques and applications using networked workstations and parallel computers , 1998 .

[16]  Jens Volkert,et al.  Distributed and Parallel Systems , 2000 .

[17]  Michael Di Stefano Distributed Data Management for Grid Computing , 2005 .