A parallel accelerated approach of HMM Forward Algorithm for IBM Roadrunner clusters

In this work we propose an efficient parallel approach to Hidden Markov Model (HMM) classification on the Cell Broadband Engine (Cell/B.E.) architecture. The hybrid architecture of Cell/B.E. processors makes it possible to develop algorithms with two levels of parallelism. Building on this, we developed and analyzed two techniques for parallelizing the Forward Algorithm: one using the Message Passing Interface (MPI) alone, and one combining MPI with the Synergistic Processing Elements (SPEs) of the Cell/B.E. processors. We evaluate the performance of our parallel approach for Markov models with long sequences and large state spaces on an IBM Roadrunner cluster equipped with PowerXCell 8i processors. The results are encouraging in terms of the parallel efficiency achieved on Cell/B.E. processors.
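For readers unfamiliar with the Forward Algorithm, the following is a minimal Python sketch of the computation and of one generic way to split it across workers. It is not the implementation evaluated in this work: the function names, the column-block partitioning of the state space, and the use of NumPy in place of MPI ranks or SPE kernels are all assumptions made for illustration. Each time step is a matrix-vector product over the state space, so each worker can compute the forward variables for its own block of states, after which the blocks are gathered before the next step.

```python
import numpy as np

def forward_loglik(pi, A, B, obs):
    """Scaled forward algorithm: returns log P(obs | model).

    pi: (N,) initial state distribution
    A:  (N, N) transitions, A[i, j] = P(state j at t+1 | state i at t)
    B:  (N, M) emissions,   B[i, k] = P(symbol k | state i)
    """
    alpha = pi * B[:, obs[0]]
    scale = alpha.sum()
    alpha /= scale                      # rescale to avoid underflow on long sequences
    log_lik = np.log(scale)
    for o in obs[1:]:
        alpha = (alpha @ A) * B[:, o]   # one matrix-vector product per time step
        scale = alpha.sum()
        alpha /= scale
        log_lik += np.log(scale)
    return log_lik

def forward_loglik_partitioned(pi, A, B, obs, n_workers):
    """Same recursion, with the state space split into n_workers blocks.

    Hypothetical illustration: each list element in `parts` stands in for
    the work one MPI rank (or one SPE) would do; np.concatenate stands in
    for the gather step that rebuilds the full alpha vector.
    """
    blocks = np.array_split(np.arange(len(pi)), n_workers)
    alpha = pi * B[:, obs[0]]
    scale = alpha.sum()
    alpha /= scale
    log_lik = np.log(scale)
    for o in obs[1:]:
        # Each worker needs all of alpha but only its own columns of A.
        parts = [(alpha @ A[:, idx]) * B[idx, o] for idx in blocks]
        alpha = np.concatenate(parts)
        scale = alpha.sum()
        alpha /= scale
        log_lik += np.log(scale)
    return log_lik
```

The time steps remain sequential in both versions, so in this sketch the parallelism comes only from the state dimension; that is why large state spaces are the favourable case for this kind of decomposition.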
