MPI-HMMER-Boost: Distributed FPGA Acceleration

HMMER, based on the profile Hidden Markov Model (HMM) is one of the most widely used sequence database searching tools, allowing researchers to compare HMMs to sequence databases or sequences to HMM databases. Such searches often take many hours and consume a great number of CPU cycles on modern computers. We present a cluster-enabled hardware/software-accelerated implementation of the HMMER search tool hmmsearch. Our results show that combining the parallel efficiency of a cluster with one or more high-speed hardware accelerators (FPGAs) can significantly improve performance for even the most time consuming searches, often reducing search times from several hours to minutes.

[1]  Patrick Crowley,et al.  Exploiting coarse-grained parallelism to accelerate protein motif finding with a network processor , 2005, 14th International Conference on Parallel Architectures and Compilation Techniques (PACT'05).

[2]  Forum Mpi MPI: A Message-Passing Interface , 1994 .

[3]  E. Lusk,et al.  Installation guide to mpich, a portable implementation of MPI , 1996 .

[4]  Vaidy S. Sunderam,et al.  PVM: A Framework for Parallel Distributed Computing , 1990, Concurr. Pract. Exp..

[5]  SkjellumAnthony,et al.  A high-performance, portable implementation of the MPI message passing interface standard , 1996 .

[6]  William Gropp,et al.  Users guide for mpich, a portable implementation of MPI , 1996 .

[7]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[8]  J. Issa,et al.  A simple method for estimating global DNA methylation using bisulfite PCR of repetitive DNA elements. , 2004, Nucleic acids research.

[9]  R. Stephenson A and V , 1962, The British journal of ophthalmology.

[10]  S. B. Needleman,et al.  A general method applicable to the search for similarities in the amino acid sequence of two proteins. , 1970, Journal of molecular biology.

[11]  Bertil Schmidt,et al.  Accelerating the Viterbi Algorithm for Profile Hidden Markov Models Using Reconfigurable Hardware , 2006, International Conference on Computational Science.

[12]  Greg Burns,et al.  LAM: An Open Cluster Environment for MPI , 2002 .

[13]  Brandon Harris,et al.  Accelerator design for protein sequence HMM search , 2006, ICS '06.

[14]  Pat Hanrahan,et al.  ClawHMMER: A Streaming HMMer-Search Implementatio , 2005, ACM/IEEE SC 2005 Conference (SC'05).

[15]  John Paul Walters,et al.  Accelerating HMMer searches on Opteron processors with minimally invasive recoding , 2006, 20th International Conference on Advanced Information Networking and Applications - Volume 1 (AINA'06).

[16]  Bashar Qudah,et al.  Accelerating the HMMER sequence analysis suite using conventional processors , 2006, 20th International Conference on Advanced Information Networking and Applications - Volume 1 (AINA'06).

[17]  Sean R. Eddy,et al.  Profile hidden Markov models , 1998, Bioinform..

[18]  Guang R. Gao,et al.  Implementing parallel hmm-pfam on the EARTH multithreaded architecture , 2003, Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference. CSB2003.

[19]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[20]  Chittibabu Guda,et al.  SledgeHMMER: a web server for batch searching the Pfam database , 2004, Nucleic Acids Res..

[21]  Sean R. Eddy,et al.  Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids , 1998 .

[22]  Anthony Skjellum,et al.  A High-Performance, Portable Implementation of the MPI Message Passing Interface Standard , 1996, Parallel Comput..

[23]  John Paul Walters,et al.  Optimized Cluster-Enabled HMMER Searches , 2007, Grid Computing for Bioinformatics and Computational Biology.

[24]  Andrew J. Viterbi,et al.  Error bounds for convolutional codes and an asymptotically optimum decoding algorithm , 1967, IEEE Trans. Inf. Theory.

[25]  Pat Hanrahan,et al.  ClawHMMER: A Streaming HMMer-Search Implementation , 2005, SC.