A high‐performance FPGA‐based BWA‐MEM DNA sequence alignment

Over the last decades, Bioinformatics has been being in its honeymoon phase with more and more new algorithms as well as their improvements proposed. In Bioinformatics, the sequence alignment step is considered as an integral part that directly contributes to the DNA, RNA, or protein identifications. Despite the undeniable enhancements from the provided algorithms and computing architectures in the recent years, it is still far more to state that sequence alignment has already achieved the ideal performance. In this work, we focus on one of the most perfect justifiable steps in a state‐of‐the‐art DNA/RNA alignment algorithm, the seed extension step in the BWA‐MEM algorithm. We propose a high‐speed and less power consumption FPGA‐based IP core that is designed in a pipeline model under various FPGA technologies. Our core is able to operate at more than 200 MHz in almost all FPGA architectures and even up to 529 MHz on a Xilinx Virtex 6 FPGA device. The core can provide speed‐ups by up to 350× when compared with an Intel Core i5 general purpose processor.

[1]  Roderic Guigó,et al.  The GEM mapper: fast, accurate and versatile alignment by filtration , 2012, Nature Methods.

[2]  Cole Trapnell,et al.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome , 2009, Genome Biology.

[3]  Cuong Pham-Quoc,et al.  An FPGA-Based Seed Extension IP Core for BWA-MEM DNA Alignment , 2018, 2018 International Conference on Advanced Computing and Applications (ACOMP).

[4]  Mariagrazia Graziano,et al.  Dynamic Gap Selector: A Smith Waterman Sequence Alignment Algorithm with Affine Gap Model Optimization , 2014, IWBBIO.

[5]  Ying Liu,et al.  High Performance Biological Pairwise Sequence Alignment: FPGA versus GPU versus Cell BE versus GPP , 2012, Int. J. Reconfigurable Comput..

[6]  Yongchao Liu,et al.  Long read alignment based on maximal exact match seeds , 2012, Bioinform..

[7]  Ernst Houtgast,et al.  Heterogeneous hardware/software acceleration of the BWA-MEM DNA alignment algorithm , 2015, 2015 IEEE/ACM International Conference on Computer-Aided Design (ICCAD).

[8]  K. Voelkerding,et al.  Next-generation sequencing: from basic research to diagnostics. , 2009, Clinical chemistry.

[9]  Xin Chang,et al.  Exploring Sequence Alignment Algorithms on FPGA-based Heterogeneous Architectures , 2014, IWBBIO.

[10]  Adam Milik,et al.  An Efficient Hardware Implementation of Smith-Waterman Algorithm Based on the Incremental Approach , 2011 .

[11]  Ernst Houtgast,et al.  An FPGA-based systolic array to accelerate the BWA-MEM genomic mapping algorithm , 2015, 2015 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS).

[12]  M S Waterman,et al.  Identification of common molecular subsequences. , 1981, Journal of molecular biology.

[13]  Heng Li Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM , 2013, 1303.3997.