Dataflow Acceleration of Smith-Waterman with Traceback for High Throughput Next Generation Sequencing

Smith-Waterman algorithm is widely adopted by most popular DNA sequence aligners. The inherent algorithm computational intensity and the vast amount of NGS input data it operates on, create a bottleneck in genomic analysis flows for short-read alignment. FPGA architectures have been extensively leveraged to alleviate the problem, each one adopting a different approach. In existing solutions, effective co-design of the NGS short-read alignment still remains an open issue, mainly due to narrow view on real integration aspects, such as system wide communication and accelerator call overheads. In this paper, we propose a dataflow architecture for Smith-Waterman Matrix-fill and Traceback alignment stages, to perform short-read alignment on NGS data. The architectural decision of moving both stages on chip extinguishes the communication overhead, and coupled with radical software restructuring, allows for efficient integration into widely-used Bowtie2 aligner. This approach delivers x18 speedup over the respective Bowtie2 standalone components, while our co-designed Bowtie2 demonstrates a 35% boost in performance.

[1]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[2]  Wayne Luk,et al.  Hardware Acceleration of Genetic Sequence Alignment , 2013, ARC.

[3]  Wayne Luk,et al.  Reconfigurable Acceleration of Short Read Mapping , 2013, 2013 IEEE 21st Annual International Symposium on Field-Programmable Custom Computing Machines.

[4]  Christopher G Chute,et al.  Preemptive genotyping for personalized medicine: design of the right drug, right dose, right time-using genomic data to individualize treatment protocol. , 2014, Mayo Clinic proceedings.

[5]  Wayne Luk,et al.  Reconfigurable acceleration of genetic sequence alignment: A survey of two decades of efforts , 2017, 2017 27th International Conference on Field Programmable Logic and Applications (FPL).

[6]  Jason Cong,et al.  When apache spark meets FPGAs: a case study for next-generation DNA sequencing acceleration , 2016, CloudCom 2016.

[7]  William J. Dally,et al.  Darwin: A Genomics Co-processor Provides up to 15,000X Acceleration on Long Read Assembly , 2018, USENIX Annual Technical Conference.

[8]  Xianyang Jiang,et al.  A Reconfigurable Accelerator for Smith–Waterman Algorithm , 2007, IEEE Transactions on Circuits and Systems II: Express Briefs.

[9]  Mustafa Gök,et al.  Efficient Cell Designs for Systolic Smith-Waterman Implementations , 2006, 2006 International Conference on Field Programmable Logic and Applications.

[10]  Mark D. Robinson,et al.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data , 2009, Bioinform..

[11]  Oliver Pell,et al.  Maximum Performance Computing with Dataflow Engines , 2012, Computing in Science & Engineering.

[12]  M. DePristo,et al.  A framework for variation discovery and genotyping using next-generation DNA sequencing data , 2011, Nature Genetics.

[13]  Steven A. Guccione,et al.  Gene Matching Using JBits , 2002, FPL.

[14]  Steven L Salzberg,et al.  Fast gapped-read alignment with Bowtie 2 , 2012, Nature Methods.

[15]  S. B. Needleman,et al.  A general method applicable to the search for similarities in the amino acid sequence of two proteins. , 1970, Journal of molecular biology.

[16]  Guang R. Gao,et al.  Implementation of the Smith-Waterman algorithm on a reconfigurable supercomputing platform , 2007, HPRCTA.

[17]  Jayoung Kim,et al.  Trends in Next-Generation Sequencing and a New Era for Whole Genome Sequencing , 2016, International neurourology journal.

[18]  Ernst Houtgast,et al.  An FPGA-based systolic array to accelerate the BWA-MEM genomic mapping algorithm , 2015, 2015 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS).

[19]  Heng Li Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM , 2013, 1303.3997.

[20]  Quinn Snell,et al.  Hardware Accelerated Sequence Alignment with Traceback , 2009, Int. J. Reconfigurable Comput..

[21]  Peter M. Athanas,et al.  A run-time reconfigurable system for gene-sequence searching , 2003, 16th International Conference on VLSI Design, 2003. Proceedings..

[22]  Bertil Schmidt,et al.  Next-generation sequencing: big data meets high performance computing. , 2017, Drug discovery today.

[23]  Ying Liu,et al.  A Highly Parameterized and Efficient FPGA-Based Skeleton for Pairwise Biological Sequence Alignment , 2009, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[24]  Wen Tang,et al.  Accelerating Millions of Short Reads Mapping on a Heterogeneous Architecture with FPGA Accelerator , 2012, 2012 IEEE 20th International Symposium on Field-Programmable Custom Computing Machines.

[25]  A. Kasarskis,et al.  A window into third-generation sequencing. , 2010, Human molecular genetics.