FPGA-based hardware acceleration for local complexity analysis of massive genomic data

While genomics have significantly advanced modern biological achievements, it requires extensive computational power, traditionally employed on large-scale cluster machines as well as multi-core systems. However, emerging research results show that FPGA-based acceleration of algorithms for genomic applications greatly improves the performance and energy efficiency when compared to multi-core systems and clusters. In this work, we present a parallel, hardware acceleration architecture of the CAST (Complexity Analysis of Sequence Tracts) algorithm, employed by biologists for complexity analysis of protein sequences encoded in genomic data. CAST is used for detecting (and subsequently masking) low-complexity regions (LCRs) in protein sequences. We designed and implemented the CAST accelerator architecture and built an FPGA prototype, with the purpose of benchmarking its performance against serial and multithreaded implementations of the CAST algorithm in software. The proposed architecture achieves remarkable speedup compared to both serial and multithreaded software CAST implementations ranging from approx. 100x-5000x, depending on the system configuration and the dataset features, such as low-complexity content and sequence length distribution. Such performance may enable complex analyses of voluminous sequence datasets, and has the potential to interoperate with other hardware architectures for protein sequence analysis.

[1]  Wu-chun Feng,et al.  The design, implementation, and evaluation of mpiBLAST , 2003 .

[2]  Elaine R Mardis,et al.  Anticipating the $1,000 genome , 2006, Genome Biology.

[3]  George L.-T. Chiu,et al.  Overview of the Blue Gene/L system architecture , 2005, IBM J. Res. Dev..

[4]  John Wawrzynek,et al.  Bridging the GPGPU-FPGA efficiency gap , 2011, FPGA '11.

[5]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[6]  Partha Pratim Pande,et al.  Network-on-Chip Hardware Accelerators for Biological Sequence Alignment , 2010, IEEE Transactions on Computers.

[7]  John C. Wootton,et al.  Sequences with ‘unusual’ amino acid compositions , 1994 .

[8]  Anton J. Enright,et al.  An efficient algorithm for large-scale detection of protein families. , 2002, Nucleic acids research.

[9]  Burkhard Rost,et al.  Sequence conserved for subcellular localization , 2002, Protein science : a publication of the Protein Society.

[10]  Apostolos Dollas,et al.  A General Reconfigurable Architecture for the BLAST Algorithm , 2007, J. VLSI Signal Process..

[11]  M. Morris,et al.  The Design , 1998 .

[12]  Wilfried Haerty,et al.  Low-complexity sequences and single amino acid repeats: not just "junk" peptide sequences. , 2010, Genome.

[13]  Yong Dou,et al.  FPGA accelerator for protein secondary structure prediction based on the GOR algorithm , 2011, BMC Bioinformatics.

[14]  Robert C. Edgar,et al.  BIOINFORMATICS APPLICATIONS NOTE , 2001 .

[15]  Nikolaos V. Sahinidis,et al.  GPU-BLAST: using graphics processors to accelerate protein sequence alignment , 2010, Bioinform..

[16]  Eugene W. Myers,et al.  Basic local alignment search tool. Journal of Molecular Biology , 1990 .

[17]  Greg Brown,et al.  A performance and energy comparison of FPGAs, GPUs, and multicores for sliding-window applications , 2012, FPGA '12.

[18]  Dionisios N. Pnevmatikatos,et al.  A rate-based prefiltering approach to blast acceleration , 2008, 2008 International Conference on Field Programmable Logic and Applications.

[20]  M. Metzker Sequencing technologies — the next generation , 2010, Nature Reviews Genetics.

[21]  Ying Liu,et al.  A high performance fpga-based implementation of position specific iterated blast , 2009, FPGA '09.

[22]  S. Karlin,et al.  Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes. , 1990, Proceedings of the National Academy of Sciences of the United States of America.

[23]  Theocharis Theocharides,et al.  Towards systolic hardware acceleration for local complexity analysis of massive genomic data , 2012, GLSVLSI '12.

[24]  Chris Sander,et al.  CAST: an iterative algorithm for the complexity analysis of sequence tracts , 2000, Bioinform..

[25]  John C. Wootton,et al.  Statistics of Local Complexity in Amino Acid Sequences and Sequence Databases , 1993, Comput. Chem..

[26]  J. Gibrat,et al.  GOR method for predicting protein secondary structure from amino acid sequence. , 1996, Methods in enzymology.

[27]  Martin C. Herbordt,et al.  Single pass streaming BLAST on FPGAs , 2007, Parallel Comput..