Accelerating ncRNA homology search with FPGAs

Over the last decade, the number of known biologically important non-coding RNAs (ncRNAs) has increased by orders of magnitude. The function performed by a specific ncRNA is partially determined by its structure, defined by which nucleotides of the molecule form pairs. These correlations may span large and variable distances in the linear RNA molecule. Because of these characteristics, algorithms that search for ncRNAs belonging to known families are computationally expensive, often taking many CPU weeks to run. To improve the speed of this search, multiple search algorithms arranged into a series of progressively more stringent filters can be used. In this paper, we present an FPGA based implementation of some of these algorithms. This is the first FPGA based approach to attempt to accelerate multiple filters used in ncRNA search. The FPGA is reconfigured for each filter, resulting in a total system speedup of 25x when compared with a single CPU.

[1]  Zasha Weinberg,et al.  Faster genome annotation of non-coding RNA families without loss of accuracy , 2004, RECOMB.

[2]  R D Klausner,et al.  Structure and dynamics of the iron responsive element RNA: implications for binding of the RNA by iron regulatory binding proteins. , 1997, Journal of molecular biology.

[3]  Sean R. Eddy,et al.  Infernal 1.0: inference of RNA alignments , 2009, Bioinform..

[4]  Bertil Schmidt,et al.  Integrating FPGA acceleration into HMMer , 2008, Parallel Comput..

[5]  Sean R. Eddy,et al.  Infernal 1.0: inference of RNA alignments , 2009, Bioinform..

[6]  Yong Dou,et al.  Fine-grained parallel RNA secondary structure prediction using SCFGs on FPGA , 2010, Parallel Comput..

[7]  Patrice Quinton,et al.  Accelerating HMMER on FPGA using parallel prefixes and reductions , 2010, 2010 International Conference on Field-Programmable Technology.

[8]  Young H. Cho,et al.  Hardware-Accelerated RNA Secondary-Structure Alignment , 2010, TRETS.

[9]  D. Bartel MicroRNAs Genomics, Biogenesis, Mechanism, and Function , 2004, Cell.

[10]  M. Esteller Non-coding RNAs in human disease , 2011, Nature Reviews Genetics.

[11]  Zasha Weinberg,et al.  Sequence-based heuristics for faster annotation of non-coding RNA families , 2006, Bioinform..

[12]  J. Mattick The Genetic Signatures of Noncoding RNAs , 2009, PLoS genetics.

[13]  Sean R. Eddy,et al.  Profile hidden Markov models , 1998, Bioinform..

[14]  Jean-Cédric Chappelier,et al.  An FPGA-based coprocessor for the parsing of context-free grammars , 2000, Proceedings 2000 IEEE Symposium on Field-Programmable Custom Computing Machines (Cat. No.PR00871).

[15]  R. Durbin,et al.  RNA sequence analysis using covariance models. , 1994, Nucleic acids research.

[16]  A. Hüttenhofer,et al.  The expanding snoRNA world. , 2002, Biochimie.

[17]  Sean R. Eddy,et al.  Rfam: annotating non-coding RNAs in complete genomes , 2004, Nucleic Acids Res..

[18]  Daniel H. Younger,et al.  Recognition and Parsing of Context-Free Languages in Time n^3 , 1967, Inf. Control..

[19]  Tsutomu Maruyama,et al.  Accelerating HMMER search using FPGA , 2009, 2009 International Conference on Field Programmable Logic and Applications.

[20]  S. Eddy Computational Genomics of Noncoding RNA Genes , 2002, Cell.

[21]  Jr. G. Forney,et al.  The viterbi algorithm , 1973 .

[22]  Eric P. Nawrocki,et al.  Structural rna homology search and alignment using covariance models , 2009 .

[23]  Sean R. Eddy,et al.  A memory-efficient dynamic programming algorithm for optimal alignment of a sequence to an RNA secondary structure , 2002, BMC Bioinformatics.

[24]  Karl S. Hemmert,et al.  A comparison of floating point and logarithmic number systems for FPGAs , 2005, 13th Annual IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM'05).