State-of-the-Art GPGPU Applications in Bioinformatics

The exponential growth of available biological data in recent years coupled with their increasing complexity has made their analysis a computationally challenging process. Traditional central processing unist CPUs are reaching their limit in processing power and are not designed primarily for multithreaded applications. Graphics processing units GPUs on the other hand are affordable, scalable computer powerhouses that, thanks to the ever increasing demand for higher quality graphics, have yet to reach their limit. Typically high-end CPUs have 8-16 cores, whereas GPUs can have more than 2,500 cores. GPUs are also, by design, highly parallel, multicore and multithreaded, able of handling thousands of threads doing the same calculation on different subsets of a large data set. This ability is what makes them perfectly suited for biological analysis tasks. Lately this potential has been realized by many bioinformatics researches and a huge variety of tools and algorithms have been ported to GPUs, or designed from the ground up to maximize the usage of available cores. Here, we present a comprehensive review of available bioinformatics tools ranging from sequence and image analysis to protein structure prediction and systems biology that use NVIDIA Compute Unified Device Architecture CUDA general-purpose computing on graphics processing units GPGPU programming language.

[1]  Dario Ghersi,et al.  SITEHOUND-web: a server for ligand binding site identification in protein structures , 2009, Nucleic Acids Res..

[2]  Matthew He,et al.  Symmetrical Analysis Techniques for Genetic Systems and Bioinformatics: Advanced Patterns and Applications , 2009 .

[3]  Fan Meng,et al.  The gputools package enables GPU computing in R , 2010, Bioinform..

[4]  Chi-Ren Shyu,et al.  Accelerating large-scale protein structure alignments with graphics processing units , 2012, BMC Research Notes.

[5]  Sheng-Ta Lee,et al.  GPU-Based Cloud Service for Smith-Waterman Algorithm Using Frequency Distance Filtration Scheme , 2013, BioMed research international.

[6]  Peter Bankhead,et al.  cudaMap: a GPU accelerated program for gene expression connectivity mapping , 2013, BMC Bioinformatics.

[7]  Y. Blume,et al.  Docking small ligands to molecule of the plant FtsZ protein: Application of the CUDA technology for faster computations , 2012, Cytology and Genetics.

[8]  Edans Flavius de Oliveira Sandes,et al.  CUDAlign: using GPU to accelerate the comparison of megabase genomic sequences , 2010, PPoPP '10.

[9]  Takako Takeda,et al.  High performance transcription factor-DNA docking with GPU computing , 2012, Proteome Science.

[10]  Yongchao Liu,et al.  CUDASW++ 3.0: accelerating Smith-Waterman protein database search by coupling CPU and GPU SIMD instructions , 2013, BMC Bioinformatics.

[11]  Yongchao Liu,et al.  CUSHAW: a CUDA compatible short read aligner to large genomes based on the Burrows-Wheeler transform , 2012, Bioinform..

[12]  Li Lan,et al.  Classification of breast lesions with multimodality computer-aided diagnosis: observer study results on an independent clinical data set. , 2006, Radiology.

[13]  Siu-Ming Yiu,et al.  SOAP3: ultra-fast GPU-based parallel alignment tool for short reads , 2012, Bioinform..

[14]  Christian N. S. Pedersen,et al.  GPU-accelerated high-accuracy molecular docking using guided differential evolution: real world applications , 2011, GECCO '11.

[15]  Dominique Lavenier,et al.  GPU Accelerated RNA Folding Algorithm , 2009, ICCS.

[16]  Amitabh Varshney,et al.  Parallel, stochastic measurement of molecular surface area. , 2008, Journal of molecular graphics & modelling.

[17]  Michael P. H. Stumpf,et al.  GPU accelerated biochemical network simulation , 2011, Bioinform..

[18]  M. Giger,et al.  Multimodality computer-aided breast cancer diagnosis with FFDM and DCE-MRI. , 2010, Academic radiology.

[19]  Lorenzo Dematté,et al.  GPU computing for systems biology , 2010, Briefings Bioinform..

[20]  Weiguo Liu,et al.  Parallel mutual information estimation for inferring gene regulatory networks on GPUs , 2011, BMC Research Notes.

[21]  Shan Chang,et al.  A Parallel Molecular Docking Approach Based on Graphic Processing Unit , 2010, 2010 4th International Conference on Bioinformatics and Biomedical Engineering.

[22]  Sergios Theodoridis,et al.  Pattern Recognition, Fourth Edition , 2008 .

[23]  Takashi Ishida,et al.  GHOSTM: A GPU-Accelerated Homology Search Tool for Metagenomics , 2012, PloS one.

[24]  Julie C. Mitchell,et al.  CUSA and CUDE: GPU-Accelerated Methods for Estimating Solvent Accessible Surface Area and Desolvation , 2009, J. Comput. Biol..

[25]  Alexander A. Kantardjiev Quantum.Ligand.Dock: protein–ligand docking with quantum entanglement refinement on a GPU system , 2012, Nucleic Acids Res..

[26]  Wu-chun Feng,et al.  Accelerating electrostatic surface potential calculation with multi-scale approximation on graphics processing units. , 2010, Journal of molecular graphics & modelling.

[27]  Yongchao Liu,et al.  MSA-CUDA: Multiple Sequence Alignment on Graphics Processing Units with CUDA , 2009, 2009 20th IEEE International Conference on Application-specific Systems, Architectures and Processors.

[28]  Weiguo Liu,et al.  GPU-ClustalW: Using Graphics Hardware to Accelerate Multiple Sequence Alignment , 2006, HiPC.

[29]  Paola Lecca,et al.  Systemic Approaches in Bioinformatics and Computational Systems Biology: Recent Advances , 2011 .

[30]  Peter B. Noël,et al.  GPU-based cone beam computed tomography , 2010, Comput. Methods Programs Biomed..

[31]  Brett A. McKinney,et al.  Real-world comparison of CPU and GPU implementations of SNPrank: a network analysis tool for GWAS , 2011, Bioinform..

[32]  Ahmad Taher Azar,et al.  Statistical Analysis for Radiologists’ Interpretations Variability in Mammograms , 2012 .

[33]  John D. Owens,et al.  Fast Deformable Registration on the GPU: A CUDA Implementation of Demons , 2008, 2008 International Conference on Computational Sciences and Its Applications.

[34]  Pradeep Dubey,et al.  High-Performance 3D Compressive Sensing MRI Reconstruction Using Many-Core Architectures , 2011, Int. J. Biomed. Imaging.

[35]  Nicholas A. Hamilton,et al.  Fast Parallel Markov Clustering in Bioinformatics Using Massively Parallel Computing on GPU with CUDA and ELLPACK-R Sparse Format , 2012, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[36]  Amitabh Varshney,et al.  High-throughput sequence alignment using Graphics Processing Units , 2007, BMC Bioinformatics.

[37]  Takashi Ishida,et al.  MEGADOCK-GPU: Acceleration of Protein-Protein Docking Calculation on GPUs , 2013, BCB.

[38]  Regina Berretta,et al.  GPU-FS-kNN: A Software Tool for Fast and Scalable kNN Computation Using GPUs , 2012, PloS one.

[39]  Klaus Schulten,et al.  Multilevel summation of electrostatic potentials using graphics processing units , 2009, Parallel Comput..

[40]  D. Hill,et al.  Medical image registration , 2001, Physics in medicine and biology.

[41]  Chris S. Haley,et al.  EpiGPU: exhaustive pairwise epistasis scans parallelized on consumer level graphics cards , 2011, Bioinform..

[42]  Jens Stoye,et al.  Exact and complete short-read alignment to microbial genomes using Graphics Processing Unit programming , 2011, Bioinform..

[43]  Mile Šikić,et al.  SW#–GPU-enabled exact alignments on genome scale , 2013, Bioinform..

[44]  Pat Hanrahan,et al.  ClawHMMER: A Streaming HMMer-Search Implementatio , 2005, ACM/IEEE SC 2005 Conference (SC'05).

[45]  Yongchao Liu,et al.  DecGPU: distributed error correction on massively parallel graphics processing units using CUDA and MPI , 2011, BMC Bioinformatics.

[46]  Erika Cule,et al.  ABC-SysBio—approximate Bayesian computation in Python with GPU support , 2010, Bioinform..

[47]  Jean-Stéphane Varré,et al.  Parallel Position Weight Matrices algorithms , 2011, Parallel Comput..

[48]  Cole Trapnell,et al.  Optimizing data intensive GPGPU computations for DNA sequence alignment , 2009, Parallel Comput..

[49]  David W. Ritchie,et al.  Ultra-fast FFT protein docking on graphics processors , 2010, Bioinform..

[50]  H. Chan,et al.  Multi-modality CADx: ROC study of the effect on radiologists' accuracy in characterizing breast masses on mammograms and 3D ultrasound images. , 2009, Academic radiology.

[51]  Zhaohui S. Qin,et al.  GPUmotif: An Ultra-Fast and Energy-Efficient Motif Analysis Program Using Graphics Processing Units , 2012, PloS one.

[52]  Yongchao Liu,et al.  CUDASW++: optimizing Smith-Waterman sequence database searches for CUDA-enabled graphics processing units , 2009, BMC Research Notes.

[53]  Nikolaos V. Sahinidis,et al.  GPU-BLAST: using graphics processors to accelerate protein sequence alignment , 2010, Bioinform..

[54]  Jacek Blazewicz,et al.  Protein alignment algorithms with an efficient backtracking routine on multiple GPUs , 2011, BMC Bioinformatics.

[55]  Jiří Filipovič,et al.  FastGrid -- The Accelerated AutoGrid Potential Maps Generation for Molecular Docking , 2009 .

[56]  Harry A. Stern,et al.  Accelerating calculations of RNA secondary structure partition functions using GPUs , 2013, Algorithms for Molecular Biology.

[57]  Yongchao Liu,et al.  CUDASW++2.0: enhanced Smith-Waterman protein database search on CUDA-enabled GPUs based on SIMT and virtualized SIMD abstractions , 2010, BMC Research Notes.

[58]  Weiguo Liu,et al.  CUDA-BLASTP: Accelerating BLASTP on CUDA-Enabled Graphics Hardware , 2011, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[59]  Robert Giegerich,et al.  GPU Parallelization of Algebraic Dynamic Programming , 2009, PPAM.

[60]  Witold R. Rudnicki,et al.  An efficient implementation of Smith Waterman algorithm on GPU using CUDA, for massively parallel scanning of sequence databases , 2009, 2009 IEEE International Symposium on Parallel & Distributed Processing.

[61]  Athanasia Pavlopoulou,et al.  In Silico Analysis of the CST6 Tumor Suppressor Gene , 2013 .

[62]  Sotiris Ioannidis,et al.  Gnort: High Performance Network Intrusion Detection Using Graphics Processors , 2008, RAID.

[63]  Liang Li,et al.  A cone-beam tomography system with a reduced size planar detector: a backprojection-filtration reconstruction algorithm as well as numerical and practical experiments. , 2007, Applied radiation and isotopes : including data, instrumentation and methods for use in agriculture, industry and medicine.

[64]  Nikos Dimitropoulos,et al.  Multimodality GPU-based computer-assisted diagnosis of breast cancer using ultrasound and digital mammography images , 2013, International Journal of Computer Assisted Radiology and Surgery.

[65]  Weiguo Liu,et al.  Quality-score guided error correction for short-read sequencing data using CUDA , 2010, ICCS.

[66]  David P. Sexton,et al.  Managing and Analyzing Next-Generation Sequence Data , 2009, PLoS Comput. Biol..

[67]  Graham Pullan,et al.  BarraCUDA - a fast short read sequence aligner using graphics processing units , 2011, BMC Research Notes.

[68]  Roger D. Chamberlain,et al.  Accelerating HMMER on GPUs by implementing hybrid data and task parallelism , 2010, BCB '10.

[69]  Jan Schröder,et al.  Genome analysis SHREC : a short-read error correction method , 2009 .

[70]  Christian N. S. Pedersen,et al.  GPU-Accelerated High-Accuracy Molecular Docking Using Guided Differential Evolution , 2013, Massively Parallel Evolutionary Computation on GPGPUs.

[71]  Antonino Tumeo,et al.  Accelerating DNA analysis applications on GPU clusters , 2010, 2010 IEEE 8th Symposium on Application Specific Processors (SASP).

[72]  Jean-Philippe Thirion,et al.  Image matching as a diffusion process: an analogy with Maxwell's demons , 1998, Medical Image Anal..

[73]  Kouros Owzar,et al.  permGPU: Using graphics processing units in RNA microarray association studies , 2010, BMC Bioinformatics.

[74]  Giorgio Valle,et al.  CUDA compatible GPU cards as efficient hardware accelerators for Smith-Waterman sequence alignment , 2008, BMC Bioinformatics.

[75]  Bruno Lévy,et al.  GPU‐accelerated atom and dynamic bond visualization using hyperballs: A unified algorithm for balls, sticks, and hyperboloids , 2011, J. Comput. Chem..

[76]  Yongchao Liu,et al.  CUDA-MEME: Accelerating motif discovery in biological sequences using CUDA-enabled graphics processing units , 2010, Pattern Recognit. Lett..

[77]  Peter M. Kasson,et al.  GROMACS 4.5: a high-throughput and highly parallel open source molecular simulation toolkit , 2013, Bioinform..

[78]  Tom R. Halfhill NVIDIA's Next-Generation CUDA Compute and Graphics Architecture, Code-Named Fermi, Adds Muscle for Parallel Processing , 2009 .

[79]  Donald F. Specht,et al.  Probabilistic neural networks and the polynomial Adaline as complementary techniques for classification , 1990, IEEE Trans. Neural Networks.

[80]  Yongchao Liu,et al.  Mapping of BLASTP Algorithm onto GPU Clusters , 2011, 2011 IEEE 17th International Conference on Parallel and Distributed Systems.

[81]  Lazaros Mavridis,et al.  HexServer: an FFT-based protein docking server powered by graphics processors , 2010, Nucleic Acids Res..

[82]  Karen R. Khar,et al.  Fast Docking on Graphics Processing Units via Ray-Casting , 2013, PloS one.

[83]  Ying-Tsang Lo,et al.  Protein-ligand binding region prediction (PLB-SAVE) based on geometric features and CUDA acceleration , 2013, BMC Bioinformatics.

[84]  Stephen W. Poole,et al.  Acceleration of the Smith-Waterman algorithm using single and multiple graphics processors , 2010, J. Comput. Phys..

[85]  Bing Zhang,et al.  Fast network centrality analysis using GPUs , 2011, BMC Bioinformatics.

[86]  Jacek Blazewicz,et al.  G-MSA - A GPU-based, fast and accurate algorithm for multiple sequence alignment , 2013, J. Parallel Distributed Comput..

[87]  S. Salzberg,et al.  Versatile and open software for comparing large genomes , 2004, Genome Biology.

[88]  Weiguo Liu,et al.  Accelerating error correction in high-throughput short-read DNA sequencing data with CUDA , 2009, 2009 IEEE International Symposium on Parallel & Distributed Processing.

[89]  Doohwan Oh,et al.  GPU-Friendly Parallel Genome Matching with Tiled Access and Reduced State Transition Table , 2012, International Journal of Parallel Programming.

[90]  Yu Li,et al.  Identification of cavities on protein surface using multiple computational approaches for drug binding site prediction , 2011, Bioinform..

[91]  Can Yang,et al.  GBOOST: a GPU-based tool for detecting gene-gene interactions in genome-wide case control studies , 2011, Bioinform..

[92]  Weiguo Liu,et al.  A Parallel Algorithm for Error Correction in High-Throughput Short-Read Data on CUDA-Enabled Graphics Hardware , 2010, J. Comput. Biol..