DNA mapping using Processor-in-Memory architecture

This paper presents the implementation of a mapping algorithm on a new Processing-in-Memory (PIM) architecture developed by UPMEM Company. UPMEM's solution consists in adding processing units into the DRAM, to minimize data access time and maximize bandwidth, in order to drastically accelerate data-consuming algorithms. The technology developed by UPMEM makes it possible to combine 256 cores with 16 GBytes of DRAM, on a standard DIMM module. An experimentation of DNA Mapping on Human genome dataset shows that a speed-up of 25 can be obtained with UPMEM technology compared to fast mapping software such as BWA, Bowtie2 or NextGenMap running on 16 Intel threads. Experimentation also highlight that data transfer from storage device limits the performances of the implementation. The use of SSD drives can boost the speed-up to 80.

[1]  M. DePristo,et al.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. , 2010, Genome research.

[2]  Carl Ebeling,et al.  Hardware Acceleration of Short Read Mapping , 2012, 2012 IEEE 20th International Symposium on Field-Programmable Custom Computing Machines.

[3]  Yongchao Liu,et al.  CUSHAW: a CUDA compatible short read aligner to large genomes based on the Burrows-Wheeler transform , 2012, Bioinform..

[4]  Véronique Martin,et al.  Mapping Reads on a Genomic Sequence: An Algorithmic Overview and a Practical Comparative Analysis , 2012, J. Comput. Biol..

[5]  Graham Pullan,et al.  BarraCUDA - a fast short read sequence aligner using graphics processing units , 2011, BMC Research Notes.

[6]  Cole Trapnell,et al.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome , 2009, Genome Biology.

[7]  Bairong Shen,et al.  Evaluation and Comparison of Multiple Aligners for Next-Generation Sequencing Data Analysis , 2014, BioMed research international.

[8]  Dominique Lavenier,et al.  MAPPING on UPMEM , 2016 .

[9]  Dominique Lavenier,et al.  BLAST on UPMEM , 2016 .

[10]  S. Nelson,et al.  BFAST: An Alignment Tool for Large Scale Genome Resequencing , 2009, PloS one.

[11]  Wayne Luk,et al.  Leveraging FPGAs for Accelerating Short Read Alignment , 2017, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[12]  Turki Turki,et al.  MaxSSmap: a GPU program for mapping divergent short reads to genomes with the maximum scoring subsequence , 2014, BMC Genomics.

[13]  Arndt von Haeseler,et al.  NextGenMap: fast and accurate read mapping in highly polymorphic genomes , 2013, Bioinform..

[14]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[15]  Siu-Ming Yiu,et al.  SOAP2: an improved ultrafast tool for short read alignment , 2009, Bioinform..

[16]  Katherine Yelick,et al.  A Case for Intelligent RAM: IRAM , 1997 .

[17]  Yongchao Liu,et al.  CUSHAW2-GPU: Empowering Faster Gapped Short-Read Alignment Using GPU Computing , 2014, IEEE Design & Test.

[18]  Ümit V. Çatalyürek,et al.  Benchmarking short sequence mapping tools , 2013, BMC Bioinformatics.

[19]  Christoforos E. Kozyrakis,et al.  A case for intelligent RAM , 1997, IEEE Micro.

[20]  Sungroh Yoon,et al.  Will solid-state drives accelerate your bioinformatics? In-depth profiling, performance analysis and beyond , 2015, Briefings Bioinform..

[21]  Wayne Luk,et al.  Reconfigurable Acceleration of Short Read Mapping , 2013, 2013 IEEE 21st Annual International Symposium on Field-Programmable Custom Computing Machines.

[22]  Thomas K. F. Wong,et al.  SOAP3-dp: Fast, Accurate and Sensitive GPU-Based Short Read Aligner , 2013, PloS one.

[23]  Dominique Lavenier,et al.  GASSST: global alignment short sequence search tool , 2010, Bioinform..

[24]  Mark Harman,et al.  Improving CUDA DNA Analysis Software with Genetic Programming , 2015, GECCO.

[25]  Peter M. Kogge,et al.  Combined DRAM and logic chip for massively parallel systems , 1995, Proceedings Sixteenth Conference on Advanced Research in VLSI.