DASH: a Recipe for a Flash-based Data Intensive Supercomputer

Data intensive computing can be defined as computation involving large datasets and complicated I/O patterns. Data intensive computing is challenging because there is a five-orders-of-magnitude latency gap between main memory DRAM and spinning hard disks; the result is that an inordinate amount of time in data intensive computing is spent accessing data on disk. To address this problem we designed and built a prototype data intensive supercomputer named DASH that exploits flash-based Solid State Drive (SSD) technology and also virtually aggregated DRAM to fill the latency gap . DASH uses commodity parts including Intel® X25-E flash drives and distributed shared memory (DSM) software from ScaleMP®. The system is highly competitive with several commercial offerings by several metrics including achieved IOPS (input output operations per second), IOPS per dollar of system acquisition cost, IOPS per watt during operation, and IOPS per gigabyte (GB) of available storage. We present here an overview of the design of DASH, an analysis of its cost efficiency, then a detailed recipe for how we designed and tuned it for high data-performance, lastly show that running data-intensive scientific applications from graph theory, biology, and astronomy, we achieved as much as two orders-of- magnitude speedup compared to the same applications run on traditional architectures.

[1]  David A. Bader Petascale Computing: Algorithms and Applications , 2007 .

[2]  Alexander S. Szalay,et al.  The Sloan Digital Sky Survey , 1999, Comput. Sci. Eng..

[3]  Jeffrey Bennett,et al.  DASH-IO: an empirical study of flash-based IO for HPC , 2010 .

[4]  Nicholas Carriero,et al.  Linda and Friends , 1986, Computer.

[5]  Rina Panigrahy,et al.  Design Tradeoffs for SSD Performance , 2008, USENIX ATC.

[6]  David A. Bader Petascale Computing (Chapman & Hall/Crc Computational Science Series) , 2007 .

[7]  Kai Li,et al.  IVY: A Shared Virtual Memory System for Parallel Computing , 1988, ICPP.

[8]  F. Collins,et al.  The Human Genome Project: Lessons from Large-Scale Biology , 2003, Science.

[9]  E. al.,et al.  The Sloan Digital Sky Survey: Technical summary , 2000, astro-ph/0006396.

[10]  Alan L. Cox,et al.  TreadMarks: shared memory computing on networks of workstations , 1996 .

[11]  Bruce Margony The Sloan Digital Sky Survey , 1999, Philosophical Transactions of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[12]  Xiaodong Zhang,et al.  Understanding intrinsic characteristics and system implications of flash memory based solid state drives , 2009, SIGMETRICS '09.

[13]  Kai Li,et al.  Retrospective: virtual memory mapped network interface for the SHRIMP multicomputer , 1994, ISCA '98.

[14]  Paul H. Siegel,et al.  Characterizing flash memory: Anomalies, observations, and applications , 2009, 2009 42nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO).

[15]  Jim Gray,et al.  2020 Computing: Science in an exponential world , 2006, Nature.

[16]  Shirley Moore,et al.  Workshop Report: Petascale Computing in the Geosciences in View of the National Science Foundation's Recent Announcement Entitled: Leadership- Class System Acquisition -creating a Petascale Computing Environment for Science and Engineering, Which Calls for Deployment of a Petascale Computational Fac , .

[17]  Kai Shen,et al.  A performance evaluation of scientific I/O workloads on Flash-based SSDs , 2009, 2009 IEEE International Conference on Cluster Computing and Workshops.

[18]  J. Kessenich,et al.  Bit error rate in NAND Flash memories , 2008, 2008 IEEE International Reliability Physics Symposium.

[19]  Jesús Labarta,et al.  Performance Modeling of HPC Applications , 2003, PARCO.

[20]  Aad J. van der Steen,et al.  Overview of recent supercomputers , 2008 .

[21]  A. Szalay Science in an Exponential World , 2008 .

[22]  Gernot Heiser,et al.  vNUMA: A Virtual Shared-Memory Multiprocessor , 2009, USENIX Annual Technical Conference.

[23]  David A. Bader Workshop Report: Petascale Computing in the Biological Sciences , 2006 .