SmartSSD: FPGA Accelerated Near-Storage Data Analytics on SSD

Faced with the increasing disparity between SSD throughput and CPU-based compute capabilities, there have been growing interests to move compute closer to storage and accelerate the data analytic workloads. In this letter, we propose SmartSSD, an SSD with onboard FPGA, which enables offloading computation within SSD. We perform a detailed model-based evaluation to evaluate the end-to-end performance and energy benefit of SmartSSD for the representative data analytic workloads with Spark SQL and Parquet columnar data format. Our evaluation shows that SmartSSD has the potential to have a transformative impact when building a high performance data analytic system, which enables 3.04x performance improvement and consuming only 45.8 percent of energy compared to the conventional CPU-based approach.

[1]  Sungjin Lee,et al.  BlueDBM: An appliance for Big Data analytics , 2015, 2015 ACM/IEEE 42nd Annual International Symposium on Computer Architecture (ISCA).

[2]  David J. DeWitt,et al.  Query processing on smart SSDs: opportunities and challenges , 2013, SIGMOD '13.

[3]  Jinyoung Lee,et al.  Biscuit: A Framework for Near-Data Processing of Big Data Workloads , 2016, 2016 ACM/IEEE 43rd Annual International Symposium on Computer Architecture (ISCA).

[4]  Reynold Xin,et al.  Apache Spark , 2016 .