Compressive Sensing on Storage Data: An Effective Solution to Alleviate I/0 Bottleneck in Data- Intensive Workloads

The gap between computation speed and I/O access on modern computing systems imposes processing limitations in data-intensive applications. Employing high-end memory has proven not to enhance the performance for I/O bound applications, given the low utilization of memory bandwidth in such applications, as highlighted in recent studies. Despite several solutions to improve the performance of storage, none of them is able to shift the bottleneck from the I/O access to the memory subsystem for I/O bound applications. In this paper, we show that in the case of data-intensive multimedia applications, by using Compressive Sensing (CS), a lossy data compression method, the bottleneck is lifted from the storage, increasing the bandwidth utilization of the memory to gain further performance improvement from a high-end memory. The reconstruction of compressed data is however time and memory consuming. To address this challenge, we employ and compare the hardware and software acceleration of Orthogonal Matching Pursuit (OMP), a greedy algorithm, which solves the problem by choosing the most significant variable to reduce the least square error. Our implementation results show that CS increases memory bandwidth utilization by 1.4x and using high bandwidth memory results in 24% performance improvement. Overall, the proposed solution of CS of storage data with FPGA accelerator achieves up to 45% speedup in an end-to-end implementation by only 4.6% accuracy degradation.

[1]  Houman Homayoun,et al.  A comprehensive memory analysis of data intensive workloads on server class architecture , 2018, MEMSYS.

[2]  Jaejin Lee,et al.  25.2 A 1.2V 8Gb 8-channel 128GB/s high-bandwidth memory (HBM) stacked DRAM with effective microbump I/O test methods using 29nm process and TSV , 2014, 2014 IEEE International Solid-State Circuits Conference Digest of Technical Papers (ISSCC).

[3]  Jiaxing Zhang,et al.  Impression Store: Compressive Sensing-based Storage for Big Data Analytics , 2014, HotCloud.

[4]  Wei Tang,et al.  Improving I/O Performance with Adaptive Data Compression for Big Data Applications , 2014, 2014 IEEE International Parallel & Distributed Processing Symposium Workshops.

[5]  Hassan Ghasemzadeh,et al.  Design Space Exploration for Hardware Acceleration of Machine Learning Applications in MapReduce , 2018, 2018 IEEE 26th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM).

[6]  Yuqing Zhu,et al.  BigDataBench: A big data benchmark suite from internet services , 2014, 2014 IEEE 20th International Symposium on High Performance Computer Architecture (HPCA).

[7]  Houman Homayoun,et al.  Main-Memory Requirements of Big Data Applications on Commodity Server Platform , 2018, 2018 18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID).

[8]  Ioannis Kontoyiannis,et al.  Pattern matching and lossy data compression on random fields , 2003, IEEE Trans. Inf. Theory.

[9]  Scott Shenker,et al.  Spark: Cluster Computing with Working Sets , 2010, HotCloud.

[10]  Dilpreet Singh,et al.  A survey on platforms for big data analytics , 2014, Journal of Big Data.

[11]  Kenneth Moreland,et al.  Sandia National Laboratories , 2000 .

[12]  Holger Rauhut,et al.  A Mathematical Introduction to Compressive Sensing , 2013, Applied and Numerical Harmonic Analysis.

[13]  Jian Pei,et al.  A spatiotemporal compression based approach for efficient big data processing on Cloud , 2014, J. Comput. Syst. Sci..

[14]  Houman Homayoun,et al.  A parallel and reconfigurable architecture for efficient OMP compressive sensing reconstruction , 2014, GLSVLSI '14.

[15]  Lin Cai,et al.  Scalable Video Coding with Compressive Sensing for Wireless Videocast , 2011, 2011 IEEE International Conference on Communications (ICC).

[16]  C. L. Philip Chen,et al.  Data-intensive applications, challenges, techniques and technologies: A survey on Big Data , 2014, Inf. Sci..

[17]  Hassan Ghasemzadeh,et al.  Architectural considerations for FPGA acceleration of machine learning applications in MapReduce , 2018, SAMOS.

[18]  Houman Homayoun,et al.  Understanding the role of memory subsystem on performance and energy-efficiency of Hadoop applications , 2017, 2017 Eighth International Green and Sustainable Computing Conference (IGSC).

[19]  Raziel Haimi-Cohen,et al.  Image Compression Based on Compressive Sensing: End-to-End Comparison With JPEG , 2017, IEEE Transactions on Multimedia.

[20]  Vladimir Vlassov,et al.  Performance Characterization of In-Memory Data Analytics on a Modern Cloud Server , 2015, 2015 IEEE Fifth International Conference on Big Data and Cloud Computing.

[21]  Bogdan Nicolae,et al.  High Throughput Data-Compression for Cloud Storage , 2010, Globe.

[22]  G. Blelloch Introduction to Data Compression * , 2022 .

[23]  Houman Homayoun,et al.  MeNa: A memory navigator for modern hardware in a scale-out environment , 2017, 2017 IEEE International Symposium on Workload Characterization (IISWC).

[24]  Wei Tang,et al.  FlexAnalytics: A Flexible Data Analytics Framework for Big Data Applications with I/O Performance Improvement , 2014, Big Data Res..

[25]  Deepak Jain,et al.  Hardware Based Compression in Big Data , 2016, 2016 Data Compression Conference (DCC).

[26]  Hubert Kaeslin,et al.  High-speed compressed sensing reconstruction on FPGA using OMP and AMP , 2012, 2012 19th IEEE International Conference on Electronics, Circuits, and Systems (ICECS 2012).

[27]  Hosein Mohammadi Makrani Storage and Memory Characterization of Data Intensive Workloads for Bare Metal Cloud , 2018, ArXiv.

[28]  Houman Homayoun,et al.  Memory requirements of hadoop, spark, and MPI based big data applications on commodity server class architectures , 2017, 2017 IEEE International Symposium on Workload Characterization (IISWC).

[29]  Peter A. Boncz,et al.  Faster across the PCIe bus: a GPU library for lightweight decompression: including support for patched compression schemes , 2017, DaMoN.

[30]  Tinoosh Mohsenin,et al.  Accelerating compressive sensing reconstruction OMP algorithm with CPU, GPU, FPGA and domain specific many-core , 2015, 2015 IEEE International Symposium on Circuits and Systems (ISCAS).

[31]  Valery Sklyarov,et al.  Parallel FPGA-Based Implementation of Recursive Sorting Algorithms , 2010, 2010 International Conference on Reconfigurable Computing and FPGAs.

[32]  Hari Angepat,et al.  A cloud-scale acceleration architecture , 2016, 2016 49th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO).

[33]  Houman Homayoun,et al.  Customized Machine Learning-Based Hardware-Assisted Malware Detection in Embedded Devices , 2018, 2018 17th IEEE International Conference On Trust, Security And Privacy In Computing And Communications/ 12th IEEE International Conference On Big Data Science And Engineering (TrustCom/BigDataSE).