Towards Energy Efficient Data Management in HPC: The Open Ethernet Drive Approach

An Open Ethernet Drive (OED) is a new technology that packages a low-power processor, a fixed-size memory, and an Ethernet card inside a hard drive enclosure (HDD or SSD). In this study, we thoroughly evaluate the performance of such a device and the energy required to operate it. The results show, first, that offloading data-intensive computations to the OED is viable while maintaining reasonable performance and, second, that the energy savings are significant, since the device consumes only 10% of the power needed by a normal server node. We propose that using OEDs as storage servers in HPC can yield a reliable, scalable, cost-efficient, and energy-efficient storage solution.
