Approaches of enhancing interoperations among high performance computing and big data analytics via augmentation

The dawn of exascale computing and its convergence with big data analytics has greatly spurred research interests. The reasons are straightforward. Traditionally, high performance computing (HPC) systems have been used for scientific applications involving majority of compute-intensive tasks. At the same time, the proliferation of big data resulted into design of data-intensive processing paradigms like Apache big data stack. Big data generating at high pace necessitates faster processing mechanisms for getting insights at a real time. For this, the HPC systems may serve as panacea for solving the big data problems. Though the HPC systems have the capability to give the promising results for big data, directly integrating them with existing data-intensive frameworks like Apache big data stack is not straightforward due to challenges associated with them. This triggers a research on seamlessly integrating these two paradigms based on interoperable framework, programming model, and system architecture. The aim of this paper is to assess a progress made in HPC world as an effort to augment it with big data analytics support. As an outcome of this, the taxonomy showing the factors to be considered for augmenting HPC systems with big data support has been put forth. This paper sheds light upon how big data frameworks can be ported to HPC platforms as a preliminary step towards the convergence of big data and exascale computing ecosystem. The focus is given on research issues related to augmenting HPC paradigms with big data frameworks and corresponding approaches to address those issues. This paper also discusses data-intensive as well as compute-intensive processing paradigms, benchmark suites and workloads, and future directions in the domain of integrating HPC with big data analytics.

[1]  Javier Prades,et al.  On the effect of using rCUDA to provide CUDA acceleration to Xen virtual machines , 2018, Cluster Computing.

[2]  Surendra Byna,et al.  Accelerating Science with the NERSC Burst Buffer Early User Program , 2016 .

[3]  César A. F. De Rose,et al.  Scheduling MapReduce Jobs in HPC Clusters , 2012, Euro-Par.

[4]  Zhiwei Xu,et al.  Can MPI Benefit Hadoop and MapReduce Applications? , 2011, 2011 40th International Conference on Parallel Processing Workshops.

[5]  Jun Wang,et al.  USFD: a unified storage framework for SOAR HPC scientific workflows , 2012, Int. J. Parallel Emergent Distributed Syst..

[6]  K. Chandrasekaran,et al.  Exploring the support for high performance applications in the container runtime environment , 2018, Human-centric Computing and Information Sciences.

[7]  André Brinkmann,et al.  Zeroing memory deallocator to reduce checkpoint sizes in virtualized HPC environments , 2018, The Journal of Supercomputing.

[8]  Teng Wang,et al.  Development of a Burst Buffer System for Data-Intensive Applications , 2015, ArXiv.

[9]  Tom White,et al.  Hadoop: The Definitive Guide , 2009 .

[10]  Dhabaleswar K. Panda,et al.  Mizan-RMA: Accelerating Mizan Graph Processing Framework with MPI RMA , 2016, 2016 IEEE 23rd International Conference on High Performance Computing (HiPC).

[11]  Osamu Tatebe,et al.  PPFS: A Scale-Out Distributed File System for Post-Petascale Systems , 2016, 2016 IEEE 18th International Conference on High Performance Computing and Communications; IEEE 14th International Conference on Smart City; IEEE 2nd International Conference on Data Science and Systems (HPCC/SmartCity/DSS).

[12]  Jinkyu Jeong,et al.  Exploiting GPUs in Virtual Machine for BioCloud , 2013, BioMed research international.

[13]  Allen D. Malony,et al.  Scaling Spark on HPC Systems , 2016, HPDC.

[14]  Frank B. Schmuck,et al.  GPFS: A Shared-Disk File System for Large Computing Clusters , 2002, FAST.

[15]  James Demmel,et al.  Matrix factorizations at scale: A comparison of scientific data analytics in spark and C+MPI using three case studies , 2016, 2016 IEEE International Conference on Big Data (Big Data).

[16]  Song Jiang,et al.  IR+: Removing parallel I/O interference of MPI programs via data replication over heterogeneous storage devices , 2018, Parallel Comput..

[17]  Guangchen Ruan,et al.  Horme: Random Access Big Data Analytics , 2016, 2016 IEEE International Conference on Cluster Computing (CLUSTER).

[18]  Olivier Richard,et al.  Big data and HPC collocation: Using HPC idle resources for Big Data analytics , 2017, 2017 IEEE International Conference on Big Data (Big Data).

[19]  Ewa Deelman,et al.  On the use of burst buffers for accelerating data-intensive scientific workflows , 2017, WORKS@SC.

[20]  Shuaiwen Song,et al.  Scaling Support Vector Machines on modern HPC platforms , 2015, J. Parallel Distributed Comput..

[21]  Juan Touriño,et al.  Analysis and evaluation of MapReduce solutions on an HPC cluster , 2016, Comput. Electr. Eng..

[22]  Michael Matheson,et al.  2016 Ieee International Conference on Big Data (big Data) Big Data Analytics on Hpc Architectures: Performance and Cost , 2022 .

[23]  Feng Luo,et al.  Accelerating big data analytics on HPC clusters using two-level storage , 2017, Parallel Comput..

[24]  R. Rivera-Rodriguez,et al.  Use of Containers for High-Performance Computing , 2018 .

[25]  Nikos Parlavantzas,et al.  Efficient execution of the WRF model and other HPC applications in the cloud , 2016, Earth Science Informatics.

[26]  Kai Ren,et al.  IndexFS: Scaling File System Metadata Performance with Stateless Caching and Bulk Insertion , 2014, SC14: International Conference for High Performance Computing, Networking, Storage and Analysis.

[27]  Tao Lu,et al.  Canopus: A Paradigm Shift Towards Elastic Extreme-Scale Data Analytics on HPC Storage , 2017, 2017 IEEE International Conference on Cluster Computing (CLUSTER).

[28]  Seyong Lee,et al.  PUMA: Purdue MapReduce Benchmarks Suite , 2012 .

[29]  Dhabaleswar K. Panda,et al.  High performance RDMA-based design of HDFS over InfiniBand , 2012, 2012 International Conference for High Performance Computing, Networking, Storage and Analysis.

[30]  Eric W. Biederman,et al.  Multiple Instances of the Global Linux Namespaces , 2010 .

[31]  Paola Caymes-Scutari,et al.  Evolutionary-Statistical System: A parallel method for improving forest fire spread prediction , 2015, J. Comput. Sci..

[32]  Sayantan Sur,et al.  Unifying UPC and MPI runtimes: experience with MVAPICH , 2010, PGAS '10.

[33]  Shantenu Jha,et al.  Hadoop on HPC: Integrating Hadoop and Pilot-Based Dynamic Resource Management , 2016, 2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW).

[34]  Jean Bézivin,et al.  On the unification power of models , 2005, Software & Systems Modeling.

[35]  David M. Eyers,et al.  SCONE: Secure Linux Containers with Intel SGX , 2016, OSDI.

[36]  Pavan Balaji,et al.  Fault tolerant MapReduce-MPI for HPC clusters , 2015, SC15: International Conference for High Performance Computing, Networking, Storage and Analysis.

[37]  Antonio Lioy,et al.  Integrity verification of Docker containers for a lightweight cloud environment , 2019, Future Gener. Comput. Syst..

[38]  Dhabaleswar K. Panda,et al.  Accelerating Spark with RDMA for Big Data Processing: Early Experiences , 2014, 2014 IEEE 22nd Annual Symposium on High-Performance Interconnects.

[39]  Scott Shenker,et al.  Tachyon: Reliable, Memory Speed Storage for Cluster Computing Frameworks , 2014, SoCC.

[40]  Dhabaleswar K. Panda,et al.  A Comprehensive Study of MapReduce Over Lustre for Intermediate Data Placement and Shuffle Strategies on HPC Clusters , 2017, IEEE Transactions on Parallel and Distributed Systems.

[41]  Dhabaleswar K. Panda,et al.  Performance Characterization of Hadoop Workloads on SR-IOV-Enabled Virtualized InfiniBand Clusters , 2016, 2016 IEEE/ACM 3rd International Conference on Big Data Computing Applications and Technologies (BDCAT).

[42]  Dhabaleswar K. Panda,et al.  Accelerating I/O Performance of Big Data Analytics on HPC Clusters through RDMA-Based Key-Value Store , 2015, 2015 44th International Conference on Parallel Processing.

[43]  Jack J. Dongarra,et al.  Exascale computing and big data , 2015, Commun. ACM.

[44]  Jian Zhou,et al.  UNIO: A Unified I/O System Framework for Hybrid Scientific Workflow , 2015, CloudCom-Asia.

[45]  Cong Xu,et al.  Exploiting Analytics Shipping with Virtualized MapReduce on HPC Backend Storage Servers , 2016, IEEE Transactions on Parallel and Distributed Systems.

[46]  John T. Daly,et al.  A higher order estimate of the optimum checkpoint interval for restart dumps , 2006, Future Gener. Comput. Syst..

[47]  Deepa Srinivasan,et al.  Scalable integrity monitoring in virtualized environments , 2010, STC '10.

[48]  Hal S. Stern,et al.  Managing NFS and NIS - help for Unix system administrators: covers NFS version 3 (2. ed.) , 2001 .

[49]  Albert Y. Zomaya,et al.  CloudFlow: A data-aware programming model for cloud workflow applications on modern HPC systems , 2015, Future Gener. Comput. Syst..

[50]  Dhabaleswar K. Panda,et al.  Triple-H: A Hybrid Approach to Accelerate HDFS on HPC Clusters with Heterogeneous Storage Architecture , 2015, 2015 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing.

[51]  Vanessa Sochat,et al.  Singularity: Scientific containers for mobility of compute , 2017, PloS one.

[52]  Feroz Zahid,et al.  Efficient network isolation and load balancing in multi-tenant HPC clusters , 2017, Future Gener. Comput. Syst..

[53]  Larry L. Peterson,et al.  Container-based operating system virtualization: a scalable, high-performance alternative to hypervisors , 2007, EuroSys '07.

[54]  David E. Culler,et al.  SEDA: an architecture for well-conditioned, scalable internet services , 2001, SOSP.

[55]  Dhabaleswar K. Panda,et al.  MR-Advisor: A comprehensive tuning, profiling, and prediction tool for MapReduce execution frameworks on HPC clusters , 2018, J. Parallel Distributed Comput..

[56]  Yong Qi,et al.  Nosv: A lightweight nested-virtualization VMM for hosting high performance computing on cloud , 2017, J. Syst. Softw..

[57]  Jie Huang,et al.  The HiBench benchmark suite: Characterization of the MapReduce-based data analysis , 2010, 2010 IEEE 26th International Conference on Data Engineering Workshops (ICDEW 2010).

[58]  Geoffrey C. Fox,et al.  HPC-ABDS High Performance Computing Enhanced Apache Big Data Stack , 2015, 2015 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing.

[59]  Bruno Raffin,et al.  A Flexible Framework for Asynchronous in Situ and in Transit Analytics for Scientific Simulations , 2014, 2014 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing.

[60]  Muli Ben-Yehuda,et al.  Direct Device Assignment for Untrusted Fully-Virtualized Virtual Machines , 2008 .

[61]  GhemawatSanjay,et al.  The Google file system , 2003 .

[62]  Seung Woo Son,et al.  Reducing I/O variability using dynamic I/O path characterization in petascale storage systems , 2016, The Journal of Supercomputing.

[63]  Dhabaleswar K. Panda,et al.  High-Performance Design of HBase with RDMA over InfiniBand , 2012, 2012 IEEE 26th International Parallel and Distributed Processing Symposium.

[64]  Sayantan Sur,et al.  Memcached Design on High Performance RDMA Capable Interconnects , 2011, 2011 International Conference on Parallel Processing.

[65]  Barbara M. Chapman,et al.  A Comparative Survey of the HPC and Big Data Paradigms: Analysis and Experiments , 2016, 2016 IEEE International Conference on Cluster Computing (CLUSTER).

[66]  Manjunath Gorentla Venkata,et al.  SharP: Towards Programming Extreme-Scale Systems with Hierarchical Heterogeneous Memory , 2017, 2017 46th International Conference on Parallel Processing Workshops (ICPPW).

[67]  Davide Anguita,et al.  Big Data Analytics in the Cloud: Spark on Hadoop vs MPI/OpenMP on Beowulf , 2015, INNS Conference on Big Data.

[68]  Reid Priedhorsky,et al.  Charliecloud: Unprivileged Containers for User-Defined Software Stacks in HPC , 2017, SC17: International Conference for High Performance Computing, Networking, Storage and Analysis.

[69]  Diana Moise,et al.  Experiences with Performing MapReduce Analysis of Scientific Data on HPC Platforms , 2016, DIDC@HPDC.

[70]  Dhabaleswar K. Panda,et al.  Slurm-V: Extending Slurm for Building Efficient HPC Cloud with SR-IOV and IVShmem , 2016, Euro-Par.

[71]  Greg Bronevetsky,et al.  Hybrid MPI: a case study on the Xeon Phi platform , 2014, ROSS@ICS.

[72]  Tao Lu,et al.  Toward Managing HPC Burst Buffers Effectively: Draining Strategy to Regulate Bursty I/O Behavior , 2017, 2017 IEEE 25th International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS).

[73]  Dhabaleswar K. Panda,et al.  HOMR: a hybrid approach to exploit maximum overlapping in MapReduce over high performance interconnects , 2014, ICS '14.

[74]  Dhabaleswar K. Panda,et al.  High-Performance RDMA-based Design of Hadoop MapReduce over InfiniBand , 2013, 2013 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum.

[75]  Wanling Gao,et al.  BigDataBench: A Dwarf-based Big Data and AI Benchmark Suite , 2018, ArXiv.

[76]  Franck Cappello,et al.  Significantly Improving Lossy Compression for Scientific Data Sets Based on Multidimensional Prediction and Error-Controlled Quantization , 2017, 2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS).

[77]  Teng Wang,et al.  TRIO: Burst Buffer Based I/O Orchestration , 2015, 2015 IEEE International Conference on Cluster Computing.

[78]  S. Krishnan myHadoop-Hadoop-on-Demand on Traditional HPC Resources , 2004 .

[79]  Bronis R. de Supinski,et al.  Design, Modeling, and Evaluation of a Scalable Multi-level Checkpointing System , 2010, 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis.

[80]  Franck Cappello,et al.  BlobCR: Virtual disk based checkpoint-restart for HPC applications on IaaS clouds , 2013, J. Parallel Distributed Comput..

[81]  D. Panda,et al.  Can High-Performance Interconnects Benefit Hadoop Distributed File System ? , 2010 .

[82]  Jeffrey Shafer,et al.  I/O virtualization bottlenecks in cloud computing today , 2010 .

[83]  Shadi Ibrahim,et al.  Eley: On the Effectiveness of Burst Buffers for Big Data Processing in HPC Systems , 2017, 2017 IEEE International Conference on Cluster Computing (CLUSTER).

[84]  Dhabaleswar K. Panda,et al.  High Performance Design for HDFS with Byte-Addressability of NVM and RDMA , 2016, ICS.

[85]  Dhabaleswar K. Panda,et al.  SOR-HDFS: a SEDA-based approach to maximize overlapping in RDMA-enhanced HDFS , 2014, HPDC '14.

[86]  Brett A. Bryan,et al.  Application note: Parallelization and optimization of spatial analysis for large scale environmental model data assembly , 2012 .

[87]  Geoffrey C. Fox,et al.  Big Data, Simulations and HPC Convergence , 2015, WBDB.

[88]  Christian Engelmann,et al.  Big Data Meets HPC Log Analytics: Scalable Approach to Understanding Systems at Extreme Scale , 2017, 2017 IEEE International Conference on Cluster Computing (CLUSTER).

[89]  Tao Yang,et al.  The Panasas ActiveScale Storage Cluster - Delivering Scalable High Bandwidth Storage , 2004, Proceedings of the ACM/IEEE SC2004 Conference.

[90]  Robert B. Ross,et al.  On the role of burst buffers in leadership-class storage systems , 2012, 012 IEEE 28th Symposium on Mass Storage Systems and Technologies (MSST).

[91]  Ching-Hsien Hsu,et al.  On implementation of GPU virtualization using PCI pass-through , 2012, 4th IEEE International Conference on Cloud Computing Technology and Science Proceedings.

[92]  Dhabaleswar K. Panda,et al.  MapReduce over Lustre: Can RDMA-Based Approach Benefit? , 2014, Euro-Par.

[93]  Anwar Haque,et al.  Large-scale machine learning based on functional networks for biomedical big data with high performance computing platforms , 2015, J. Comput. Sci..

[94]  Frank Bellosa,et al.  Virtual InfiniBand clusters for HPC clouds , 2012, CloudCP '12.

[95]  Dhabaleswar K. Panda,et al.  Designing MPI Library with On-Demand Paging (ODP) of InfiniBand: Challenges and Benefits , 2016, SC16: International Conference for High Performance Computing, Networking, Storage and Analysis.

[96]  Alfonso Pérez,et al.  Serverless computing for container-based architectures , 2018, Future Gener. Comput. Syst..

[97]  Surya S. Durbha,et al.  Big data processing using hpc for remote sensing disaster data , 2016, 2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[98]  Philippe Merle,et al.  Model-Driven Management of Docker Containers , 2016, 2016 IEEE 9th International Conference on Cloud Computing (CLOUD).

[99]  Arie Shoshani,et al.  In situ data processing for extreme-scale computing , 2011 .

[100]  Teng Wang,et al.  Characterization and Optimization of Memory-Resident MapReduce on HPC Systems , 2014, 2014 IEEE 28th International Parallel and Distributed Processing Symposium.

[101]  Johannes Albrecht Challenges for the LHC Run 3: Computing and Algorithms , 2016 .

[102]  Guillaume Aupy,et al.  Periodic I/O Scheduling for Super-Computers , 2017, PMBS@SC.

[103]  James Demmel,et al.  Matrix Factorization at Scale: a Comparison of Scientific Data Analytics in Spark and C+MPI Using Three Case Studies , 2016, ArXiv.

[104]  Zhiguang Chen,et al.  Experiences of Converging Big Data Analytics Frameworks with High Performance Computing Systems , 2018, SCFA.

[105]  Rashid Mehmood,et al.  Big Data and HPC Convergence: The Cutting Edge and Outlook , 2017 .

[106]  Lavanya Ramakrishnan,et al.  MARIANE: Using MApReduce in HPC environments , 2014, Future Gener. Comput. Syst..

[107]  Robert Latham,et al.  Leveraging burst buffer coordination to prevent I/O interference , 2016, 2016 IEEE 12th International Conference on e-Science (e-Science).

[108]  Geoffrey C. Fox,et al.  SPIDAL Java: high performance data analytics with Java and MPI on large multicore HPC clusters , 2016, SpringSim.

[109]  Rajeev Thakur,et al.  CHAIO: Enabling HPC Applications on Data-Intensive File Systems , 2012, 2012 41st International Conference on Parallel Processing.

[110]  Shantenu Jha,et al.  SAGA: A standardized access layer to heterogeneous Distributed Computing Infrastructure , 2015 .

[111]  Jure Leskovec,et al.  Hidden factors and hidden topics: understanding rating dimensions with review text , 2013, RecSys.

[112]  Arie Shoshani,et al.  Hello ADIOS: the challenges and lessons of developing leadership class I/O frameworks , 2014, Concurr. Comput. Pract. Exp..

[113]  PetersonLarry,et al.  Container-based operating system virtualization , 2007 .

[114]  Soonwook Hwang,et al.  Accelerating a Burst Buffer Via User-Level I/O Isolation , 2017, 2017 IEEE International Conference on Cluster Computing (CLUSTER).

[115]  Ian T. Foster Computing Just What You Need: Online Data Analysis and Reduction at Extreme Scales , 2017, HiPC.

[116]  Dhabaleswar K. Panda,et al.  Is Singularity-based Container Technology Ready for Running MPI Applications on HPC Clouds? , 2017, UCC.

[117]  Omer F. Rana,et al.  Client-Side Scheduling Based on Application Characterization on Kubernetes , 2017, GECON.

[118]  Zheguang Zhao,et al.  Bridging the Gap between HPC and Big Data frameworks , 2017, Proc. VLDB Endow..

[119]  Robert L. Grossman,et al.  Distributing the Sloan Digital Sky Survey Using UDT and Sector , 2006, 2006 Second IEEE International Conference on e-Science and Grid Computing (e-Science'06).

[120]  Xian-He Sun,et al.  PortHadoop: Support direct HPC data processing in Hadoop , 2015, 2015 IEEE International Conference on Big Data (Big Data).

[121]  Robert B. Ross,et al.  PVFS: A Parallel File System for Linux Clusters , 2000, Annual Linux Showcase & Conference.

[122]  Dhabaleswar K. Panda,et al.  High Performance VMM-Bypass I/O in Virtual Machines , 2006, USENIX Annual Technical Conference, General Track.

[123]  Srinivas Devadas,et al.  Intel SGX Explained , 2016, IACR Cryptol. ePrint Arch..

[124]  Dhabaleswar K. Panda,et al.  High-Performance Design of YARN MapReduce on Modern HPC Clusters with Lustre and RDMA , 2015, 2015 IEEE International Parallel and Distributed Processing Symposium.

[125]  Yuqiong Sun,et al.  Security Namespace: Making Linux Security Frameworks Available to Containers , 2018, USENIX Security Symposium.

[126]  Dhabaleswar K. Panda,et al.  Can Parallel Replication Benefit Hadoop Distributed File System for High Performance Interconnects? , 2013, 2013 IEEE 21st Annual Symposium on High-Performance Interconnects.

[127]  Gordon S. Blair,et al.  Using lightweight virtual machines to achieve resource adaptation in middleware , 2011, IET Softw..

[128]  Nicholas J. Wright,et al.  Architecture and Design of Cray DataWarp , 2016 .

[129]  Rafiqul Haque,et al.  CedCom: A high-performance architecture for Big Data applications , 2014, 2014 IEEE/ACS 11th International Conference on Computer Systems and Applications (AICCSA).

[130]  John Shalf,et al.  Using IOR to analyze the I/O Performance for HPC Platforms , 2007 .

[131]  Shantenu Jha,et al.  RADICAL-Pilot: Scalable Execution of Heterogeneous and Dynamic Workloads on Supercomputers , 2015, ArXiv.

[132]  Cong Xu,et al.  JVM-Bypass for Efficient Hadoop Shuffling , 2013, 2013 IEEE 27th International Symposium on Parallel and Distributed Processing.

[133]  Michael Shuey,et al.  Containers in Research: Initial Experiences with Lightweight Infrastructure , 2016, XSEDE.

[134]  Edward B. Duffy,et al.  JUMMP: Job Uninterrupted Maneuverable MapReduce Platform , 2013, 2013 IEEE International Conference on Cluster Computing (CLUSTER).

[135]  Nathan Regola,et al.  Recommendations for Virtualization Technologies in High Performance Computing , 2010, 2010 IEEE Second International Conference on Cloud Computing Technology and Science.

[136]  Hongming Cai,et al.  GAI: A Centralized Tree-Based Scheduler for Machine Learning Workload in Large Shared Clusters , 2018, ICA3PP.

[137]  Nikolay Malitsky Bringing the HPC reconstruction algorithms to Big Data platforms , 2016, 2016 New York Scientific Data Summit (NYSDS).

[138]  Helen D. Karatza,et al.  Combining containers and virtual machines to enhance isolation and extend functionality on cloud computing , 2019, Future Gener. Comput. Syst..

[139]  Trent Jaeger,et al.  Design and Implementation of a TCG-based Integrity Measurement Architecture , 2004, USENIX Security Symposium.

[140]  Shantenu Jha,et al.  A Comprehensive Perspective on the Pilot-Job Abstraction , 2015, ArXiv.

[141]  Hairong Kuang,et al.  The Hadoop Distributed File System , 2010, 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST).

[142]  Martin L. Kersten,et al.  Next generation of Exascale-class systems: ExaNeSt project and the status of its interconnect and storage development , 2018, Microprocess. Microsystems.

[143]  Teng Wang,et al.  An Ephemeral Burst-Buffer File System for Scientific Applications , 2016, SC16: International Conference for High Performance Computing, Networking, Storage and Analysis.

[144]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[145]  Judy Qiu,et al.  A Tale of Two Data-Intensive Paradigms: Applications, Abstractions, and Architectures , 2014, 2014 IEEE International Congress on Big Data.

[146]  Satoshi Matsuoka,et al.  A User-Level InfiniBand-Based File System and Checkpoint Strategy for Burst Buffers , 2014, 2014 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing.

[147]  Dhabaleswar K. Panda,et al.  NVMD: Non-volatile memory assisted design for accelerating MapReduce and DAG execution frameworks on HPC systems , 2017, 2017 IEEE International Conference on Big Data (Big Data).

[148]  Jun Wang,et al.  SideIO: A Side I/O system framework for hybrid scientific workflow , 2017, J. Parallel Distributed Comput..

[149]  Teng Wang,et al.  BurstMem: A high-performance burst buffer system for scientific applications , 2014, 2014 IEEE International Conference on Big Data (Big Data).

[150]  Shantenu Jha,et al.  P∗: A model of pilot-abstractions , 2012, 2012 IEEE 8th International Conference on E-Science.

[151]  Richard O. Sinnott,et al.  A performance comparison of container-based technologies for the Cloud , 2017, Future Gener. Comput. Syst..

[152]  Dhabaleswar K. Panda,et al.  High-Performance Design of Hadoop RPC with RDMA over InfiniBand , 2013, 2013 42nd International Conference on Parallel Processing.

[153]  Dhabaleswar K. Panda,et al.  A 1 PB/s file system to checkpoint three million MPI tasks , 2013, HPDC.

[154]  Shadi Ibrahim,et al.  Improving the Effectiveness of Burst Buffers for Big Data Processing in HPC Systems with Eley , 2018, Future Gener. Comput. Syst..

[155]  Andrew J. Hutton,et al.  Lustre: Building a File System for 1,000-node Clusters , 2003 .

[156]  Robert B. Ross,et al.  FusionFS: Toward supporting data-intensive scientific applications on extreme-scale high-performance computing systems , 2014, 2014 IEEE International Conference on Big Data (Big Data).