论文信息 - Big vs little core for energy-efficient Hadoop computing

Big vs little core for energy-efficient Hadoop computing

Abstract Emerging big data applications require a significant amount of server computational power. However, the rapid growth in the data yields challenges to process them efficiently using current high-performance server architectures. Furthermore, physical design constraints, such as power and density, have become the dominant limiting factor for scaling out servers. Heterogeneous architectures that combine big Xeon cores with little Atom cores have emerged as a promising solution to enhance energy-efficiency by allowing each application to run on an architecture that matches resource needs more closely than a one-size-fits-all architecture. Therefore, the question of whether to map the application to big Xeon or little Atom in heterogeneous server architecture becomes important. In this paper, through a comprehensive system level analysis, we first characterize Hadoop-based MapReduce applications on big Xeon and little Atom-based server architectures to understand how the choice of big vs little cores is affected by various parameters at application, system and architecture levels and the interplay among these parameters. Second, we study how the choice between big and little core changes across various phases of MapReduce tasks. Furthermore, we show how the choice of most efficient core for a particular MapReduce phase changes in the presence of accelerators. The characterization analysis helps guiding scheduling decisions in future cloud-computing environment equipped with heterogeneous multicore architectures and accelerators. We have also evaluated the operational and the capital cost to understand how performance, power and area constraints for big data analytics affect the choice of big vs little core server as a more cost and energy efficient architecture.

[1] Tulika Mitra,et al. Disjoint Pattern Enumeration for Custom Instructions Identification , 2007, 2007 International Conference on Field Programmable Logic and Applications.

[2] Houman Homayoun,et al. Managing distributed UPS energy for effective power capping in data centers , 2012, 2012 39th Annual International Symposium on Computer Architecture (ISCA).

[3] Chaitanya K. Baru,et al. Setting the Direction for Big Data Benchmark Standards , 2012, TPCTC.

[4] Avesta Sasan,et al. Big vs little core for energy-efficient Hadoop computing , 2017, Design, Automation & Test in Europe Conference & Exhibition (DATE), 2017.

[5] Jie Huang,et al. The HiBench benchmark suite: Characterization of the MapReduce-based data analysis , 2010, 2010 IEEE 26th International Conference on Data Engineering Workshops (ICDEW 2010).

[6] Kushal Datta,et al. Energy efficient scheduling of MapReduce workloads on heterogeneous clusters , 2011, GCM '11.

[7] Toshimori Honjo,et al. Hardware acceleration of Hadoop MapReduce , 2013, 2013 IEEE International Conference on Big Data.

[8] Babak Falsafi,et al. Clearing the clouds: a study of emerging scale-out workloads on modern hardware , 2012, ASPLOS XVII.

[9] Kushagra Vaid,et al. Web search using mobile cores: quantifying and mitigating the price of efficiency , 2010, ISCA.

[10] Roy H. Campbell,et al. ARIA: automatic resource inference and allocation for mapreduce environments , 2011, ICAC '11.

[11] Jordi Torres,et al. GreenHadoop: leveraging green energy in data-processing frameworks , 2012, EuroSys '12.

[12] Babak Falsafi,et al. Toward Dark Silicon in Servers , 2011, IEEE Micro.

[13] Vagelis Hristidis,et al. Efficient near-duplicate document detection using FPGAs , 2013, 2013 IEEE International Conference on Big Data.

[14] Scott B. Baden,et al. Redefining the Role of the CPU in the Era of CPU-GPU Integration , 2012, IEEE Micro.

[15] Anshul Kumar,et al. Instruction Selection in ASIP Synthesis Using Functional Matching , 2010, 2010 23rd International Conference on VLSI Design.

[16] Eric S. Chung,et al. LINQits: big data on little clients , 2013, ISCA.

[17] Tulika Mitra,et al. Scalable custom instructions identification for instruction-set extensible processors , 2004, CASES '04.

[18] Timothy G. Armstrong,et al. LinkBench: a database benchmark based on the Facebook social graph , 2013, SIGMOD '13.

[19] Gang Lu,et al. CloudRank-D: benchmarking and ranking cloud computing systems for data processing applications , 2012, Frontiers of Computer Science.