Towards Energy Proportional Cloud for Data Processing Frameworks

Energy efficiency in cloud computing is becoming more and more important for IT operators of data centers. Several effort to use low power machines in the data center level has been explored. Also, data processing frameworks such as MapReduce and Hadoop are frequently used to process data intensive jobs. However, there have not been an extensive study on the impact of low power computers on such data processing frameworks. Actually, development of low power computers is demanding the architectural paradigm shift for cloud applications. In this paper, we evaluate Apache Hadoop on low power machines and study the feasibility of them in cloud systems. We also propose AnSwer (Augmentation and Substitution), an energy saving method to reduce energy consumption by introducing low power machines. In An-Swer, augmentation and substitution complement each other to prevent data loss and to improve overall power consumption.

[1]  P. Varman,et al.  Conserving Energy in Conventional Disk based RAID Systems , 2005 .

[2]  Luiz André Barroso,et al.  The Price of Performance , 2005, ACM Queue.

[3]  GhemawatSanjay,et al.  The Google file system , 2003 .

[4]  Dirk Grunwald,et al.  Massive Arrays of Idle Disks For Storage Archives , 2002, ACM/IEEE SC 2002 Conference (SC'02).

[5]  Bill Broyles Notes , 1907, The Classical Review.

[6]  Jin Qian,et al.  PARAID: A gear-shifting power-aware RAID , 2007, TOS.

[7]  Dong Li,et al.  EERAID: energy efficient redundant and inexpensive disk array , 2004, EW 11.

[8]  Ricardo Bianchini,et al.  Exploiting redundancy to conserve energy in storage systems , 2006, SIGMETRICS '06/Performance '06.

[9]  Jun Wang,et al.  RIMAC: a novel redundancy-based hierarchical cache architecture for energy efficient, high performance storage systems , 2006, EuroSys.

[10]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[11]  Kushagra Vaid,et al.  Web Search Using Small Cores: Quantifying the Price of Efficiency , 2009 .

[12]  Brian Hayes,et al.  What Is Cloud Computing? , 2019, Cloud Technologies.

[13]  Erol Gelenbe,et al.  Energy-Efficient Cloud Computing , 2010, Comput. J..

[14]  D. Andersen,et al.  A Fast Array of Wimpy Nodes , 2008 .

[15]  Robert L. Grossman,et al.  Data mining using high performance data clouds: experimental studies using sector and sphere , 2008, KDD.

[16]  Antony I. T. Rowstron,et al.  Write off-loading: Practical power management for enterprise storage , 2008, TOS.

[17]  Luiz André Barroso,et al.  The Case for Energy-Proportional Computing , 2007, Computer.

[18]  Luiz André Barroso,et al.  The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines , 2009, The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines.

[19]  Dong Li,et al.  eRAID: Conserving Energy in Conventional Disk-Based RAID System , 2008, IEEE Transactions on Computers.

[20]  Ethan L. Miller,et al.  Pergamum: Replacing Tape with Energy Efficient, Reliable, Disk-Based Archival Storage , 2008, FAST.

[21]  Dalit Naor,et al.  Low power mode in cloud storage systems , 2009, 2009 IEEE International Symposium on Parallel & Distributed Processing.

[22]  Yuanyuan Zhou,et al.  Hibernator: helping disk arrays sleep through the winter , 2005, SOSP '05.

[23]  Parthasarathy Ranganathan A Recipe for Efficiency? Some Principles of Power-aware Computing , 2010 .

[24]  Yanpei Chen,et al.  Towards Energy Efficient MapReduce , 2009 .

[25]  Robert L. Grossman,et al.  Sector and Sphere: the design and implementation of a high-performance data cloud , 2009, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[26]  Trevor N. Mudge,et al.  Understanding and Designing New Server Architectures for Emerging Warehouse-Computing Environments , 2008, 2008 International Symposium on Computer Architecture.

[27]  Christoforos E. Kozyrakis,et al.  On the energy (in)efficiency of Hadoop clusters , 2010, OPSR.

[28]  Amar Phanishayee,et al.  FAWN: a fast array of wimpy nodes , 2009, SOSP '09.

[29]  Ravi Kumar,et al.  Pig latin: a not-so-foreign language for data processing , 2008, SIGMOD Conference.