Low-power amdahl-balanced blades for data intensive computing

Enterprise and scientific data sets double every year, forcing similar growths in storage size and power consumption. As a consequence, current system architectures used to build data warehouses are about to hit a power consumption wall. In this paper we propose an alternative architecture comprising large number of so-called Amdahl blades that combine energy-efficient CPUs with solid state disks to increase sequential read I/O throughput by an order of magnitude while keeping power consumption constant. We also show that while keeping the total cost of ownership constant, Amdahl blades offer five times the throughput of a state-of-theart computing cluster for data-intensive applications. Finally, using the scaling laws originally postulated by Amdahl, we show that systems for data-intensive computing must maintain a balance between low power consumption and per-server throughput to optimize performance perWatt.

[1]  Jim Gray,et al.  TerraServer Bricks - A High Availability Cluster Alternative , 2004 .

[2]  Pedro Furtado Algorithms for efficient processing of complex queries in node-partitioned data warehouses , 2004, Proceedings. International Database Engineering and Applications Symposium, 2004. IDEAS '04..

[3]  Christoforos E. Kozyrakis,et al.  Models and Metrics to Enable Energy-Efficiency Optimizations , 2007, Computer.

[4]  Christoforos E. Kozyrakis,et al.  JouleSort: a balanced energy-efficiency benchmark , 2007, SIGMOD '07.

[5]  Gene M. Amdahl Computer Architecture and Amdahl's Law , 2007 .

[6]  Gene M. Amdahl,et al.  Computer Architecture and Amdahl's Law , 2007, Computer.

[7]  Sriram Sankar,et al.  Intra-disk Parallelism: An Idea Whose Time Has Come , 2008, 2008 International Symposium on Computer Architecture.

[8]  Trevor N. Mudge,et al.  Understanding and Designing New Server Architectures for Emerging Warehouse-Computing Environments , 2008, 2008 International Symposium on Computer Architecture.

[9]  Alexander S. Szalay,et al.  GrayWulf: Scalable Clustered Architecture for Data Intensive Computing , 2009, 2009 42nd Hawaii International Conference on System Sciences.

[10]  Amar Phanishayee,et al.  FAWNdamentally Power-efficient Clusters , 2009, HotOS.

[11]  Gordon Bell,et al.  Beyond the Data Deluge , 2009, Science.

[12]  J. Mervis Peer review. Beyond the data. , 2011, Science.