CloudPack - Exploiting Workload Flexibility through Rational Pricing

Infrastructure as a Service pricing models for resources are meant to reflect the operational costs and profit margins for providers to deliver virtualized resources to customers subject to an underlying Service Level Agreements (SLAs). While the operational costs incurred by providers are dynamic -- they vary over time depending on factors such as energy cost, cooling strategies, and aggregate demand -- the pricing models extended to customers are typically fixed -- they are static over time and independent of aggregate demand. This disconnect between the dynamic cost incurred by a provider and the fixed price paid by a customer results in an economically inefficient marketplace. In particular, it does not provide incentives for customers to express workload scheduling flexibilities that may benefit them as well as providers. In this paper, we utilize a dynamic pricing model to address this inefficiency and give customers the opportunity and incentive to take advantage of any flexibilities they may have regarding the provisioning of their workloads. We present CloudPack: a framework for workload colocation, which provides customers with the ability to formally express workload flexibilities using Directed Acyclic Graphs, optimizes the use of cloud resources to minimize total costs while allocating clients' workloads, and utilizes Shapley valuation to rationally -- and thus fairly in a game-theoretic sense -- attribute costs to the customers. Using extensive simulation, we show the practical utility of our CloudPack colocation framework and the efficacy of the resulting marketplace in terms of cost savings.

[1]  Daniel Gómez,et al.  Polynomial calculation of the Shapley value based on sampling , 2009, Comput. Oper. Res..

[2]  Roy H. Campbell,et al.  Resource Provisioning Framework for MapReduce Jobs with Performance Goals , 2011, Middleware.

[3]  Tim Roughgarden,et al.  Algorithmic game theory , 2010, Commun. ACM.

[4]  Vanish Talwar,et al.  No "power" struggles: coordinated multi-level power management for the data center , 2008, ASPLOS.

[5]  Mor Harchol-Balter,et al.  Optimal power allocation in server farms , 2009, SIGMETRICS '09.

[6]  David C. Parkes,et al.  ICE: an iterative combinatorial exchange , 2005, EC '05.

[7]  Ruth Ashley,et al.  Job Control Language , 1978 .

[8]  Rajkumar Buyya,et al.  Article in Press Future Generation Computer Systems ( ) – Future Generation Computer Systems Cloud Computing and Emerging It Platforms: Vision, Hype, and Reality for Delivering Computing as the 5th Utility , 2022 .

[9]  Heiko Ludwig,et al.  The WSLA Framework: Specifying and Monitoring Service Level Agreements for Web Services , 2003, Journal of Network and Systems Management.

[10]  Albert Y. Zomaya,et al.  Tradeoffs Between Profit and Customer Satisfaction for Service Provisioning in the Cloud , 2011, HPDC '11.

[11]  Bruce M. Maggs,et al.  Cutting the electric bill for internet-scale systems , 2009, SIGCOMM '09.

[12]  T. N. Vijaykumar,et al.  Joint optimization of idle and cooling power in data centers while maintaining response time , 2010, ASPLOS XV.

[13]  David E. Irwin,et al.  Ensemble-level Power Management for Dense Blade Servers , 2006, 33rd International Symposium on Computer Architecture (ISCA'06).

[14]  Pablo Rodriguez,et al.  On economic heavy hitters: shapley value analysis of 95th-percentile pricing , 2010, IMC '10.

[15]  Ian T. Foster,et al.  SNAP: A Protocol for Negotiating Service Level Agreements and Coordinating Resource Management in Distributed Systems , 2002, JSSPP.

[16]  Ken Kennedy,et al.  Scheduling strategies for mapping application workflows onto the grid , 2005, HPDC-14. Proceedings. 14th IEEE International Symposium on High Performance Distributed Computing, 2005..

[17]  Randy H. Katz,et al.  A view of cloud computing , 2010, CACM.

[18]  Kevin Lai,et al.  Markets are dead, long live markets , 2005, SECO.

[19]  Bruno Sinopoli,et al.  Reducing data center energy consumption via coordinated cooling and load management , 2008, CLUSTER 2008.

[20]  Albert G. Greenberg,et al.  The cost of a cloud: research problems in data center networks , 2008, CCRV.

[21]  Srikanth Kandula,et al.  Jockey: guaranteed job latency in data parallel clusters , 2012, EuroSys '12.

[22]  Prashant J. Shenoy,et al.  Profiling and Modeling Resource Usage of Virtualized Applications , 2008, Middleware.

[23]  Jorge Londoño,et al.  Colocation as a Service , 2010 .

[24]  Rajkumar Buyya,et al.  SLA-Based Advance Reservations with Flexible and Adaptive Time QoS Parameters , 2007, ICSOC.

[25]  Azer Bestavros,et al.  Colocation as a Service: Strategic and Operational Services for Cloud Colocation , 2010, 2010 Ninth IEEE International Symposium on Network Computing and Applications.

[26]  Rajkumar Buyya,et al.  Cost-based scheduling of scientific workflow applications on utility grids , 2005, First International Conference on e-Science and Grid Computing (e-Science'05).

[27]  Rajkumar Buyya,et al.  GridBank: a Grid Accounting Services Architecture (GASA) for distributed systems sharing and integration , 2002, Proceedings International Parallel and Distributed Processing Symposium.

[28]  Andrew Warfield,et al.  Live migration of virtual machines , 2005, NSDI.

[29]  Amin Vahdat,et al.  Two Auction‐Based Resource Allocation Environments: Design and Experience , 2009 .

[30]  Thomas A. Henzinger,et al.  Scheduling large jobs by abstraction refinement , 2011, EuroSys '11.

[31]  Sujata Banerjee,et al.  ElasticTree: Saving Energy in Data Center Networks , 2010, NSDI.

[32]  Hai Jin,et al.  Live migration of virtual machine based on full system trace and replay , 2009, HPDC '09.

[33]  Yolanda Gil,et al.  Pegasus: Mapping Scientific Workflows onto the Grid , 2004, European Across Grids Conference.

[34]  Wolf-Dietrich Weber,et al.  Power provisioning for a warehouse-sized computer , 2007, ISCA '07.

[35]  Kien A. Hua,et al.  Skyscraper broadcasting: a new broadcasting scheme for metropolitan video-on-demand systems , 1997, SIGCOMM '97.

[36]  Lavanya Ramakrishnan,et al.  Deadline-sensitive workflow orchestration without explicit resource control , 2011, J. Parallel Distributed Comput..

[37]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[38]  Richard Wolski,et al.  G-commerce: market formulations controlling resource allocation on the computational grid , 2001, Proceedings 15th International Parallel and Distributed Processing Symposium. IPDPS 2001.

[39]  Yellu Sreenivasulu,et al.  FAST TRANSPARENT MIGRATION FOR VIRTUAL MACHINES , 2014 .