Delay and Price Differentiation in Cloud Computing: A Service Model, Supporting Architectures, and Performance

Many cloud service providers (CSPs) provide on-demand service at a price with a small delay. We propose a QoS-differentiated model where multiple SLAs deliver both on-demand service for latency-critical users and delayed services for delay-tolerant users at lower prices. Two architectures are considered to fulfill SLAs. The first is based on priority queues. The second simply separates servers into multiple modules, each for one SLA. As an ecosystem, we show that the proposed framework is dominant-strategy incentive compatible. Although the first architecture appears more prevalent in the literature, we prove the superiority of the second architecture, under which we further leverage queueing theory to determine the optimal SLA delays and prices. Finally, the viability of the proposed framework is validated through numerical comparison with the on-demand service and it exhibits a revenue improvement in excess of 200%. Our results can help CSPs design optimal delay-differentiated services and choose appropriate serving architectures.

[1]  Giuliano Casale,et al.  OptiSpot: minimizing application deployment cost using spot cloud resources , 2016, Cluster Computing.

[2]  Haim Mendelson,et al.  Pricing and Priority Auctions in Queueing Systems with a Generalized Delay Cost Structure , 2004, Manag. Sci..

[3]  Muli Ben-Yehuda,et al.  Deconstructing Amazon EC2 Spot Instance Pricing , 2011, 2011 IEEE Third International Conference on Cloud Computing Technology and Science.

[4]  Joseph Naor,et al.  Near-optimal scheduling mechanisms for deadline-sensitive jobs in large computing clusters , 2012, SPAA '12.

[5]  Esa Hyytiä,et al.  Towards Designing Cost-Optimal Policies to Utilize IaaS Clouds with Online Learning , 2017, 2017 International Conference on Cloud and Autonomic Computing (ICCAC).

[6]  Adam Wierman,et al.  The Economics of the Cloud , 2017, ACM Trans. Model. Perform. Evaluation Comput. Syst..

[7]  Sem C. Borst,et al.  Optimal Service Elasticity in Large-Scale Distributed Systems , 2017, Proc. ACM Meas. Anal. Comput. Syst..

[8]  Ian A. Kash,et al.  Pricing the Cloud , 2016, IEEE Internet Computing.

[9]  Roch Guérin,et al.  Pricing (and Bidding) Strategies for Delay Differentiated Cloud Services , 2020, ACM Trans. Economics and Comput..

[10]  Zongpeng Li,et al.  Online Auctions in IaaS Clouds: Welfare and Profit Maximization With Server Costs , 2015, IEEE/ACM Transactions on Networking.

[11]  Javad Ghaderi,et al.  On Non-Preemptive VM Scheduling in the Cloud , 2017, Abstracts of the 2018 ACM International Conference on Measurement and Modeling of Computer Systems.

[12]  Giuliano Casale,et al.  Evaluating Weighted Round Robin Load Balancing for Cloud Web Services , 2014, 2014 16th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing.

[13]  Ian A. Kash,et al.  Fixed and market pricing for cloud services , 2012, 2012 Proceedings IEEE INFOCOM Workshops.

[14]  Waltenegus Dargie,et al.  Estimation of the cost of VM migration , 2014, 2014 23rd International Conference on Computer Communication and Networks (ICCCN).

[15]  Zongpeng Li,et al.  An Efficient Cloud Market Mechanism for Computing Jobs With Soft Deadlines , 2017, IEEE/ACM Transactions on Networking.

[16]  Rajkumar Buyya,et al.  Service Level Agreement based Allocation of Cluster Resources: Handling Penalty to Enhance Utility , 2005, 2005 IEEE International Conference on Cluster Computing.

[17]  Nikhil R. Devanur A report on the workshop on the economics of cloud computing , 2017, SECO.

[18]  Esa Hyytiä,et al.  On Round-Robin routing with FCFS and LCFS scheduling , 2016, Perform. Evaluation.

[19]  Francesco De Pellegrini,et al.  A Framework for Allocating Server Time to Spot and On-Demand Services in Cloud Computing , 2019, ACM Trans. Model. Perform. Evaluation Comput. Syst..

[20]  Liang Zheng,et al.  On the Viability of a Cloud Virtual Service Provider , 2016, SIGMETRICS.

[21]  Tim Roughgarden,et al.  Algorithmic Game Theory , 2007 .

[22]  Xiaohu Wu,et al.  Dispatching Discrete-Size Jobs with Multiple Deadlines to Parallel Heterogeneous Servers , 2019, Systems Modeling: Methodologies and Tools.

[23]  Danilo Ardagna,et al.  Quality-of-service in cloud computing: modeling techniques and their applications , 2014, Journal of Internet Services and Applications.

[24]  Sven Seuken,et al.  Cloud Pricing: The Spot Market Strikes Back , 2019, EC.

[25]  Xiaohu Wu,et al.  Algorithms for scheduling deadline-sensitive malleable tasks , 2015, 2015 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[26]  Srikanth Kandula,et al.  Efficient queue management for cluster scheduling , 2016, EuroSys.

[27]  Carlo Curino,et al.  Mercury: Hybrid Centralized and Distributed Scheduling in Large Shared Clusters , 2015, USENIX Annual Technical Conference.

[28]  Xiaohu Wu,et al.  Toward Designing Cost-Optimal Policies to Utilize IaaS Clouds with Online Learning , 2020, IEEE Transactions on Parallel and Distributed Systems.

[29]  Yossi Azar,et al.  Truthful Online Scheduling with Commitments , 2015, EC.

[30]  Rajkumar Buyya,et al.  SLA-based virtual machine management for heterogeneous workloads in a cloud datacenter , 2014, J. Netw. Comput. Appl..

[31]  Gregory R. Ganger,et al.  Stratus: cost-aware container scheduling in the public cloud , 2018, SoCC.