Controlling the Variability of Capacity Allocations Using Service Deferrals

Ensuring predictability is a crucial goal for service systems. Traditionally, research has focused on designing systems that ensure predictable performance for service requests. Motivated by applications in cloud computing and electricity markets, this article focuses on a different form of predictability: predictable allocations of service capacity. It studies a new model in which service capacity can be scaled dynamically and service deferrals (subject to deadline constraints) can be used to control the variability of the active service capacity. Four natural policies for the joint problem of scheduling and managing the active service capacity are considered. For each, the variability of service capacity and the likelihood of deadline misses are derived. Further, the article illustrates how pricing can provide incentives for jobs to reveal their deadlines, enabling service deferral in systems where the flexibility of jobs is not known to the system a priori.
