On demand elastic capacity planning for service auto-scaling

Cloud computing allows on demand elastic service scaling. The capability of a service to predict resource requirements for the next operational period defines how well it will exploit the elasticity of cloud computing in order to reduce operational costs. In this work, we consider a capacity planning process for service scale-out as an online pricing model. In particular, we study the impact of buffering service requests on revenues in various settings with allocation and maintenance costs. In addition, we analyze the incurred latency implied by buffering service requests. We believe that our insights will allow to significantly simplify predictions and mitigate the unknowns of future demands on resources.

[1]  Sally Floyd,et al.  Wide area traffic: the failure of Poisson modeling , 1995, TNET.

[2]  Barbara M. Anthony,et al.  Infrastructure Leasing Problems , 2007, IPCO.

[3]  Kirill Kogan,et al.  Priority queueing with multiple packet characteristics , 2015, 2015 IEEE Conference on Computer Communications (INFOCOM).

[4]  Jie Yu,et al.  Heavy tails, generalized coding, and optimal Web layout , 2001, Proceedings IEEE INFOCOM 2001. Conference on Computer Communications. Twentieth Annual Joint Conference of the IEEE Computer and Communications Society (Cat. No.01CH37213).

[5]  Allan Borodin,et al.  Online computation and competitive analysis , 1998 .

[6]  Michael Segal,et al.  Best Effort and Priority Queuing Policies for Buffered Crossbar Switches , 2008, Chic. J. Theor. Comput. Sci..

[7]  Pan Hui,et al.  Economic models for cloud service markets , 2012, ICDCN 2012.

[8]  Babak Falsafi,et al.  Clearing the clouds: a study of emerging scale-out workloads on modern hardware , 2012, ASPLOS XVII.

[9]  Kushagra Vaid,et al.  Web search using mobile cores: quantifying and mitigating the price of efficiency , 2010, ISCA.

[10]  Sriram Sankar,et al.  Server Engineering Insights for Large-Scale Online Services , 2010, IEEE Micro.

[11]  José Antonio Lozano,et al.  A Review of Auto-scaling Techniques for Elastic Applications in Cloud Environments , 2014, Journal of Grid Computing.

[12]  Kirill Kogan,et al.  Balancing work and size with bounded buffers , 2014, 2014 Sixth International Conference on Communication Systems and Networks (COMSNETS).

[13]  Xiaowei Yang,et al.  CloudCmp: comparing public cloud providers , 2010, IMC '10.

[14]  Adam Meyerson The parking permit problem , 2005, 46th Annual IEEE Symposium on Foundations of Computer Science (FOCS'05).

[15]  Kevin Lee,et al.  Empirical prediction models for adaptive resource provisioning in the cloud , 2012, Future Gener. Comput. Syst..

[16]  Waheed Iqbal,et al.  Adaptive resource provisioning for read intensive multi-tier applications in the cloud , 2011, Future Gener. Comput. Syst..

[17]  Wolfgang Fischer,et al.  The Markov-Modulated Poisson Process (MMPP) Cookbook , 1993, Perform. Evaluation.

[18]  Kirill Kogan,et al.  Multi-queued network processors for packets with heterogeneous processing requirements , 2013, 2013 Fifth International Conference on Communication Systems and Networks (COMSNETS).

[19]  Xiaowei Yang,et al.  CloudCmp: Shopping for a Cloud Made Easy , 2010, HotCloud.

[20]  Moshe Zukerman,et al.  Internet traffic modeling and future technology implications , 2003, IEEE INFOCOM 2003. Twenty-second Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE Cat. No.03CH37428).

[21]  Moshe Zukerman,et al.  Performance evaluation of a queue fed by a Poisson Pareto burst process , 2002, Comput. Networks.

[22]  Randy H. Katz,et al.  A view of cloud computing , 2010, CACM.

[23]  Massoud Pedram,et al.  SLA-based Optimization of Power and Migration Cost in Cloud Computing , 2012, 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012).

[24]  Kirill Kogan,et al.  A taxonomy of Semi-FIFO policies , 2012, 2012 IEEE 31st International Performance Computing and Communications Conference (IPCCC).

[25]  Zhenhuan Gong,et al.  PRESS: PRedictive Elastic ReSource Scaling for cloud systems , 2010, 2010 International Conference on Network and Service Management.

[26]  Eddy Caron,et al.  Pattern Matching Based Forecast of Non-periodic Repetitive Behavior for Cloud Clients , 2011, Journal of Grid Computing.

[27]  Ilkka Nomos On the Use of Fractional Brownian Motion in the Theory of Connectionless Networks , 1995 .

[28]  Michael Segal,et al.  Packet mode and QoS algorithms for buffered crossbar switches with FIFO queuing , 2008, PODC '08.

[29]  Michael Segal,et al.  Providing performance guarantees in multipass network processors , 2011, 2011 Proceedings IEEE INFOCOM.

[30]  Michael Segal,et al.  Improved Competitive Performance Bounds for CIOQ Switches , 2008, Algorithmica.

[31]  David M. Lucantoni,et al.  A Markov Modulated Characterization of Packetized Voice and Data Traffic and Related Statistical Multiplexer Performance , 1986, IEEE J. Sel. Areas Commun..

[32]  Xiaohui Gu,et al.  CloudScale: elastic resource scaling for multi-tenant cloud systems , 2011, SoCC.

[33]  Patrick Th. Eugster,et al.  Essential Traffic Parameters for Shared Memory Switch Performance , 2015, SIROCCO.

[34]  Kirill Kogan,et al.  Single and Multiple Buffer Processing , 2016, Encyclopedia of Algorithms.

[35]  Moustafa Ghanem,et al.  Lightweight Resource Scaling for Cloud Applications , 2012, 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012).

[36]  Patrick Th. Eugster,et al.  Shared Memory Buffer Management for Heterogeneous Packet Processing , 2014, 2014 IEEE 34th International Conference on Distributed Computing Systems.

[37]  Verena Kantere,et al.  An Economic Model for Self-Tuned Cloud Caching , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[38]  Vijayaraghavan Soundararajan,et al.  The impact of management operations on the virtualized datacenter , 2010, ISCA '10.

[39]  Lingjia Tang,et al.  The impact of memory subsystem resource sharing on datacenter applications , 2011, 2011 38th Annual International Symposium on Computer Architecture (ISCA).

[40]  Michael H. Goldwasser A survey of buffer management policies for packet switches , 2010, SIGA.

[41]  Adam Silberstein,et al.  Benchmarking cloud serving systems with YCSB , 2010, SoCC '10.

[42]  Kirill Kogan,et al.  FIFO Queueing Policies for Packets with Heterogeneous Processing , 2012, MedAlg.