Response time-optimized distributed cloud resource allocation

In the near future many more compute resources will be available at different geographical locations. To minimize the response time of requests, application servers closer to the user can hence be used to shorten network round trip times. However, this advantage is neutralized if the used data centre is highly loaded as the processing time of requests is important as well. We model the request response time as the network round trip time plus the processing time at a data centre.We present a capacitated facility location problem formalization where the processing time is modelled as the sojourn time of a queueing model. We discuss the \emph{Pareto trade-off} between the number of used data centres and the resulting response time. For example, using fewer data centres could cut expenses but results in high utilization, high response time, and smaller revenues.Previous work presented a non-linear cost function. We prove its \emph{convexity} and exploit this property in two ways: First, we transform the convex model into a linear model while controlling the maximum approximation error. Second, we used a convex solver instead of a slower non-linear solver.Numerical results on network topologies exemplify our work.

[1]  Zvi Drezner,et al.  Location and allocation of service units on a congested network , 2008 .

[2]  Björn Geißler,et al.  Using Piecewise Linear Functions for Solving MINLP s , 2012 .

[3]  Kenneth Ward Church,et al.  On Delivering Embarrassingly Distributed Cloud Services , 2008, HotNets.

[4]  Navneet Vidyarthi,et al.  Response time reduction in make-to-order and assemble-to-order supply chain design , 2009 .

[5]  O. Kariv,et al.  An Algorithmic Approach to Network Location Problems. II: The p-Medians , 1979 .

[6]  Qian Wang,et al.  Facility location models for immobile servers with stochastic demand , 2004 .

[7]  Kajal T. Claypool,et al.  Latency and player actions in online games , 2006, CACM.

[8]  Toyotaro Suzumura,et al.  Elastic Stream Computing with Clouds , 2011, 2011 IEEE 4th International Conference on Cloud Computing.

[9]  Judith Kelner,et al.  Resource allocation for distributed cloud: concepts and research challenges , 2011, IEEE Network.

[10]  Alec Wolman,et al.  Volley: Automated Data Placement for Geo-Distributed Cloud Services , 2010, NSDI.

[11]  Holger Karl,et al.  A topology-aware adaptive deployment framework for elastic applications , 2013, 2013 17th International Conference on Intelligence in Next Generation Networks (ICIN).

[12]  Fang Hao,et al.  Enhancing dynamic cloud-based services using network virtualization , 2009, CCRV.

[13]  Krishna P. Gummadi,et al.  King: estimating latency between arbitrary internet end hosts , 2002, IMW '02.

[14]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[15]  Xavier Hesselbach,et al.  Virtual Network Embedding: A Survey , 2013, IEEE Communications Surveys & Tutorials.

[16]  Zvi Drezner,et al.  The gravity multiple server location problem , 2011, Comput. Oper. Res..

[17]  Alexander L. Stolyar,et al.  Shadow-Routing Based Dynamic Algorithms for Virtual Machine Placement in a Network Cloud , 2013, IEEE Transactions on Cloud Computing.

[18]  R. Saigal,et al.  Handbook of semidefinite programming : theory, algorithms, and applications , 2000 .

[19]  Rajkumar Buyya,et al.  Minimizing Execution Costs when Using Globally Distributed Cloud Services , 2010, 2010 24th IEEE International Conference on Advanced Information Networking and Applications.

[20]  Jordi Torres,et al.  Intelligent Placement of Datacenters for Internet Services , 2011, 2011 31st International Conference on Distributed Computing Systems.

[21]  Alexander L. Stolyar,et al.  Shadow-routing based dynamic algorithms for virtual machine placement in a network cloud , 2013, INFOCOM.

[22]  Tarik Taleb,et al.  Toward carrier cloud: Potential, challenges, and solutions , 2014, IEEE Wireless Communications.

[23]  Tommaso Cucinotta,et al.  End-to-End Service Quality for Cloud Applications , 2013, GECON.

[24]  Emin Gün Sirer,et al.  Meridian: a lightweight network location service without virtual coordinates , 2005, SIGCOMM '05.

[25]  Gunter Bolch,et al.  Queueing Networks and Markov Chains , 2005 .

[26]  Zvi Drezner,et al.  Facility location - applications and theory , 2001 .

[27]  Qian Wang,et al.  Algorithms for a Facility Location Problem with Stochastic Customer Demand and Immobile Servers , 2002, Ann. Oper. Res..

[28]  Seyed Taghi Akhavan Niaki,et al.  Genetic application in a facility location problem with random demand within queuing framework , 2012, J. Intell. Manuf..

[29]  T. V. Lakshman,et al.  Network aware resource allocation in distributed clouds , 2012, 2012 Proceedings IEEE INFOCOM.

[30]  Tarik Taleb,et al.  Follow me cloud: interworking federated clouds and distributed mobile networks , 2013, IEEE Network.

[31]  Tommaso Cucinotta,et al.  Run-time Support for Real-Time Multimedia in the Cloud , 2013, REACTION.

[32]  Bhushan H. Trivedi,et al.  Improving Performance of Mobile Agent Based Intrusion Detection System , 2015, 2015 Fifth International Conference on Advanced Computing & Communication Technologies.

[33]  Minseok Kwon,et al.  A Tutorial on Network Latency and Its Measurements , 2015 .

[34]  Gunter Bolch,et al.  Queueing Networks and Markov Chains - Modeling and Performance Evaluation with Computer Science Applications, Second Edition , 1998 .

[35]  F. M. Moghadas,et al.  Maximal covering location-allocation problem with M/M/k queuing system and side constraints , 2011 .

[36]  Prashant J. Shenoy,et al.  Empirical evaluation of latency-sensitive application performance in the cloud , 2010, MMSys '10.

[37]  Lachlan L. H. Andrew,et al.  Greening Geographical Load Balancing , 2015, IEEE/ACM Transactions on Networking.

[38]  Vijay K. Gurbani,et al.  Monitoring and abstraction for networked clouds , 2012, 2012 16th International Conference on Intelligence in Next Generation Networks.

[39]  Massoud Pedram,et al.  Geographical Load Balancing for Online Service Applications in Distributed Datacenters , 2013, 2013 IEEE Sixth International Conference on Cloud Computing.

[40]  Tarik Taleb,et al.  Service-aware network function placement for efficient traffic handling in carrier cloud , 2014, 2014 IEEE Wireless Communications and Networking Conference (WCNC).

[41]  Gergely Pongrácz,et al.  Unifying Cloud and Carrier Network: EU FP7 Project UNIFY , 2013, 2013 IEEE/ACM 6th International Conference on Utility and Cloud Computing.

[42]  Han-I Su,et al.  Are all games equally cloud-gaming-friendly? An electromyographic approach , 2012, 2012 11th Annual Workshop on Network and Systems Support for Games (NetGames).

[43]  Zvi Drezner,et al.  The multiple server center location problem , 2009, Ann. Oper. Res..

[44]  Quanyan Zhu,et al.  Dynamic Service Placement in Geographically Distributed Clouds , 2012, IEEE Journal on Selected Areas in Communications.

[45]  David Mazières,et al.  OASIS: Anycast for Any Service , 2006, NSDI.

[46]  Zvi Drezner,et al.  The multiple server location problem , 2007, J. Oper. Res. Soc..

[47]  Lachlan L. H. Andrew,et al.  Online algorithms for geographical load balancing , 2012, 2012 International Green Computing Conference (IGCC).

[48]  Patrick Wendell,et al.  DONAR: decentralized server selection for cloud services , 2010, SIGCOMM '10.

[49]  Emin Gün Sirer,et al.  ClosestNode.com: an open access, scalable, shared geocast service for distributed systems , 2006, OPSR.

[50]  Vladimir Marianov,et al.  Location–Allocation of Multiple-Server Service Centers with Constrained Queues or Waiting Times , 2002, Ann. Oper. Res..

[51]  Myron Hlynka,et al.  Queueing Networks and Markov Chains (Modeling and Performance Evaluation With Computer Science Applications) , 2007, Technometrics.

[52]  Ralf Steinmetz,et al.  Modelling the Internet Delay Space Based on Geographical Locations , 2009, 2009 17th Euromicro International Conference on Parallel, Distributed and Network-based Processing.

[53]  S. L. HAKIMIt AN ALGORITHMIC APPROACH TO NETWORK LOCATION PROBLEMS. , 1979 .

[54]  Matthew Roughan,et al.  The Internet Topology Zoo , 2011, IEEE Journal on Selected Areas in Communications.

[55]  Philip S. Yu,et al.  Geographic load balancing for scalable distributed Web systems , 2000, Proceedings 8th International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (Cat. No.PR00728).

[56]  Manfred W. Padberg,et al.  Approximating Separable Nonlinear Functions Via Mixed Zero-One Programs , 1998, Oper. Res. Lett..

[57]  Zhitao Wan Cloud Computing infrastructure for latency sensitive applications , 2010, 2010 IEEE 12th International Conference on Communication Technology.

[58]  Dennis Cai,et al.  The Evolution of the Carrier Cloud Networking , 2013, 2013 IEEE Seventh International Symposium on Service-Oriented System Engineering.