Response-Time-Optimised Service Deployment: MILP Formulations of Piece-wise Linear Functions Approximating Non-linear Bivariate Mixed-integer Functions

A current trend in networking and cloud computing is to provide compute resources at widely dispersed places; this is exemplified by developments such as Network Function Virtualisation. This paves the way for wide-area service deployments with improved service quality: e.g, a nearby server can reduce the user-perceived response times. But always using the nearest server can be a bad decision if that server is already highly utilised. This paper formalises the two related problems of allocating resources at different locations and assigning users to them with the goal of minimising the response times for a given number of resources to use -- a non-linear capacitated facility location problem with integrated queuing systems. To efficiently handle the non-linearity, we introduce five linear problem approximations and adapt the currently best heuristic for a similar problem to our scenario. All six approaches are compared in experiments for solution quality and solving time. Surprisingly, our best optimisation formulation outperforms the heuristic in both time and quality. Additionally, we evaluate the influence ot resource distributions in the network on the response time: Cut by half for some configurations. The presented formulations are applicable to a broader optimisation domain.

[1]  M. E. Dyer,et al.  Note---On the Validity of Marginal Analysis for Allocating Servers in M/M/c Queues , 1977 .

[2]  Ronald L. Rivest,et al.  Introduction to Algorithms, third edition , 2009 .

[3]  Hau L. Lee,et al.  A note on the convexity of performance measures of M/M/c queueing systems , 1983 .

[4]  Ralf Steinmetz,et al.  Modelling the Internet Delay Space Based on Geographical Locations , 2009, 2009 17th Euromicro International Conference on Parallel, Distributed and Network-based Processing.

[5]  Tarik Taleb,et al.  Toward carrier cloud: Potential, challenges, and solutions , 2014, IEEE Wireless Communications.

[6]  Zvi Drezner,et al.  The multiple server center location problem , 2009, Ann. Oper. Res..

[7]  Ronald L. Rivest,et al.  Introduction to Algorithms , 1990 .

[8]  Gunter Bolch,et al.  Queueing Networks and Markov Chains - Modeling and Performance Evaluation with Computer Science Applications, Second Edition , 1998 .

[9]  Silvano Martello,et al.  Piecewise linear approximation of functions of two variables in MILP models , 2010, Oper. Res. Lett..

[10]  Lachlan L. H. Andrew,et al.  Online algorithms for geographical load balancing , 2012, 2012 International Green Computing Conference (IGCC).

[11]  Quanyan Zhu,et al.  Dynamic Service Placement in Geographically Distributed Clouds , 2012, IEEE Journal on Selected Areas in Communications.

[12]  Arie Harel,et al.  Convexity Results for the Erlang Delay and Loss Formulae When the Server Utilization Is Held Constant , 2011, Oper. Res..

[13]  David Mazières,et al.  OASIS: Anycast for Any Service , 2006, NSDI.

[14]  Patrick Wendell,et al.  DONAR: decentralized server selection for cloud services , 2010, SIGCOMM '10.

[15]  Emin Gün Sirer,et al.  ClosestNode.com: an open access, scalable, shared geocast service for distributed systems , 2006, OPSR.

[16]  Zhitao Wan Cloud Computing infrastructure for latency sensitive applications , 2010, 2010 IEEE 12th International Conference on Communication Technology.

[17]  Vladimir Marianov,et al.  Location–Allocation of Multiple-Server Service Centers with Constrained Queues or Waiting Times , 2002, Ann. Oper. Res..

[18]  F. M. Moghadas,et al.  Maximal covering location-allocation problem with M/M/k queuing system and side constraints , 2011 .

[19]  Matthew Roughan,et al.  The Internet Topology Zoo , 2011, IEEE Journal on Selected Areas in Communications.

[20]  Zvi Drezner,et al.  The multiple server location problem , 2007, J. Oper. Res. Soc..

[21]  Guoping Zeng,et al.  Two common properties of the erlang-B function, erlang-C function, and Engset blocking function , 2003 .

[22]  Philip S. Yu,et al.  Geographic load balancing for scalable distributed Web systems , 2000, Proceedings 8th International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (Cat. No.PR00728).

[23]  Dennis Cai,et al.  The Evolution of the Carrier Cloud Networking , 2013, 2013 IEEE Seventh International Symposium on Service-Oriented System Engineering.

[24]  Tarik Taleb,et al.  Service-aware network function placement for efficient traffic handling in carrier cloud , 2014, 2014 IEEE Wireless Communications and Networking Conference (WCNC).

[25]  Zvi Drezner,et al.  Facility location - applications and theory , 2001 .

[26]  Qian Wang,et al.  Algorithms for a Facility Location Problem with Stochastic Customer Demand and Immobile Servers , 2002, Ann. Oper. Res..

[27]  Seyed Taghi Akhavan Niaki,et al.  Genetic application in a facility location problem with random demand within queuing framework , 2012, J. Intell. Manuf..

[28]  Zvi Drezner,et al.  Classroom Note: A note on calculating steady state results for an M/M/k queuing system when the ratio of the arrival rate to the service rate is large , 1998, Adv. Decis. Sci..

[29]  Holger Karl,et al.  Response time-optimized distributed cloud resource allocation , 2014, DCC '14.

[30]  Emre Tokgoz,et al.  A Mixed Integer Convexity Result with an Application to an M/M/s Queueing System , 2010 .

[31]  Rajkumar Buyya,et al.  Minimizing Execution Costs when Using Globally Distributed Cloud Services , 2010, 2010 24th IEEE International Conference on Advanced Information Networking and Applications.

[32]  Alexander L. Stolyar,et al.  Shadow-routing based dynamic algorithms for virtual machine placement in a network cloud , 2013, INFOCOM.

[33]  Vijay K. Gurbani,et al.  Monitoring and abstraction for networked clouds , 2012, 2012 16th International Conference on Intelligence in Next Generation Networks.

[34]  Massoud Pedram,et al.  Geographical Load Balancing for Online Service Applications in Distributed Datacenters , 2013, 2013 IEEE Sixth International Conference on Cloud Computing.

[35]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[36]  Zvi Drezner,et al.  The gravity multiple server location problem , 2011, Comput. Oper. Res..

[37]  Gunter Bolch,et al.  Queueing Networks and Markov Chains , 2005 .

[38]  Krishna P. Gummadi,et al.  King: estimating latency between arbitrary internet end hosts , 2002, IMW '02.

[39]  Toyotaro Suzumura,et al.  Elastic Stream Computing with Clouds , 2011, 2011 IEEE 4th International Conference on Cloud Computing.

[40]  Fang Hao,et al.  Enhancing dynamic cloud-based services using network virtualization , 2009, CCRV.

[41]  Björn Geißler,et al.  Using Piecewise Linear Functions for Solving MINLP s , 2012 .

[42]  Paul H. Zipkin,et al.  Strong Convexity Results for Queueing Systems , 1987, Oper. Res..

[43]  Navneet Vidyarthi,et al.  Response time reduction in make-to-order and assemble-to-order supply chain design , 2009 .

[44]  O. Kariv,et al.  An Algorithmic Approach to Network Location Problems. II: The p-Medians , 1979 .

[45]  Kajal T. Claypool,et al.  Latency and player actions in online games , 2006, CACM.

[46]  Han-I Su,et al.  Are all games equally cloud-gaming-friendly? An electromyographic approach , 2012, 2012 11th Annual Workshop on Network and Systems Support for Games (NetGames).

[47]  Lachlan L. H. Andrew,et al.  Greening geographical load balancing , 2015, TNET.

[48]  A. Imamoto,et al.  A Recursive Descent Algorithm for Finding the Optimal Minimax Piecewise Linear Approximation of Convex Functions , 2008, Advances in Electrical and Electronics Engineering - IAENG Special Edition of the World Congress on Engineering and Computer Science 2008.

[49]  Xavier Hesselbach,et al.  Virtual Network Embedding: A Survey , 2013, IEEE Communications Surveys & Tutorials.

[50]  Steffen Rebennack,et al.  Continuous Piecewise Linear Delta-Approximations for Bivariate and Multivariate Functions , 2015, J. Optim. Theory Appl..

[51]  Prashant J. Shenoy,et al.  Empirical evaluation of latency-sensitive application performance in the cloud , 2010, MMSys '10.

[52]  G. Dantzig ON THE SIGNIFICANCE OF SOLVING LINEAR PROGRAMMING PROBLEMS WITH SOME INTEGER VARIABLES , 1960 .

[53]  Emin Gün Sirer,et al.  Meridian: a lightweight network location service without virtual coordinates , 2005, SIGCOMM '05.

[54]  Judith Kelner,et al.  Autonomic Cloud Computing: Giving Intelligence to Simpleton Nodes , 2011, 2011 IEEE Third International Conference on Cloud Computing Technology and Science.

[55]  Alec Wolman,et al.  Volley: Automated Data Placement for Geo-Distributed Cloud Services , 2010, NSDI.

[56]  Sem C. Borst,et al.  Dimensioning Large Call Centers , 2000, Oper. Res..

[57]  Judith Kelner,et al.  Resource allocation for distributed cloud: concepts and research challenges , 2011, IEEE Network.