Robust Validation of Network Designs under Uncertain Demands and Failures

A key challenge confronting wide-area network architects is validating that their network designs provide assurable performance in the face of variable traffic demands and failures. Validation is hard because of the exponential, and possibly non-enumerable, set of scenarios that must be considered. Current theoretical tools provide overly conservative bounds on network performance since to remain tractable, they do not adequately model the flexible routing strategies that networks employ in practice to adapt to failures and changing traffic demands. In this paper, we develop an optimizationtheoretic framework to derive the worst-case network performance across scenarios of interest by modeling flexible routing adaptation strategies. We present an approach to tackling the resulting intractable problems, which can achieve tighter bounds on network performance than current techniques. While our framework is general, we focus on bounding worst-case link utilizations, and case studies involving topology design, and MPLS tunnels, chosen both for their practical importance and to illustrate key aspects of our framework. Evaluations over real network topologies and traffic data show the promise of the approach.

[1]  Mikkel Thorup,et al.  Robust optimization of OSPF/IS-IS weights , 2003 .

[2]  Xin-She Yang,et al.  Introduction to Algorithms , 2021, Nature-Inspired Optimization Algorithms.

[3]  Nesa L'abbe Wu,et al.  Linear programming and extensions , 1981 .

[4]  Minlan Yu,et al.  SIMPLE-fying middlebox policy enforcement using SDN , 2013, SIGCOMM.

[5]  Albert G. Greenberg,et al.  COPE: traffic engineering in dynamic networks , 2006, SIGCOMM.

[6]  Warren P. Adams,et al.  A Reformulation-Linearization Technique for Solving Discrete and Continuous Nonconvex Problems , 1998 .

[7]  A. Ben-Tal,et al.  Adjustable robust solutions of uncertain linear programs , 2004, Math. Program..

[8]  Marcin Bienkowski,et al.  A practical algorithm for constructing oblivious routing schemes , 2003, SPAA '03.

[9]  Albert G. Greenberg,et al.  Network anomography , 2005, IMC '05.

[10]  Rahul Potharaju,et al.  When the network crumbles: an empirical study of cloud network failures and their impact on services , 2013, SoCC.

[11]  Yin Zhang,et al.  R3: resilient routing reconfiguration , 2010, SIGCOMM '10.

[12]  Mohammad Taghi Hajiaghayi,et al.  Semi-oblivious routing: lower bounds , 2007, SODA '07.

[13]  Harald Räcke,et al.  Minimizing Congestion in General Networks , 2002, FOCS.

[14]  Vyas Sekar,et al.  Making middleboxes someone else's problem: network processing as a cloud service , 2012, SIGCOMM '12.

[15]  D. Handelman Representing polynomials by positive linear functions on compact convex polyhedra. , 1988 .

[16]  Edith Cohen,et al.  Coping with network failures: routing strategies for optimal demand oblivious restoration , 2004, SIGMETRICS '04/Performance '04.

[17]  Daniel Kuhn,et al.  K-Adaptability in Two-Stage Robust Binary Programming , 2015, Oper. Res..

[18]  J. Rexford,et al.  Network architecture for joint failure recovery and traffic engineering , 2011, PERV.

[19]  Eric C. Rosen,et al.  Multiprotocol Label Switching Architecture , 2001, RFC.

[20]  Mohammad Taghi Hajiaghayi,et al.  Oblivious network design , 2006, SODA '06.

[21]  Ramesh Govindan,et al.  Evolve or Die: High-Availability Design Principles Drawn from Googles Network Infrastructure , 2016, SIGCOMM.

[22]  George Varghese,et al.  Header Space Analysis: Static Checking for Networks , 2012, NSDI.

[23]  Dimitris Bertsimas,et al.  On the power and limitations of affine policies in two-stage adaptive optimization , 2012, Math. Program..

[24]  Martín Casado,et al.  Fabric: a retrospective on evolving SDN , 2012, HotSDN '12.

[25]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[26]  Cheng Jin,et al.  MATE: MPLS adaptive traffic engineering , 2001, Proceedings IEEE INFOCOM 2001. Conference on Computer Communications. Twentieth Annual Joint Conference of the IEEE Computer and Communications Society (Cat. No.01CH37213).

[27]  Pranjal Awasthi,et al.  On the adaptivity gap in two-stage robust linear optimization under uncertain packing constraints , 2019, Math. Program..

[28]  Constantine Caramanis,et al.  Theory and Applications of Robust Optimization , 2010, SIAM Rev..

[29]  Brighten Godfrey,et al.  VeriFlow: verifying network-wide invariants in real time , 2012, HotSDN '12.

[30]  Min Zhu,et al.  B4: experience with a globally-deployed software defined wan , 2013, SIGCOMM.

[31]  Jean-Philippe Vial,et al.  Robust Optimization , 2021, ICORES.

[32]  Nick Feamster,et al.  Programming slick network functions , 2015, SOSR.

[33]  Minlan Yu,et al.  Condor: Better Topologies Through Declarative Design , 2015, Comput. Commun. Rev..

[34]  Srikanth Kandula,et al.  Achieving high utilization with software-driven WAN , 2013, SIGCOMM.

[35]  Vyas Sekar,et al.  Simplifying Software-Defined Network Optimization Using SOL , 2016, NSDI.

[36]  Srikanth Kandula,et al.  Traffic engineering with forward fault correction , 2014, SIGCOMM.

[37]  Harald Räcke,et al.  Optimal hierarchical decompositions for congestion minimization in networks , 2008, STOC.

[38]  Edith Cohen,et al.  Making intra-domain routing robust to changing and uncertain traffic demands: understanding fundamental tradeoffs , 2003, SIGCOMM '03.

[39]  Yurii Nesterov,et al.  New variants of bundle methods , 1995, Math. Program..

[40]  Alon Itai,et al.  On the complexity of time table and multi-commodity flow problems , 1975, 16th Annual Symposium on Foundations of Computer Science (sfcs 1975).

[41]  Satish Rao,et al.  A polynomial-time tree decomposition to minimize congestion , 2003, SPAA '03.

[42]  Fernando Pedone,et al.  Merlin: A Language for Provisioning Network Resources , 2014, CoNEXT.

[43]  Edith Cohen,et al.  Optimal oblivious routing in polynomial time , 2003, STOC '03.

[44]  Deepinder P. Sidhu,et al.  Finding disjoint paths in networks , 1991, SIGCOMM '91.

[45]  Donald F. Towsley,et al.  Optimal routing with multiple traffic matrices tradeoff between average and worst case performance , 2005, 13TH IEEE International Conference on Network Protocols (ICNP'05).

[46]  Pranjal Awasthi,et al.  On the Adaptivity Gap in Two-Stage Robust Linear Optimization under Uncertain Constraints , 2015 .

[47]  Robert Soulé,et al.  Kulfi: Robust Traffic Engineering Using Semi-Oblivious Routing , 2016, ArXiv.

[48]  Srikanth Kandula,et al.  Walking the tightrope: responsive yet stable traffic engineering , 2005, SIGCOMM '05.