Joint Minimization of Monitoring Cost and Delay in Overlay Networks: Optimal Policies with a Markovian Approach

Continuous monitoring of network resources enables to make more-informed resource allocation decisions but incurs overheads. We investigate the trade-off between monitoring costs and benefits of accurate state information for a routing problem. In our approach link delays are modeled by Markov chains or hidden Markov models. The current delay information on a link can be obtained by actively monitoring this link at a fixed cost. At each time slot, the decision maker chooses to monitor a subset of links with the objective of minimizing a linear combination of long-run average delay and monitoring costs. This decision problem is modeled as a Markov decision process whose solution is computed numerically. In addition, in simple settings we prove that immediate monitoring cost and delay minimization leads to a threshold policy on a filter which sums up information from past measurements. The lightweight method as well as the optimal policy are tested on several use-cases. We demonstrate on an overlay of 30 nodes of RIPE Atlas that we obtain delay values close to the performance of the always best path with an extremely low monitoring effort when delays between nodes are modeled with hierarchical Dirichlet process hidden Markov models.

[1]  Vikram Krishnamurthy,et al.  Partially Observed Markov Decision Processes: From Filtering to Controlled Sensing , 2016, 1604.08127.

[2]  Claudiu Danilov,et al.  Structured Overlay Networks for a New Generation of Internet Services , 2017, 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS).

[3]  Jeremie Leguay,et al.  Overlay routing for fast video transfers in CDN , 2017, 2017 IFIP/IEEE Symposium on Integrated Network and Service Management (IM).

[4]  Srinivasan Seshan,et al.  A case for end system multicast , 2002, IEEE J. Sel. Areas Commun..

[5]  Ying Zhang,et al.  Understanding network delay changes caused by routing events , 2007, SIGMETRICS '07.

[6]  Erol Gelenbe,et al.  Towards Networks with Cognitive Packets , 2001 .

[7]  Donald F. Towsley,et al.  Detecting shared congestion of flows via end-to-end measurement , 2000, SIGMETRICS '00.

[8]  Hoon Kim,et al.  Monte Carlo Statistical Methods , 2000, Technometrics.

[9]  Thomas F. La Porta,et al.  Parsimonious Tomography: Optimizing Cost-Identifiability Trade-off for Probing-based Network Monitoring , 2018, PERV.

[10]  Vikram Krishnamurthy,et al.  Algorithms for optimal scheduling and management of hidden Markov model sensors , 2002, IEEE Trans. Signal Process..

[11]  Nick Feamster,et al.  The case for separating routing from routers , 2004, FDNA '04.

[12]  Kin K. Leung,et al.  Robust monitor placement for network tomography in dynamic networks , 2016, IEEE INFOCOM 2016 - The 35th Annual IEEE International Conference on Computer Communications.

[13]  Pablo Belzarena,et al.  End-to-end quality of service seen by applications: A statistical learning approach , 2010, Comput. Networks.

[14]  Qiang Xu,et al.  Software-Defined Latency Monitoring in Data Center Networks , 2015, PAM.

[15]  Hari Balakrishnan,et al.  Resilient overlay networks , 2001, SOSP.

[16]  Fei Li,et al.  End-to-End Service Quality Measurement Using Source-Routed Probes , 2006, Proceedings IEEE INFOCOM 2006. 25TH IEEE International Conference on Computer Communications.

[17]  Michael I. Jordan,et al.  Hierarchical Dirichlet Processes , 2006 .

[18]  Frank Thomson Leighton,et al.  Improving performance on the internet , 2008, CACM.

[19]  Ramesh K. Sitaraman,et al.  The Akamai network: a platform for high-performance internet applications , 2010, OPSR.

[20]  Bruce M. Maggs,et al.  Designing overlay multicast networks for streaming , 2003, SPAA '03.

[21]  Ramesh K. Sitaraman,et al.  Overlay Networks: An Akamai Perspective , 2014 .

[22]  Robert Stone,et al.  CenterTrack: An IP Overlay Network for Tracking DoS Floods , 2000, USENIX Security Symposium.

[23]  Robert Nowak,et al.  Internet tomography , 2002, IEEE Signal Process. Mag..

[24]  Min Zhu,et al.  B4: experience with a globally-deployed software defined wan , 2013, SIGCOMM.

[25]  Scott Shenker,et al.  Overcoming the Internet impasse through virtualization , 2005, Computer.

[26]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[27]  Akihiro Nakao,et al.  Scalable routing overlay networks , 2006, OPSR.

[28]  David R. Karger,et al.  Chord: A scalable peer-to-peer lookup service for internet applications , 2001, SIGCOMM '01.

[29]  H. Rahul,et al.  Towards Realizing the Performance and Availability Benefits of a Global Overlay Network , 2005 .

[30]  Lawrence Kreeger,et al.  Virtual eXtensible Local Area Network (VXLAN): A Framework for Overlaying Virtualized Layer 2 Networks over Layer 3 Networks , 2014, RFC.

[31]  Pablo Belzarena,et al.  SDN-based Overlay Networks for QoS-aware Routing , 2016, LANCOMM@SIGCOMM.

[32]  Ben Y. Zhao,et al.  Tapestry: a resilient global-scale overlay for service deployment , 2004, IEEE Journal on Selected Areas in Communications.

[33]  Dimitri P. Bertsekas,et al.  Dynamic Programming and Optimal Control, Two Volume Set , 1995 .

[34]  Alejandro López-Ortiz,et al.  On the number of distributed measurement points for network tomography , 2003, IMC '03.

[35]  Olivier Brun,et al.  Scalable, self-healing, and self-optimizing routing overlays , 2016, 2016 IFIP Networking Conference (IFIP Networking) and Workshops.

[36]  Erol Gelenbe,et al.  Cognitive Packet Network for Bilateral Asymmetric Connections , 2014, IEEE Transactions on Industrial Informatics.

[37]  Brian L. Mark,et al.  Delay Network Tomography Using a Partially Observable Bivariate Markov Chain , 2017, IEEE/ACM Transactions on Networking.

[38]  Antony I. T. Rowstron,et al.  Pastry: Scalable, Decentralized Object Location, and Routing for Large-Scale Peer-to-Peer Systems , 2001, Middleware.

[39]  Dinesh C. Verma,et al.  ALMI: An Application Level Multicast Infrastructure , 2001, USITS.

[40]  Alfred O. Hero,et al.  Unicast-based inference of network link delay distributions with finite mixture models , 2003, IEEE Trans. Signal Process..

[41]  Kin K. Leung,et al.  Efficient Identification of Additive Link Metrics via Network Tomography , 2013, 2013 IEEE 33rd International Conference on Distributed Computing Systems.

[42]  Giuseppe Di Battista,et al.  From BGP to RTT and Beyond: Matching BGP Routing Changes and Network Delay Variations with an Eye on Traceroute Paths , 2013, ArXiv.

[43]  Mohamad Assaad,et al.  Asymptotically Optimal Pilot Allocation Over Markovian Fading Channels , 2016, IEEE Transactions on Information Theory.

[44]  Fernando A. Kuipers,et al.  OpenNetMon: Network monitoring in OpenFlow Software-Defined Networks , 2014, 2014 IEEE Network Operations and Management Symposium (NOMS).

[45]  Ahmed Syed Irshad,et al.  Markov Decision Process , 2011 .

[46]  Y. Vardi,et al.  Network Tomography: Estimating Source-Destination Traffic Intensities from Link Data , 1996 .

[47]  Andrew A. Chien,et al.  Tolerating denial-of-service attacks using overlay networks: impact of topology , 2003, SSRS '03.

[48]  Yuval Shavitt,et al.  A Measurement Study of the Origins of End-to-End Delay Variations , 2010, PAM.

[49]  Krishna P. Gummadi,et al.  Improving the Reliability of Internet Paths with One-hop Source Routing , 2004, OSDI.

[50]  R. Kumar,et al.  Practical Beacon Placement for Link Monitoring Using Network Tomography , 2006, IEEE Journal on Selected Areas in Communications.

[51]  Erol Gelenbe,et al.  Optimizing Secure SDN-Enabled Inter-Data Centre Overlay Networks through Cognitive Routing , 2016, 2016 IEEE 24th International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS).

[52]  Eytan Modiano,et al.  A Distributed Algorithm for Throughput Optimal Routing in Overlay Networks , 2016, 2019 IFIP Networking Conference (IFIP Networking).

[53]  Randy H. Katz,et al.  An algebraic approach to practical and scalable overlay network monitoring , 2004, SIGCOMM '04.

[54]  J. Sethuraman A CONSTRUCTIVE DEFINITION OF DIRICHLET PRIORS , 1991 .

[55]  G. Finn,et al.  A Virtual Internet Architecture , 2003 .

[56]  Bernhard Ager,et al.  Outsourcing the routing control logic: better internet routing based on SDN principles , 2012, HotNets-XI.

[57]  Abishek Gopalan,et al.  On Identifying Additive Link Metrics Using Linearly Independent Cycles and Paths , 2012, IEEE/ACM Transactions on Networking.

[58]  Jörg Liebeherr,et al.  HyperCast: A Protocol for Maintaining Multicast Group Members in a Logical Hypercube Topology , 1999, Networked Group Communication.

[59]  Kensuke Fukuda,et al.  An empirical mixture model for large-scale RTT measurements , 2015, 2015 IEEE Conference on Computer Communications (INFOCOM).

[60]  Mohamad Assaad,et al.  Dynamic pilot allocation over Markovian fading channels: A restless bandit approach , 2016, 2016 IEEE Information Theory Workshop (ITW).

[61]  Wouter Tavernier,et al.  Adaptive and reliable multipath provisioning for media transfer in SDN-based overlay networks , 2017, Comput. Commun..

[62]  Rajeev Rastogi,et al.  Robust Monitoring of Link Delays and Faults in IP Networks , 2003, IEEE/ACM Transactions on Networking.

[63]  A. Chien,et al.  Tolerating Denial-of-Service Attacks Using Overlay Networks-Impact of Overlay Network Topology , 2003 .

[64]  Maurice Gagnaire,et al.  A SDN-based network architecture for cloud resiliency , 2015, 2015 12th Annual IEEE Consumer Communications and Networking Conference (CCNC).

[65]  Erol Gelenbe,et al.  Big Data for Autonomic Intercontinental Overlays , 2016, IEEE Journal on Selected Areas in Communications.

[66]  Ting He Distributed Link Anomaly Detection via Partial Network Tomography , 2017, SIGMETRICS Perform. Evaluation Rev..

[67]  Alan Edelman,et al.  Julia: A Fresh Approach to Numerical Computing , 2014, SIAM Rev..

[68]  W. Marsden I and J , 2012 .

[69]  Shun-Yun Hu,et al.  Scalable peer-to-peer networked virtual environment , 2004, NetGames '04.

[70]  Bobby Bhattacharjee,et al.  Scalable application layer multicast , 2002, SIGCOMM '02.

[71]  Iftikhar Ahmad,et al.  QoS routing for real time traffic in mobile ad hoc network , 2013, ICUIMC '13.

[72]  Micah Beck,et al.  An end-to-end approach to globally scalable programmable networking , 2003, FDNA '03.