Multipoint Passive Monitoring in Packet Networks

Traffic monitoring is essential for managing large networks and validating Service Level Agreements. Passive monitoring is particularly valuable for promptly identifying transient fault episodes and reacting in a timely manner. This article proposes a novel, non-invasive, and flexible method to passively monitor large backbone networks. Using only packet counters, which are commonly available on existing hardware, the method accurately measures packet losses in different segments of the network, including losses that affect only specific flows. It can monitor not only end-to-end flows but also any generic flow whose packets follow several different paths in the network (multipoint flows). We also sketch a possible extension of the method that measures the average one-way delay of multipoint flows, provided that the measurement points are synchronized. Experiments show that the method is effective and makes it easy to zoom in on the cause of packet losses. Moreover, the method scales to very large networks with very low overhead on both the data plane and the management plane.
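The counter-based idea can be illustrated with a minimal sketch. It is not the article's implementation: it assumes that every packet of the monitored flow enters the segment through one of the listed input measurement points and leaves through one of the listed output points, and that all counters refer to the same measurement interval. The function names and the router labels (R1 to R4) are hypothetical.

```python
from typing import Mapping


def multipoint_loss(ingress: Mapping[str, int], egress: Mapping[str, int]) -> int:
    """Packets lost inside a segment during one measurement interval.

    Assumption: the segment is closed, so the packets counted at the
    input points can only leave through the listed output points.
    """
    return sum(ingress.values()) - sum(egress.values())


def loss_ratio(ingress: Mapping[str, int], egress: Mapping[str, int]) -> float:
    """Fraction of entering packets that were lost (0.0 if nothing entered)."""
    entered = sum(ingress.values())
    return multipoint_loss(ingress, egress) / entered if entered else 0.0


# Hypothetical counter readings for one interval of a multipoint flow.
ingress_counters = {"R1": 1000, "R2": 500}
egress_counters = {"R3": 900, "R4": 595}

lost = multipoint_loss(ingress_counters, egress_counters)
ratio = loss_ratio(ingress_counters, egress_counters)
print(f"lost packets: {lost}, loss ratio: {ratio:.4%}")
```

Applying the same subtraction to a smaller set of measurement points narrows down the part of the network where the loss occurs, which is one way to read the "zooming in" described above.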
