Two Samples are Enough: Opportunistic Flow-level Latency Estimation using NetFlow

The inherent measurement support in routers (SNMP counters or NetFlow) is not sufficient to diagnose performance problems in IP networks, especially flow-specific problems in which the aggregate behavior within a router appears normal. To address this problem, we propose a Consistent NetFlow (CNF) architecture for obtaining per-flow performance measurements within routers. CNF relies on the existing NetFlow architecture, which already reports the first and last timestamps of each flow, and on hash-based sampling to ensure that two routers record the same flows. We devise a novel Multiflow estimator that approximates the intermediate delay samples of a flow from other background flows, which significantly improves the per-flow latency estimates compared to the naive estimator that uses only the flow's own samples. In our experiments using real backbone traces and realistic delay models, we show that the Multiflow estimator is accurate, with a median relative error of less than 20% for flows larger than 100 packets. We also show that a prior approach based on trajectory sampling performs about 2-3x worse.
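The two building blocks named above, hash-based flow sampling and the naive two-sample estimator, can be illustrated with a minimal Python sketch. It is not the authors' implementation: the names `FlowRecord`, `flow_is_sampled`, and `naive_latency_estimate` are hypothetical, the choice of SHA-256 as the hash is an assumption, and averaging the first-packet and last-packet delays is one plausible reading of the "naive estimator". The sketch also assumes that the two routers have synchronized clocks and that the first and last packets of the flow are the same at both routers.

```python
import hashlib
from dataclasses import dataclass


@dataclass
class FlowRecord:
    """Hypothetical NetFlow-style record: first/last packet timestamps in seconds."""
    first_ts: float
    last_ts: float
    packets: int


def flow_is_sampled(flow_key: str, sampling_rate: float) -> bool:
    """Hash-based sampling decision (assumed scheme, SHA-256 chosen for illustration).

    Every router hashes the same flow key with the same function, so all
    routers make the same keep/drop decision for a given flow.
    """
    digest = hashlib.sha256(flow_key.encode("utf-8")).digest()
    value = int.from_bytes(digest[:8], "big")  # uniform in [0, 2**64)
    threshold = int(sampling_rate * 2**64)     # keep if value falls below it
    return value < threshold


def naive_latency_estimate(upstream: FlowRecord, downstream: FlowRecord) -> float:
    """Average of the two delay samples a flow record pair provides.

    Assumption: the estimate is the mean of the first-packet delay and the
    last-packet delay between the two routers.
    """
    first_delay = downstream.first_ts - upstream.first_ts
    last_delay = downstream.last_ts - upstream.last_ts
    return (first_delay + last_delay) / 2.0


if __name__ == "__main__":
    key = "10.0.0.1:1234->10.0.0.2:80/tcp"  # hypothetical flow key
    if flow_is_sampled(key, sampling_rate=1.0):
        up = FlowRecord(first_ts=10.0, last_ts=20.0, packets=150)
        down = FlowRecord(first_ts=10.002, last_ts=20.004, packets=150)
        print(f"estimated latency: {naive_latency_estimate(up, down):.6f} s")
```

The Multiflow estimator is not sketched here because the abstract does not specify how delay samples from background flows are combined.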
