Hierarchical Virtual Bitmaps for Spread Estimation in Traffic Measurement

This paper introduces a hierarchical traffic model for spread measurement of network traffic flows. The hierarchical model, which aggregates lower level flows into higher-level flows in a hierarchical structure, will allow us to measure network traffic at different granularities at once to support diverse traffic analysis from a grand view to fine-grained details. The spread of a flow is the number of distinct elements (under measurement) in the flow, where the flow label (that identifies packets belonging to the flow) and the elements (which are defined based on application need) can be found in packet headers or payload. Traditional flow spread estimators are designed without hierarchical traffic modeling in mind, and incur high overhead when they are applied to each level of the traffic hierarchy. In this paper, we propose a new Hierarchical Virtual bitmap Estimator (HVE) that performs simultaneous multi-level traffic measurement, at the same cost of a traditional estimator, without degrading measurement accuracy. We implement the proposed solution and perform experiments based on real traffic traces. The experimental results demonstrate that HVE improves measurement throughput by 43% to 155%, thanks to the reduction of perpacket processing overhead. For small to medium flows, its measurement accuracy is largely similar to traditional estimators that work at one level at a time. For large aggregate and base flows, its accuracy is better, with up to 97% smaller error in our experiments.

[1]  Junzhou Luo,et al.  Estimating Cardinality for Arbitrarily Large Data Stream With Improved Memory Efficiency , 2020, IEEE/ACM Transactions on Networking.

[2]  Ramesh Govindan,et al.  Resource/accuracy tradeoffs in software-defined measurement , 2013, HotSDN '13.

[3]  Shigang Chen,et al.  Online Spread Estimation with Non-duplicate Sampling , 2020, IEEE INFOCOM 2020 - IEEE Conference on Computer Communications.

[4]  Min Chen,et al.  Hyper-Compact Virtual Estimators for Big Network Data Based on Register Sharing , 2015, SIGMETRICS.

[5]  Jih-Kwon Peir,et al.  Fit a Compact Spread Estimator in Small High-Speed Memory , 2011, IEEE/ACM Transactions on Networking.

[6]  Shigang Chen,et al.  Fast and compact per-flow traffic measurement through randomized counter sharing , 2011, 2011 Proceedings IEEE INFOCOM.

[7]  Yuanda Wang,et al.  Accurate Hierarchical Traffic Measurement in Datacenters Through Differentiated Memory Allocation , 2020, 2020 6th International Conference on Big Data Computing and Communications (BIGCOM).

[8]  Jih-Kwon Peir,et al.  Randomized Error Removal for Online Spread Estimation in Data Streaming , 2021, Proc. VLDB Endow..

[9]  Shigang Chen,et al.  Generalized Sketch Families for Network Traffic Measurement , 2020, Abstracts of the 2020 SIGMETRICS/Performance Joint International Conference on Measurement and Modeling of Computer Systems.

[10]  Vladimir Braverman,et al.  One Sketch to Rule Them All: Rethinking Network Flow Monitoring with UnivMon , 2016, SIGCOMM.

[11]  Rodrigo Braga,et al.  Lightweight DDoS flooding attack detection using NOX/OpenFlow , 2010, IEEE Local Computer Network Conference.

[12]  Minlan Yu,et al.  Software Defined Traffic Measurement with OpenSketch , 2013, NSDI.

[13]  P. Flajolet,et al.  HyperLogLog: the analysis of a near-optimal cardinality estimation algorithm , 2007 .

[14]  Xin Li,et al.  Distributed and collaborative traffic monitoring in software defined networks , 2014, HotSDN.

[15]  You Zhou,et al.  Generalized Sketch Families for Network Traffic Measurement , 2019, Proc. ACM Meas. Anal. Comput. Syst..

[16]  David P. Woodruff,et al.  An optimal algorithm for the distinct elements problem , 2010, PODS '10.

[17]  Kyu-Young Whang,et al.  A linear-time probabilistic counting algorithm for database applications , 1990, TODS.

[18]  Shiping Chen,et al.  Efficient Hierarchical Traffic Measurement in Software-Defined Datacenter Networks , 2017, 2017 IEEE 10th International Conference on Cloud Computing (CLOUD).

[19]  Yu-e Sun,et al.  Self-Adaptive Sampling for Network Traffic Measurement , 2021, IEEE INFOCOM 2021 - IEEE Conference on Computer Communications.

[20]  Shaojie Tang,et al.  An Efficient K-Persistent Spread Estimator for Traffic Measurement in High-Speed Networks , 2020, IEEE/ACM Transactions on Networking.

[21]  VARUN CHANDOLA,et al.  Anomaly detection: A survey , 2009, CSUR.

[22]  Peng Liu,et al.  Elastic sketch: adaptive and fast network-wide measurements , 2018, SIGCOMM.

[23]  Shigang Chen,et al.  Universal Online Sketch for Tracking Heavy Hitters and Estimating Moments of Data Streams , 2020, IEEE INFOCOM 2020 - IEEE Conference on Computer Communications.

[24]  Graham Cormode,et al.  An Improved Data Stream Summary: The Count-Min Sketch and Its Applications , 2004, LATIN.

[25]  Walter Willinger,et al.  cSamp: A System for Network-Wide Flow Monitoring , 2008, NSDI.