As the volume of ISP traffic grows, traffic sampling has become necessary to reduce the load on capture and analysis processes. Packet sampling and flow sampling are the two main sampling techniques. For estimating the flow size distribution from sampled data, each has its own advantages and disadvantages. Flow sampling extracts flows in proportion to the original flow size distribution, but it rarely captures large flows, because such flows are few under a heavy-tailed flow size distribution. Packet sampling, on the other hand, captures large flows but cannot extract flows in their entirety. In this paper, we propose a hybrid sampling method that performs flow sampling and packet sampling in parallel, combining the advantages of both methods to improve estimation accuracy. We also propose a cost-effective implementation that employs a general-purpose switch. Using real traffic data, we confirmed the effectiveness of the proposed method in terms of reproducibility.
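To make the contrast between the two techniques concrete, the following is a minimal sketch of running packet sampling and flow sampling in parallel over the same packet stream. It is an illustration under stated assumptions, not the implementation proposed in the paper: the function names, the use of a CRC32 hash of the flow key to select flows, and the synthetic Pareto-distributed trace are all hypothetical choices, and the sketch does not show how the two sampled views would be combined into an estimate.

```python
import random
import zlib
from collections import Counter


def packet_sample(packets, p, rng):
    """Keep each packet independently with probability p.

    Returns the per-flow packet counts seen in the sample, so large flows
    are likely to appear, but with only part of their packets.
    """
    counts = Counter()
    for flow_id in packets:
        if rng.random() < p:
            counts[flow_id] += 1
    return counts


def flow_sample(packets, q):
    """Keep all packets of a flow when the hash of its key falls below q.

    Hash-based selection (an assumption of this sketch) keeps a flow either
    completely or not at all, so sampled flow sizes are exact.
    """
    threshold = int(q * 2**32)  # crc32 yields values in [0, 2**32)
    counts = Counter()
    for flow_id in packets:
        if zlib.crc32(str(flow_id).encode()) < threshold:
            counts[flow_id] += 1
    return counts


def hybrid_sample(packets, p, q, seed=0):
    """Run both samplers over the same stream and return both views."""
    rng = random.Random(seed)
    return packet_sample(packets, p, rng), flow_sample(packets, q)


def make_trace(n_flows, seed=0):
    """Build a synthetic trace with heavy-tailed (Pareto) flow sizes."""
    rng = random.Random(seed)
    sizes = {f: int(rng.paretovariate(1.2)) for f in range(n_flows)}
    packets = [f for f, n in sizes.items() for _ in range(n)]
    rng.shuffle(packets)
    return sizes, packets


if __name__ == "__main__":
    sizes, packets = make_trace(10_000)
    pkt_view, flow_view = hybrid_sample(packets, p=0.01, q=0.01)
    print("largest true flow:", max(sizes.values()))
    print("largest flow seen by flow sampling:", max(flow_view.values(), default=0))
    print("flows touched by packet sampling:", len(pkt_view))
```

In this sketch, every flow in the flow-sampled view has its true size, while the packet-sampled view holds only a fraction of each flow's packets; those are the complementary properties the hybrid method aims to exploit.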