A P2P Traffic Identification Method Based on VFDT

We analyzed the memory limitation problem of traffic identification arithmetic when faced the large and fast stream data, and extracted the traffic attribute based on the P2P working mechanism. Using VFDT method to identify the P2P traffic can scan the traffic data only once relying on the Hoeffding Restriction, the method reduce the complexity of algorithm on the part of timing and memory and ensure the identification correction rate. The experiment shows the method can get good performance.

[1]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[2]  James Won-Ki Hong,et al.  Towards Peer-to-Peer Traffic Analysis Using Flows , 2003, DSOM.

[3]  Michalis Faloutsos,et al.  Transport layer identification of P2P traffic , 2004, IMC '04.

[4]  Fabrice Guillemin,et al.  Impact of peer-to-peer applications on wide area network traffic: an experimental approach , 2004, IEEE Global Telecommunications Conference, 2004. GLOBECOM '04..

[5]  Oliver Spatscheck,et al.  Accurate, scalable in-network identification of p2p traffic using application signatures , 2004, WWW '04.

[6]  Matthew Roughan,et al.  P2P the gorilla in the cable , 2003 .

[7]  Panayiotis Mavrommatis,et al.  Identifying Known and Unknown Peer-to-Peer Traffic , 2006, Fifth IEEE International Symposium on Network Computing and Applications (NCA'06).

[8]  Geoff Hulten,et al.  Mining high-speed data streams , 2000, KDD '00.

[9]  Sebastian Zander,et al.  Self-Learning IP Traffic Classification Based on Statistical Flow Characteristics , 2005, PAM.

[10]  W. Hoeffding Probability Inequalities for sums of Bounded Random Variables , 1963 .

[11]  Jia Wang,et al.  Analyzing peer-to-peer traffic across large networks , 2004, IEEE/ACM Trans. Netw..

[12]  Andrew W. Moore,et al.  Hoeffding Races: Accelerating Model Selection Search for Classification and Function Approximation , 1993, NIPS.

[13]  S. Kamei,et al.  Practicable network design for handling growth in the volume of peer-to-peer traffic , 2003, 2003 IEEE Pacific Rim Conference on Communications Computers and Signal Processing (PACRIM 2003) (Cat. No.03CH37490).

[14]  Huan Liu,et al.  Discretization: An Enabling Technique , 2002, Data Mining and Knowledge Discovery.

[15]  Bijan Raahemi,et al.  Classification of Peer-to-Peer Traffic Using Neural Networks , 2007, Artificial Intelligence and Pattern Recognition.