A Novel Method of P2P Traffic Classification Based on TCP Flow Analysis

Peer-to-Peer (P2P) applications have overtaken web stream as the most significant portions on the high-speed network, so P2P applications identification is important to a broad range of network operations. By deriving the transport/network layer headers of the packets of TCP flow, we obtained some attributes of all kinds of P2P traffic without relying on packet payload and port number, which leaded to a novel method for P2P traffic identification based on support vector machine (SVM). The method only needed to deal with the TCP packet of SYN and SYN+ACK flags. Experiment results show this method classifies the traffic achieved the high accuracy.

[1]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[2]  James Won-Ki Hong,et al.  Towards Peer-to-Peer Traffic Analysis Using Flows , 2003, DSOM.

[3]  Anthony McGregor,et al.  Flow Clustering Using Machine Learning Techniques , 2004, PAM.

[4]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[5]  Andrew W. Moore,et al.  Bayesian Neural Networks for Internet Traffic Classification , 2007, IEEE Transactions on Neural Networks.

[6]  Zhitang Li,et al.  A New Method of P2P Traffic Identification Based on Support Vector Machine at the Host Level , 2009, 2009 International Conference on Information Technology and Computer Science.

[7]  Anirban Mahanti,et al.  Traffic classification using clustering algorithms , 2006, MineNet '06.

[8]  Marcus Brunner,et al.  Self-Managing Distributed Systems , 2003, Lecture Notes in Computer Science.

[9]  Tu Hao,et al.  Proposition and Provement of A TCP Feature of P2P Traffic- An Example of BitTorrent and Emule , 2007, 2007 Second International Conference on Communications and Networking in China.

[10]  Michalis Faloutsos,et al.  Transport layer identification of P2P traffic , 2004, IMC '04.

[11]  Jim Kurose,et al.  Proceedings of the 4th ACM SIGCOMM Conference on Internet Measurement 2004, Taormina, Sicily, Italy, October 25-27, 2004 , 2004 .

[12]  Panayiotis Mavrommatis,et al.  Identifying Known and Unknown Peer-to-Peer Traffic , 2006, Fifth IEEE International Symposium on Network Computing and Applications (NCA'06).

[13]  Yang Liu,et al.  Solving P2P Traffic Identification Problems Via Optimized Support Vector Machines , 2007, 2007 IEEE/ACS International Conference on Computer Systems and Applications.

[14]  Guanming Lu,et al.  A Novel P2P Traffic Identification Scheme Based on Support Vector Machine Fuzzy Network , 2009, 2009 Second International Workshop on Knowledge Discovery and Data Mining.

[15]  Luca Salgarelli,et al.  A statistical approach to IP-level classification of network traffic , 2006, 2006 IEEE International Conference on Communications.

[16]  Matthew Roughan,et al.  Class-of-service mapping for QoS: a statistical signature-based approach to IP traffic classification , 2004, IMC '04.

[17]  Oliver Spatscheck,et al.  Accurate, scalable in-network identification of p2p traffic using application signatures , 2004, WWW '04.

[18]  Jia Wang,et al.  Analyzing peer-to-peer traffic across large networks , 2002, IMW '02.