Investigation of machine learning based network traffic classification

Timely and accurate traffic classification and application characterization are becoming increasingly important with many applications in wired and wireless networks, e.g., traffic engineering, security monitoring, and quality of service (QoS). In particular, Software Defined Networking (SDN) is a new networking paradigm that has great impact on future IP networks and 5G wireless networks. In SDN networks, application awareness is essential for functionalities such as virtual network resource slicing and fast routing. Compared to traditional classification methods such as port-based and payload-based algorithms, machine learning (ML) approaches offer a better choice in Internet traffic characterization by using payload-independent traffic statistics. In this paper, two ML algorithms, namely supervised Support Vector Machine (SVM) and unsupervised K-means clustering, are studied for traffic classification. It has been found that an overall accuracy of over 95% can be achieved. Meanwhile, the system performance can be further improved with model tuning and feature selection.

[1]  F. Richard Yu,et al.  Wireless Network Virtualization: A Survey, Some Research Issues and Challenges , 2015, IEEE Communications Surveys & Tutorials.

[2]  Anirban Mahanti,et al.  Traffic classification using clustering algorithms , 2006, MineNet '06.

[3]  Grenville J. Armitage,et al.  A survey of techniques for internet traffic classification using machine learning , 2008, IEEE Communications Surveys & Tutorials.

[4]  Jie Wu,et al.  Robust Network Traffic Classification , 2015, IEEE/ACM Transactions on Networking.

[5]  Sebastian Zander,et al.  Automated traffic classification and application identification using machine learning , 2005, The IEEE Conference on Local Computer Networks 30th Anniversary (LCN'05)l.

[6]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[7]  Andrew W. Moore,et al.  Internet traffic classification using bayesian analysis techniques , 2005, SIGMETRICS '05.

[8]  Xin Wang,et al.  Wireless network virtualization , 2013, 2013 International Conference on Computing, Networking and Communications (ICNC).

[9]  Nick McKeown,et al.  OpenFlow: enabling innovation in campus networks , 2008, CCRV.

[10]  Andrew W. Moore,et al.  Discriminators for use in flow-based classification , 2013 .

[11]  Andrew W. Moore,et al.  Architecture of a network monitor , 2003 .