Network Traffic Classification Analysis Using Machine Learning Algorithms

In the world of networking, it sometimes becomes essential to know what types of applications flow through the network for performance of certain tasks. Network traffic classification sees its main usage among ISP’s to analyze the characteristics required to design the network and hence affects the overall performance of a network. There are various techniques adopted to classify network protocols, such as port-based, pay-load based and Machine Learning based, all of them have their own pros and cons. Prominent nowadays is Machine Learning technique due to its vastness in usage in other fields and growing knowledge among researchers of its better accuracy among others when compared. In this paper, we compare two of the basic algorithms, Naïve Bayes and K nearest algorithm results when employed to networking data set extracted from live video feed using Wireshark software. For an implementation of Machine learning algorithm, python sklearn library is used with numpy and pandas library used as helper libraries. Finally, we observe that K nearest algorithm gives more accurate prediction than Naïve Bayes Algorithm, Decision Tree Algorithm and Support Vector Machine.

[1]  Vishnu Sharma,et al.  Documents ranking using new learning approach , 2016, 2016 International Conference on Computing, Communication and Automation (ICCCA).

[2]  S. Karthika,et al.  A Naïve Bayesian Classifier for Educational Qualification , 2015 .

[3]  Kuldeep Singh,et al.  Comparative analysis of five machine learning algorithms for IP traffic classification , 2011, 2011 International Conference on Emerging Trends in Networks and Computer Communications (ETNCC).

[4]  Li Ru-ping Research of Decision Tree Classification Algorithm in Data Mining , 2010 .

[5]  Grenville J. Armitage,et al.  A survey of techniques for internet traffic classification using machine learning , 2008, IEEE Communications Surveys & Tutorials.

[6]  Nabin Kumar Karn,et al.  Network Traffic Classification techniques and comparative analysis using Machine Learning algorithms , 2016, 2016 2nd IEEE International Conference on Computer and Communications (ICCC).

[7]  Lekha Bhambhu,et al.  DATA CLASSIFICATION USING SUPPORT VECTOR MACHINE , 2009 .

[8]  Abhinav Sharma,et al.  The low-rate denial of service attack based comparative study of active queue management scheme , 2017, 2017 Tenth International Conference on Contemporary Computing (IC3).

[9]  Paweł Foremski On different ways to classify Internet traffic : a short review of selected publications , 2013 .