Feature extraction based IP traffic classification using machine learning

With rapid growth in internet traffic over last couple of years due to the usage of large number of internet applications, IP traffic classification becomes very necessary for various internet service providers to optimize their network performance and for governmental intelligence organizations. Today, traditional IP traffic classification techniques such as port number and payload based direct packet inspection techniques are rarely used because of use of dynamic port number instead of well-known port number in packet headers and various cryptographic techniques which inhibit inspection of packet payload. Current trends are use of machine learning (ML) techniques for IP traffic classification. In this research paper, two different real time internet traffic datasets has been developed using packet capturing tool for 2 minute and 2 second packet capturing duration. After that, five ML algorithms MLP, RBF, C4.5, Bayes Net and Naïve Bayes are employed for internet traffic classification with these datasets. This experimental analysis shows that Bayes Net and C4.5 are effective ML techniques for IP traffic classification with accuracy in the range of 88% with reduction in packet capturing duration.

[1]  Ian Witten,et al.  Data Mining , 2000 .

[2]  K. Sundaraj,et al.  A study of back-propagation and radial basis neural network on EMG signal classification , 2009, 2009 6th International Symposium on Mechatronics and its Applications.

[3]  Russell Greiner,et al.  Learning Bayesian Belief Network Classifiers: Algorithms and System , 2001, Canadian Conference on AI.

[4]  Andrew W. Moore,et al.  Discriminators for use in flow-based classification , 2013 .

[5]  Kuldeep Singh,et al.  Internet Traffic Classification , 2011 .

[6]  Bo Yang,et al.  Traffic classification using probabilistic neural networks , 2010, 2010 Sixth International Conference on Natural Computation.

[7]  Ioan Pop,et al.  An approach of the Naive Bayes classifier for the document classification 1 , 2006 .

[8]  Grenville J. Armitage,et al.  A survey of techniques for internet traffic classification using machine learning , 2008, IEEE Communications Surveys & Tutorials.

[9]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[10]  Luca Salgarelli,et al.  Comparing traffic classifiers , 2007, CCRV.

[11]  Kuldeep Singh,et al.  Comparative analysis of five machine learning algorithms for IP traffic classification , 2011, 2011 International Conference on Emerging Trends in Networks and Computer Communications (ETNCC).

[12]  Shahrul Azman Noah,et al.  Performance Comparison of Multi-layer Perceptron (Back Propagation, Delta Rule and Perceptron) algorithms in Neural Networks , 2009, 2009 IEEE International Advance Computing Conference.

[13]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[14]  Judith Kelner,et al.  A Survey on Internet Traffic Identification , 2009, IEEE Communications Surveys & Tutorials.