A novel feature selection technique based on Roach Infestation Optimization for Internet Traffic Classification

In this paper, we propose a novel feature selection technique based on the Roach Infestation Optimization (RIO) meta heuristic for Internet traffic classification. The RIO based feature selection technique is a pre-processing step before launching the classification task where the aim is to identify the set of significant features to be used in the classification task. The proposed technique is combined with both random forest and Bayes network classifiers and evaluated on the well known NIMS dataset. The numerical results show the effectiveness of the proposed technique for Internet traffic classification.

[1]  Bernd Freisleben,et al.  Unsupervised Traffic Flow Classification Using a Neural Autoencoder , 2017, 2017 IEEE 42nd Conference on Local Computer Networks (LCN).

[2]  Grenville J. Armitage,et al.  A survey of techniques for internet traffic classification using machine learning , 2008, IEEE Communications Surveys & Tutorials.

[3]  C. Zhaohui,et al.  Notice of Retraction Cockroach Swarm Optimization , 2010 .

[4]  Andrew W. Moore,et al.  Internet traffic classification using bayesian analysis techniques , 2005, SIGMETRICS '05.

[5]  Sebastian Zander,et al.  Automated traffic classification and application identification using machine learning , 2005, The IEEE Conference on Local Computer Networks 30th Anniversary (LCN'05)l.

[6]  Nir Friedman,et al.  Bayesian Network Classifiers , 1997, Machine Learning.

[7]  Ali Kashif Bashir,et al.  A machine learning approach for feature selection traffic classification using security analysis , 2018, The Journal of Supercomputing.

[8]  James M. Keller,et al.  Roach Infestation Optimization , 2008, 2008 IEEE Swarm Intelligence Symposium.

[9]  Xiaohong Guan,et al.  Accurate Classification of the Internet Traffic Based on the SVM Method , 2007, 2007 IEEE International Conference on Communications.

[10]  Liu Yingqiu,et al.  Network Traffic Classification Using K-means Clustering , 2007, Second International Multi-Symposiums on Computer and Computational Sciences (IMSCCS 2007).

[11]  Joanna Kwiecien Cockroach Swarm Optimization , 2020 .

[12]  Matthew Roughan,et al.  Class-of-service mapping for QoS: a statistical signature-based approach to IP traffic classification , 2004, IMC '04.

[13]  Zhiqiang Li,et al.  Efficient, Accurate Internet Traffic Classification using Discretization in Naive Bayes , 2008, 2008 IEEE International Conference on Networking, Sensing and Control.

[14]  Mark A. Hall,et al.  Correlation-based Feature Selection for Discrete and Numeric Class Machine Learning , 1999, ICML.

[15]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[16]  Li Wei,et al.  Network Traffic Classification Using K-means Clustering , 2007 .

[17]  Ming Zhu,et al.  End-to-end encrypted traffic classification with one-dimensional convolution neural networks , 2017, 2017 IEEE International Conference on Intelligence and Security Informatics (ISI).

[18]  Anirban Mahanti,et al.  Traffic classification using clustering algorithms , 2006, MineNet '06.