Real-time application clustering in wide area networks

Abstract Network traffic classification employing Machine Learning and Statistical approaches have contributed to the understanding of the dynamic nature of traffic. For further improvement, all phases of networks, including when the requirements of the network exceed its current resources, must be considered. With scenarios of networks with low-speed links, fragmentation and loss of packets leading to poor quality of services are highly expected, resulting in few flows being classified at a time with the features extracted. Training a classifier with few features inhibits the overall classification accuracy with real-time traffic traces. We propose a Real-Time Application Clustering (R-TAC) strategy which can classify application flows utilizing the limited flow features extracted. Results from evaluation reveal that our proposed clustering approach performs better in terms of classification accuracy (96.40%) and precision metrics (85–99%) than the existing state of the art methods, and the best classification accuracy when validated with an existing dataset.

[1]  M. Lifshits Gaussian Random Functions , 1995 .

[2]  Xiaohan Du,et al.  P2P flow classification based on wavelet transform , 2015, 2015 IEEE International Conference on Communication Problem-Solving (ICCP).

[3]  Jun Zhang,et al.  Internet Traffic Classification Using Constrained Clustering , 2014, IEEE Transactions on Parallel and Distributed Systems.

[4]  Osisanwo F.Y,et al.  Supervised Machine Learning Algorithms: Classification and Comparison , 2017 .

[5]  Antonio Nucci,et al.  Towards self adaptive network traffic classification , 2015, Comput. Commun..

[6]  Babangida Abubakar,et al.  Traffic Classification Analysis Using OMNeT , 2018 .

[7]  Feng Xiao,et al.  Network traffic classification based on transfer learning , 2018, Comput. Electr. Eng..

[8]  Huan Liu,et al.  Feature Selection for Clustering: A Review , 2018, Data Clustering: Algorithms and Applications.

[9]  Jing Yuan,et al.  A Survey of Traffic Classification in Software Defined Networks , 2018, 2018 1st IEEE International Conference on Hot Information-Centric Networking (HotICN).

[10]  Guanglu Sun,et al.  Internet Traffic Classification Based on Incremental Support Vector Machines , 2018, Mob. Networks Appl..

[11]  Mehdi Berenjkoub,et al.  A modular two-layer system for accurate and fast traffic classification , 2014, 2014 11th International ISC Conference on Information Security and Cryptology.

[12]  Gajendra Singh Chandel,et al.  An approach for classification of network traffic on semi-supervised data using clustering techniques , 2013, 2013 Nirma University International Conference on Engineering (NUiCONE).

[13]  Zoltán Nagy,et al.  Using machine learning techniques for occupancy-prediction-based cooling control in office buildings , 2018 .

[14]  Amandeep Bagga,et al.  Clustering Techniques for Traffic Classification: A Comprehensive Review , 2018, 2018 7th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO).

[15]  Min Luo,et al.  A Framework for QoS-aware Traffic Classification Using Semi-supervised Machine Learning in SDNs , 2016, 2016 IEEE International Conference on Services Computing (SCC).

[16]  Jens Myrup Pedersen,et al.  An approach for detection and family classification of malware based on behavioral analysis , 2016, 2016 International Conference on Computing, Networking and Communications (ICNC).

[17]  Lingyun Yang,et al.  Internet video traffic classification using QoS features , 2016, 2016 International Conference on Computing, Networking and Communications (ICNC).

[18]  Chuan-Mu Tseng,et al.  P2P traffic classification using clustering technology , 2016, 2016 IEEE/SICE International Symposium on System Integration (SII).

[19]  Ciprian Dobre,et al.  Internet traffic classification based on flows' statistical properties with machine learning , 2017, Int. J. Netw. Manag..

[20]  Jun Zhang,et al.  Network Traffic Classification Using Correlation Information , 2013, IEEE Transactions on Parallel and Distributed Systems.

[21]  Surajit Chaudhuri,et al.  KDD-99: the fifth ACM SIGKDD international conference on knowledge discovery and data mining , 2000, SKDD.