Scalable kernel convex hull online support vector machine for intelligent network traffic classification

Online support vector machine (SVM) is an effective learning method in real-time network traffic classification tasks. However, due to its geometric characteristics, the traditional online SVMs are sensitive to noise and class imbalance. In this paper, a scalable kernel convex hull online SVM called SKCHO-SVM is proposed to solve this problem. SKCHO-SVM involves two stages: (1) offline leaning stage, in which the noise points are deleted and initial pin-SVM classifier is built; (2) online updating stage, in which the classifier is updated with newly arrived data points, while carrying out the classification task. The noise deleting strategy and pinball loss function ensure SKCHO-SVM insensitive to noise data flows. Based on the scalable kernel convex hull, a small amount of convex hull vertices are dynamically selected as the training data points in each class, and the obtained scalable kernel convex hull can relieve class imbalance. Theoretical analysis and numerical experiments show that SKCHO-SVM has the distinctive ability of training time and classification performance.

[1]  C. Lee Giles,et al.  Nonconvex Online Support Vector Machines , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Dinil Mon Divakaran,et al.  SLIC: Self-Learning Intelligent Classifier for network traffic , 2015, Comput. Networks.

[3]  Pramod P. Khargonekar,et al.  Fast SVM training using approximate extreme points , 2013, J. Mach. Learn. Res..

[4]  Johan A. K. Suykens,et al.  Support Vector Machine Classifier With Pinball Loss , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Modjtaba Rouhani,et al.  Fast and de-noise support vector machine training method based on fuzzy clustering method for large real world datasets , 2016 .

[6]  Hong Qiao,et al.  A Fast Algorithm of Convex Hull Vertices Selection for Online Classification , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[7]  Korris Fu-Lai Chung,et al.  Kernel Density Estimation, Kernel Methods, and Fast Learning in Large Data Sets , 2014, IEEE Transactions on Cybernetics.

[8]  Antonio Pescapè,et al.  Issues and future directions in traffic classification , 2012, IEEE Network.

[9]  Pere Barlet-Ros,et al.  Independent comparison of popular DPI tools for traffic classification , 2015, Comput. Networks.

[10]  Xin Liu,et al.  Deep Learning for Encrypted Traffic Classification: An Overview , 2018, IEEE Communications Magazine.

[11]  Jie Wu,et al.  Robust Network Traffic Classification , 2015, IEEE/ACM Transactions on Networking.

[12]  Jie Cao,et al.  An accurate traffic classification model based on support vector machines , 2017, Int. J. Netw. Manag..

[13]  Min Wang,et al.  Online Support Vector Machine Based on Convex Hull Vertices Selection , 2013, IEEE Transactions on Neural Networks and Learning Systems.

[14]  Robert P. W. Duin,et al.  Support Vector Data Description , 2004, Machine Learning.

[15]  Sanjay Garg,et al.  A survey of simulators for P2P overlay networks with a case study of the P2P tree overlay using an event-driven simulator , 2017 .

[16]  Grenville J. Armitage,et al.  A survey of techniques for internet traffic classification using machine learning , 2008, IEEE Communications Surveys & Tutorials.

[17]  Johan A. K. Suykens,et al.  Asymmetric least squares support vector machine classifiers , 2014, Comput. Stat. Data Anal..

[18]  Jie Chen,et al.  Online Least Squares One-Class Support Vector Machines-Based Abnormal Visual Event Detection , 2013, Sensors.

[19]  Korris Fu-Lai Chung,et al.  Extreme vector machine for fast training on large data , 2020, Int. J. Mach. Learn. Cybern..

[20]  Dario Rossi,et al.  KISS: Stochastic Packet Inspection Classifier for UDP Traffic , 2010, IEEE/ACM Transactions on Networking.

[21]  Johan A. K. Suykens,et al.  Asymmetric v-tube support vector regression , 2014, Comput. Stat. Data Anal..

[22]  Shunyi Zhang,et al.  Internet Traffic Classification based on Fuzzy Kernel K-means Clustering , 2011 .

[23]  Guanglu Sun,et al.  Internet Traffic Classification Based on Incremental Support Vector Machines , 2018, Mob. Networks Appl..

[24]  Hongyuan Wang,et al.  Scalable transfer support vector machine with group probabilities , 2018, Neurocomputing.

[25]  Korris Fu-Lai Chung,et al.  Fast convex-hull vector machine for training on large-scale ncRNA data classification tasks , 2018, Knowl. Based Syst..

[26]  Marco Canini,et al.  Efficient application identification and the temporal and spatial stability of classification schema , 2009, Comput. Networks.

[27]  Farnam Jahanian,et al.  Internet inter-domain traffic , 2010, SIGCOMM '10.

[28]  Steven C. H. Hoi,et al.  Cost-Sensitive Online Classification , 2012, 2012 IEEE 12th International Conference on Data Mining.

[29]  Jason Weston,et al.  Fast Kernel Classifiers with Online and Active Learning , 2005, J. Mach. Learn. Res..

[30]  Konstantina Papagiannaki,et al.  Toward the Accurate Identification of Network Applications , 2005, PAM.