A Hybrid Intrusion Detection System: Integrating Hybrid Feature Selection Approach with Heterogeneous Ensemble of Intelligent Classifiers

This paper proposes Hybrid Feature Selection Approach – Heterogeneous Ensemble of Intelligent Classifiers (HyFSA-HEIC) for intelligent lightweight network intrusion detection system (NIDS). The purpose is to classify for anomaly from the incoming traffic. This system hierarchically integrates HyFSA and HEIC. The HyFSA will obtain the optimal number of features and then HEIC is built using these optimal features. HyFSA helps to decrease the computation time of the system and make it lightweight to work in real time. The aim of HEIC is to obtain accurate and robust classifier and enhance overall performance of the system. The results demonstrate that proposed system outperforms other ensemble and single classifier methods used in this paper. It has true positive rate (99.9%), accuracy (99.91%), precision (99.9%), receiver operating characteristics (99.9%), low false positive rate (0.1%) and lower root mean square error rate (3.06%) with a minimum number of selected 6 features. It also reduces time to build and time to test the model by 50.79% and 55.30% respectively on reduced features set. The results evince that detection rate, accuracy and precision of the system is increased by incorporating feature selection approach with heterogeneous ensemble of intelligent classifiers and significantly reduce the computation time.

[1]  Arputharaj Kannan,et al.  Decision tree based light weight intrusion detection using a wrapper approach , 2012, Expert Syst. Appl..

[2]  Léon Bottou,et al.  Large-Scale Machine Learning with Stochastic Gradient Descent , 2010, COMPSTAT.

[3]  Namita Mittal,et al.  Hybrid Approach for Detection of Anomaly Network Traffic using Data Mining Techniques , 2012 .

[4]  Yinhui Li,et al.  An efficient intrusion detection system based on support vector machines and gradually feature removal method , 2012, Expert Syst. Appl..

[5]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[6]  Thomas G. Dietterich Multiple Classifier Systems , 2000, Lecture Notes in Computer Science.

[7]  Amrita,et al.  A Hybrid-Based Feature Selection Approach for IDS , 2014 .

[8]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[9]  Manas Ranjan Patra,et al.  A Hybrid Intelligent Approach for Network Intrusion Detection , 2012 .

[10]  Vahid Golmah An Efficient Hybrid Intrusion Detection System based on C5.0 and SVM , 2014 .

[11]  Gisung Kim,et al.  A novel hybrid intrusion detection method integrating anomaly detection with misuse detection , 2014, Expert Syst. Appl..

[12]  Pat Langley,et al.  Selection of Relevant Features and Examples in Machine Learning , 1997, Artif. Intell..

[13]  Sanmay Das,et al.  Filters, Wrappers and a Boosting-Based Hybrid for Feature Selection , 2001, ICML.

[14]  K Nageswararao,et al.  A Study of Feature Selection Methods in Intrusion Detection System: A Survey , 2017 .

[15]  Chou-Yuan Lee,et al.  An intelligent algorithm with feature selection and decision rules applied to anomaly intrusion detection , 2012, Appl. Soft Comput..

[16]  Ron Kohavi,et al.  Feature Selection for Knowledge Discovery and Data Mining , 1998 .

[17]  Huang Chuanhe,et al.  Anomaly Based Intrusion Detection Using Hybrid Learning Approach of Combining k-Medoids Clustering and Naïve Bayes Classification , 2012, 2012 8th International Conference on Wireless Communications, Networking and Mobile Computing.

[18]  Usama M. Fayyad,et al.  Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning , 1993, IJCAI.

[19]  Wail S. Elkilani,et al.  A hybrid approach for efficient anomaly detection using metaheuristic methods , 2014, Journal of advanced research.

[20]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[21]  William W. Cohen Fast Effective Rule Induction , 1995, ICML.

[22]  Lior Rokach,et al.  Ensemble-based classifiers , 2010, Artificial Intelligence Review.

[23]  Ludmila I. Kuncheva,et al.  Measures of Diversity in Classifier Ensembles and Their Relationship with the Ensemble Accuracy , 2003, Machine Learning.

[24]  Subhash C. Bagui,et al.  Combining Pattern Classifiers: Methods and Algorithms , 2005, Technometrics.