Towards a Lightweight Detection System for Cyber Attacks in the IoT Environment Using Corresponding Features

The widespread deployment of Internet of Things (IoT) devices makes our lives more convenient and industries more efficient. However, it also makes cyber-attacks much easier to carry out, because so many IoT devices are deployed and most of them lack the resources (i.e., computation and storage capacity) to run conventional intrusion detection systems (IDSs). In this study, a lightweight machine learning-based IDS using a new feature selection algorithm is designed and implemented on a Raspberry Pi, and its performance is verified using a public dataset collected from an IoT environment. To make the system lightweight, we propose a new feature selection algorithm, called the correlated-set thresholding on gain-ratio (CST-GR) algorithm, which selects only the features that are truly necessary. Because feature selection is conducted for each of three specific kinds of cyber-attacks, the number of selected features can be significantly reduced, which makes the classifiers very small and fast. Thus, our detection system is lightweight enough to be implemented and run on a Raspberry Pi. More importantly, because the features that correspond to each kind of attack are exploited, good detection performance can be expected. The performance of our proposal is examined in detail with different machine learning algorithms in order to determine which of them is the best option for our system. The experimental results indicate that the new feature selection algorithm selects only a few features for each kind of attack. As a result, the detection system is lightweight enough to be implemented in the Raspberry Pi environment with almost no sacrifice in detection performance.
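
The abstract does not spell out the steps of CST-GR, so the following is only a minimal sketch of the general idea it builds on: scoring each feature by its gain ratio with respect to one kind of attack and keeping the features whose score reaches a threshold. The correlated-set part of the algorithm is not reproduced here. The function names, the toy data, the attack names `attack_a` and `attack_b`, and the threshold value 0.5 are all assumptions made for illustration, and features are assumed to be discrete.

```python
import math
from collections import Counter


def entropy(values):
    """Shannon entropy (in bits) of a sequence of discrete values."""
    total = len(values)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(values).values())


def gain_ratio(feature, labels):
    """Information gain of a discrete feature about the labels,
    divided by the feature's split information (its own entropy)."""
    total = len(labels)
    groups = {}
    for value, label in zip(feature, labels):
        groups.setdefault(value, []).append(label)
    # Expected entropy of the labels after splitting on the feature.
    conditional = sum(len(g) / total * entropy(g) for g in groups.values())
    info_gain = entropy(labels) - conditional
    split_info = entropy(feature)
    # A constant feature has zero split information and carries no signal.
    return info_gain / split_info if split_info > 0 else 0.0


def select_features(columns, labels, threshold):
    """Names of the features whose gain ratio is at least `threshold`."""
    scores = {name: gain_ratio(vals, labels) for name, vals in columns.items()}
    return sorted(name for name, score in scores.items() if score >= threshold)


# Hypothetical toy data: three discrete features over four traffic records.
columns = {
    "proto": ["tcp", "tcp", "udp", "udp"],
    "rate": ["hi", "lo", "hi", "lo"],
    "const": ["x", "x", "x", "x"],
}
# One binary label vector per (hypothetical) kind of attack.
labels_by_attack = {
    "attack_a": [1, 1, 0, 0],
    "attack_b": [1, 0, 1, 0],
}
# Select features separately for each kind of attack.
selected = {attack: select_features(columns, y, threshold=0.5)
            for attack, y in labels_by_attack.items()}
print(selected)
```

Running the selection once per kind of attack, rather than once over all attacks together, is what allows each classifier to receive its own small feature set in this sketch.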
