Intelligent IoT Traffic Classification Using Novel Search Strategy for Fast-Based-Correlation Feature Selection in Industrial Environments

The Internet of Things (IoT) can be combined with machine learning to provide intelligent applications on the network nodes. Furthermore, IoT extends these advantages and technologies to industry. In this paper, we propose a modification of one of the most popular feature selection algorithms, the fast correlation-based filter (FCBF). The key idea is to split the feature space into fragments of equal size. Introducing this division improves the correlation analysis and, therefore, the machine learning applications operating on each node. In industry, this kind of IoT application allows us to separate sensor data from multimedia-related traffic and to prioritize it. With this separation, the sensors can efficiently detect emergency situations and help avoid both material and human damage. The results show the performance of the three FCBF-based algorithms for different problems and different classifiers, confirming the improvements achieved by our approach in terms of model accuracy and execution time.
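
The fragment idea can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes discrete (or pre-discretized) features, uses symmetrical uncertainty as the correlation measure as in the original FCBF, and runs a plain FCBF pass independently inside each fragment. The function names, the `n_fragments` and `threshold` parameters, and the choice to take the union of the per-fragment selections are all assumptions made for illustration.

```python
import numpy as np


def entropy(x):
    """Shannon entropy (in bits) of a discrete 1-D array."""
    _, counts = np.unique(x, return_counts=True)
    p = counts / counts.sum()
    return float(-(p * np.log2(p)).sum())


def joint_entropy(x, y):
    """Shannon entropy (in bits) of the pair (x, y)."""
    pairs = np.stack([x, y], axis=1)
    _, counts = np.unique(pairs, axis=0, return_counts=True)
    p = counts / counts.sum()
    return float(-(p * np.log2(p)).sum())


def symmetrical_uncertainty(x, y):
    """SU(x, y) = 2 * I(x; y) / (H(x) + H(y)), a value in [0, 1]."""
    hx, hy = entropy(x), entropy(y)
    if hx + hy == 0.0:
        return 0.0
    mutual_info = hx + hy - joint_entropy(x, y)
    return 2.0 * mutual_info / (hx + hy)


def fcbf(X, y, candidates, threshold=0.0):
    """Plain FCBF restricted to the column indices in `candidates`."""
    su_class = {j: symmetrical_uncertainty(X[:, j], y) for j in candidates}
    # Rank the relevant features by decreasing correlation with the class.
    ranked = sorted(
        (j for j in candidates if su_class[j] > threshold),
        key=lambda j: -su_class[j],
    )
    selected = []
    for j in ranked:
        # Drop j if an already selected feature is at least as correlated
        # with j as j is with the class (j is then considered redundant).
        if all(symmetrical_uncertainty(X[:, k], X[:, j]) < su_class[j]
               for k in selected):
            selected.append(j)
    return selected


def fragmented_fcbf(X, y, n_fragments, threshold=0.0):
    """Hypothetical variant: split the columns into (near-)equal fragments,
    run FCBF inside each one, and return the union of the selections."""
    fragments = np.array_split(np.arange(X.shape[1]), n_fragments)
    selected = []
    for fragment in fragments:
        selected.extend(fcbf(X, y, fragment.tolist(), threshold))
    return sorted(selected)
```

Calling `fragmented_fcbf(X, y, n_fragments=4)` on an integer-coded feature matrix `X` and label vector `y` returns the indices of the retained columns. Because redundancy is only checked between features in the same fragment, the number of pairwise comparisons drops, at the cost of possibly keeping features that are redundant across fragments.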
