Internet of things and data mining: An application oriented survey

Abstract Advancement in the fields of electronic communication, data processing, and internet technologies enable easy access to and interaction with a variety of physical devices throughout the globe. Our whole world is enveloped by a blanket of innumerable smart devices equipped with the sensors and actuators. Extensive research on the Internet of things (IoT) with cloud technologies, make it possible to accumulate tremendous data created from this heterogeneous environment and transform it into precious knowledge by utilizing data mining technologies. Furthermore, this generated knowledge will play a key role in intelligent decision making, system performance boosting, and optimum management of resources and services. With this background, this paper presents a systematic and detailed review of various data mining techniques employed in the large and small scale IoT applications to formulate an intelligent environment. It also presents an overview of cloud-assisted IoT Big data mining system to better understand the importance of data mining for an IoT environment.

[1]  Torben Bach Pedersen,et al.  Time Series Management Systems: A Survey , 2017, IEEE Transactions on Knowledge and Data Engineering.

[2]  Noël Crespi,et al.  The Cluster Between Internet of Things and Social Networks: Review and Research Challenges , 2014, IEEE Internet of Things Journal.

[3]  Dazhong Wu,et al.  Deep learning for smart manufacturing: Methods and applications , 2018, Journal of Manufacturing Systems.

[4]  Mohd Wazir Mustafa,et al.  Smart grids security challenges: Classification by sources of threats , 2018, Journal of Electrical Systems and Information Technology.

[5]  Chin-Feng Lai,et al.  Multi-appliance recognition system with hybrid SVM/GMM classifier in ubiquitous smart home , 2013, Inf. Sci..

[6]  Lawrence B. Holder,et al.  Discovering Activities to Recognize and Track in a Smart Environment , 2011, IEEE Transactions on Knowledge and Data Engineering.

[7]  Wu He,et al.  Internet of Things in Industries: A Survey , 2014, IEEE Transactions on Industrial Informatics.

[8]  Zahir Tari,et al.  A Spatiotemporal Data Summarization Approach for Real-Time Operation of Smart Grid , 2020, IEEE Transactions on Big Data.

[9]  Yuguang Fang,et al.  DataClouds: Enabling Community-Based Data-Centric Services Over the Internet of Things , 2014, IEEE Internet of Things Journal.

[10]  Xiong Li,et al.  An elliptic curve cryptography based lightweight authentication scheme for smart grid communication , 2018, Future Gener. Comput. Syst..

[11]  Aggelos K. Katsaggelos,et al.  ECG-Based Classification of Resuscitation Cardiac Rhythms for Retrospective Data Analysis , 2017, IEEE Transactions on Biomedical Engineering.

[12]  John A. Stankovic,et al.  Behavioral Patterns of Older Adults in Assisted Living , 2008, IEEE Transactions on Information Technology in Biomedicine.

[13]  Ivan Stojmenovic,et al.  Machine-to-Machine Communications With In-Network Data Aggregation, Processing, and Actuation for Large-Scale Cyber-Physical Systems , 2014, IEEE Internet of Things Journal.

[14]  Fei Jiang,et al.  Big data issues in smart grid – A review , 2017 .

[15]  Lei Shu,et al.  Smart Factory of Industry 4.0: Key Technologies, Application Case, and Challenges , 2018, IEEE Access.

[16]  Sudipto Guha,et al.  CURE: an efficient clustering algorithm for large databases , 1998, SIGMOD '98.

[17]  Ilhami Colak,et al.  Smart grid technologies and applications , 2016 .

[18]  Dominik Engel,et al.  Influence of Data Granularity on Smart Meter Privacy , 2015, IEEE Transactions on Smart Grid.

[19]  H. T. Mouftah,et al.  Trustworthy Sensing for Public Safety in Cloud-Centric Internet of Things , 2014, IEEE Internet of Things Journal.

[20]  Yasser Abdel-Rady I. Mohamed,et al.  Big data framework for analytics in smart grids , 2017 .

[21]  Mohsen Guizani,et al.  Deep Learning for IoT Big Data and Streaming Analytics: A Survey , 2017, IEEE Communications Surveys & Tutorials.

[22]  Albert Y. Zomaya,et al.  Tensor-Based Big Data Management Scheme for Dimensionality Reduction Problem in Smart Grid Systems: SDN Perspective , 2018, IEEE Transactions on Knowledge and Data Engineering.

[23]  Seng W. Loke,et al.  A Review of Mobile Crowdsourcing Architectures and Challenges: Toward Crowd-Empowered Internet-of-Things , 2019, IEEE Access.

[24]  Sebastian I. Wolf,et al.  Body-Sensor-Network-Based Spasticity Detection , 2014, IEEE Journal of Biomedical and Health Informatics.

[25]  Neeraj Kumar,et al.  Consumption-Aware Data Analytical Demand Response Scheme for Peak Load Reduction in Smart Grid , 2018, IEEE Transactions on Industrial Electronics.

[26]  Sang Do Noh,et al.  Smart manufacturing: Past research, present findings, and future directions , 2016, International Journal of Precision Engineering and Manufacturing-Green Technology.

[27]  Imrich Chlamtac,et al.  Internet of things: Vision, applications and research challenges , 2012, Ad Hoc Networks.

[28]  Marimuthu Palaniswami,et al.  PPFA: Privacy Preserving Fog-Enabled Aggregation in Smart Grid , 2018, IEEE Transactions on Industrial Informatics.

[29]  John F. Roddick,et al.  A Survey of Temporal Knowledge Discovery Paradigms and Methods , 2002, IEEE Trans. Knowl. Data Eng..

[30]  Ruoming Jin,et al.  Fast and exact out-of-core and distributed k-means clustering , 2006, Knowledge and Information Systems.

[31]  Selwyn Piramuthu,et al.  Internet of Things (IoT) in high-risk Environment, Health and Safety (EHS) industries: A comprehensive review , 2018, Decis. Support Syst..

[32]  Jian Lu,et al.  Recognizing Multiuser Activities Using Wireless Body Sensor Networks , 2011, IEEE Transactions on Mobile Computing.

[33]  Abdelsalam Helal,et al.  Scalable Cloud–Sensor Architecture for the Internet of Things , 2016, IEEE Internet of Things Journal.

[34]  Jairo Alejandro Gomez,et al.  Review of IoT applications in agro-industrial and environmental fields , 2017, Comput. Electron. Agric..

[35]  Jiawei Han,et al.  Frequent pattern mining: current status and future directions , 2007, Data Mining and Knowledge Discovery.

[36]  Ruili Wang,et al.  A Survey on an Emerging Area: Deep Learning for Smart City Data , 2019, IEEE Transactions on Emerging Topics in Computational Intelligence.

[37]  Lei Shu,et al.  Challenges and Research Issues of Data Management in IoT for Large-Scale Petrochemical Plants , 2018, IEEE Systems Journal.

[38]  Siobhán Clarke,et al.  Middleware for Internet of Things: A Survey , 2016, IEEE Internet of Things Journal.

[39]  Pingyu Jiang,et al.  RFID-based production data analysis in an IoT-enabled smart job-shop , 2018, IEEE/CAA Journal of Automatica Sinica.

[40]  Thomas Bartzanas,et al.  Internet of Things in agriculture, recent advances and future challenges , 2017 .

[41]  Alberto Leon-Garcia,et al.  A Fog-Based Internet of Energy Architecture for Transactive Energy Management Systems , 2018, IEEE Internet of Things Journal.

[42]  Steven Lanzisera,et al.  Communicating Power Supplies: Bringing the Internet to the Ubiquitous Energy Gateways of Electronic Devices , 2014, IEEE Internet of Things Journal.

[43]  Vic Barnett,et al.  Outliers in Statistical Data , 1980 .

[44]  François Carrez,et al.  A Practical Evaluation of Information Processing and Abstraction Techniques for the Internet of Things , 2015, IEEE Internet of Things Journal.

[45]  Zarko Zecevic,et al.  Architecting an IoT-enabled platform for precision agriculture and ecological monitoring: A case study , 2017, Comput. Electron. Agric..

[46]  Joel J. P. C. Rodrigues,et al.  Fog-Based Crime-Assistance in Smart IoT Transportation System , 2018, IEEE Access.

[47]  Junaid Qadir,et al.  Leveraging Machine Learning and Big Data for Smart Buildings: A Comprehensive Survey , 2019, IEEE Access.

[48]  Abdulmotaleb El-Saddik,et al.  InCloud: a cloud-based middleware for vehicular infotainment systems , 2016, Multimedia Tools and Applications.

[49]  Cedric Baudoin,et al.  Towards the Deployment of Machine Learning Solutions in Network Traffic Classification: A Systematic Survey , 2019, IEEE Communications Surveys & Tutorials.

[50]  John A. Stankovic,et al.  Research Directions for the Internet of Things , 2014, IEEE Internet of Things Journal.

[51]  Amjad Anvari-Moghaddam,et al.  Optimal Smart Home Energy Management Considering Energy Saving and a Comfortable Lifestyle , 2016, IEEE Transactions on Smart Grid.

[52]  William C. Mann,et al.  The Gator Tech Smart House: a programmable pervasive space , 2005, Computer.

[53]  Gerardo Toraldo,et al.  The IoT as a tool to combine the scheduling of the irrigation with the geostatistics of the soils , 2018, Future Gener. Comput. Syst..

[54]  Li Li,et al.  Sequential Behavior Pattern Discovery with Frequent Episode Mining and Wireless Sensor Network , 2017, IEEE Communications Magazine.

[55]  Binxing Fang,et al.  A Survey on Access Control in the Age of Internet of Things , 2020, IEEE Internet of Things Journal.

[56]  Jaydip Sen,et al.  Internet of Things - Applications and Challenges in Technology and Standardization , 2011 .

[57]  Abdulsalam Yassine,et al.  Mining Human Activity Patterns From Smart Home Big Data for Health Care Applications , 2017, IEEE Access.

[58]  Tae-Jin Lee,et al.  Joint Access Control and Resource Allocation for Concurrent and Massive Access of M2M Devices , 2015, IEEE Transactions on Wireless Communications.

[59]  Taekyoung Kwon,et al.  Data Transfusion: Pairing Wearable Devices and Its Implication on Security for Internet of Things , 2018, IEEE Access.

[60]  Zibin Zheng,et al.  Blockchain for Internet of Things: A Survey , 2019, IEEE Internet of Things Journal.

[61]  Mohsen Guizani,et al.  Vcash: A Novel Reputation Framework for Identifying Denial of Traffic Service in Internet of Connected Vehicles , 2019, IEEE Internet of Things Journal.

[62]  Michel Vacher,et al.  SVM-Based Multimodal Classification of Activities of Daily Living in Health Smart Homes: Sensors, Algorithms, and First Experimental Results , 2010, IEEE Transactions on Information Technology in Biomedicine.

[63]  Inderjit S. Dhillon,et al.  Kernel k-means: spectral clustering and normalized cuts , 2004, KDD.

[64]  Vipin Kumar,et al.  Introduction to Data Mining , 2022, Data Mining and Machine Learning Applications.

[65]  Tian Zhang,et al.  BIRCH: A New Data Clustering Algorithm and Its Applications , 1997, Data Mining and Knowledge Discovery.

[66]  M. Amaç Güvensan,et al.  Driver Behavior Analysis for Safe Driving: A Survey , 2015, IEEE Transactions on Intelligent Transportation Systems.

[67]  Xiang Cheng,et al.  Smart Choice for the Smart Grid: Narrowband Internet of Things (NB-IoT) , 2018, IEEE Internet of Things Journal.

[68]  M.A. Uusitalo,et al.  Global Vision for the Future Wireless World from the WWRF , 2006, IEEE Vehicular Technology Magazine.

[69]  Gregory Faraut,et al.  Activity Discovery and Detection of Behavioral Deviations of an Inhabitant From Binary Sensors , 2015, IEEE Transactions on Automation Science and Engineering.

[70]  Joarder Kamruzzaman,et al.  A Survey on Behavioral Pattern Mining From Sensor Data in Internet of Things , 2020, IEEE Access.

[71]  Peng Li,et al.  An Adaptive Dropout Deep Computation Model for Industrial IoT Big Data Learning With Crowdsourcing to Cloud Computing , 2019, IEEE Transactions on Industrial Informatics.