IoTCrawler: Challenges and Solutions for Searching the Internet of Things

Due to the rapid development of the Internet of Things (IoT) and the resulting availability of more and more IoT data sources, mechanisms for searching and integrating these sources have become essential to leverage all relevant data for improving processes and services. This paper presents the IoT search framework IoTCrawler. IoTCrawler is not just another IoT framework; it is a system of systems that connects existing solutions to offer interoperability and to overcome data fragmentation. In addition to its domain-independent design, IoTCrawler features a layered approach, offering solutions for crawling, indexing and searching IoT data sources while ensuring privacy and security, adaptivity and reliability. The concept is validated against a list of requirements defined for searching the IoT and through an extensive evaluation. In addition, real-world use cases showcase the applicability of the framework and provide examples of how it can be instantiated for new scenarios.
