Quality of Information within Internet of Things Data

Due to the increasing number of IoT devices, the amount of data gathered nowadays is rather large and continuously growing. The availability of new sensors presented in IoT devices and open data platforms provides new possibilities for innovative applications and use-cases. However, the dependence on data for the provision of services creates the necessity of assuring the quality of data to ensure the viability of the services. In order to support the evaluation of the valuable information, this chapter shows the development of a series of metrics that have been defined as indicators of the quality of data in a quantifiable, fast, reliable, and human-understandable way. The metrics are based on sound statistical indicators. Statistical analysis, machine learning algorithms, and contextual information are some of the methods to create quality indicators. The developed framework is also suitable for deciding between different datasets that hold similar information, since until now with no way of rapidly discovering which one is best in terms of quality had been developed. These metrics have been applied to real scenarios which have been smart parking and environmental sensing for smart buildings, and in both cases, the methods have been representative for the quality of the data.

[1]  Timo Hämäläinen,et al.  Increasing web service availability by detecting application-layer DDoS attacks in encrypted traffic , 2016, 2016 23rd International Conference on Telecommunications (ICT).

[2]  Aurora González-Vidal,et al.  Distributed real-time SlowDoS attacks detection over encrypted traffic using Artificial Intelligence , 2021, J. Netw. Comput. Appl..

[3]  Yangyong Zhu,et al.  The Challenges of Data Quality and Data Quality Assessment in the Big Data Era , 2015, Data Sci. J..

[4]  Lon-Mu Liu,et al.  Joint Estimation of Model Parameters and Outlier Effects in Time Series , 1993 .

[5]  Susan P. Williams,et al.  Data quality and the Internet of Things , 2019, Computing.

[6]  Marimuthu Palaniswami,et al.  Missing Data Imputation With Bayesian Maximum Entropy for Internet of Things Applications , 2021, IEEE Internet of Things Journal.

[7]  Kin K. Leung,et al.  Toward QoI and Energy-Efficiency in Internet-of-Things Sensory Environments , 2014, IEEE Transactions on Emerging Topics in Computing.

[8]  Stephen D. Clark,et al.  Detection of Outliers in Time Series. , 1991 .

[9]  Diane M. Strong,et al.  Data quality in context , 1997, CACM.

[10]  E. Rivero Cornelio Bases de datos relacionales , 1988 .

[11]  Mani B. Srivastava,et al.  Building principles for a quality of information specification for sensor information , 2009, 2009 12th International Conference on Information Fusion.

[12]  Antonio F. Gómez-Skarmeta,et al.  IoT for Water Management: Towards Intelligent Anomaly Detection , 2019, 2019 IEEE 5th World Forum on Internet of Things (WF-IoT).

[13]  Carl Lagoze,et al.  Big Data, data integrity, and the fracturing of the control zone , 2014, Big Data Soc..

[14]  Amy J. Ruggles,et al.  An Experimental Comparison of Ordinary and Universal Kriging and Inverse Distance Weighting , 1999 .

[15]  Ralf Tönjes,et al.  Valid.IoT: a framework for sensor data quality analysis and interpolation , 2018, MMSys.