Imputing the missing data in IoT based on the spatial and temporal correlation

Missing data in IoT occurs due to a variety of reasons such as unstable network communication, synchronization problems, unreliable sensor devices, environmental factors and other device malfunctions which often resulted in data incompleteness. Missing data imputation is the most common preprocessing task to dealing with incomplete data. Though missing data is common in IoT but missing data imputation is hardly seen in the IoT environment. As a result, when analytics is performed on IoT data with missing values, it leads to the decline in accuracy and reliability of the data analysis results. In this paper, a novel ST-correlated proximate missing data imputation model is proposed to deal with missing data problem in IoT. Experimental results have proved that the proposed method has outperformed the existing single imputation and multiple imputation statistical methods in terms of accuracy.

[1]  Vadlamani Ravi,et al.  Data imputation via evolutionary computation, clustering and a neural network , 2015, Neurocomputing.

[2]  Xiaojun Ren,et al.  A New Estimation Model for Wireless Sensor Networks Based on the Spatial-Temporal Correlation Analysis , 2015, J. Inform. and Commun. Convergence Engineering.

[3]  Mickael Guedj,et al.  A Comparison of Six Methods for Missing Data Imputation , 2015 .

[4]  Doreswamy,et al.  Performance Analysis Of Neural Network Models For Oxazolines And Oxazoles Derivatives Descriptor Dataset , 2013, ArXiv.

[5]  Dave Evans,et al.  How the Next Evolution of the Internet Is Changing Everything , 2011 .

[6]  Muhammad Tayyab Asif,et al.  Low-dimensional models for missing data imputation in road networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[7]  Sudhir T. Bagade,et al.  Internet of Things: Architecture, Security Issues and Countermeasures , 2015 .

[8]  Md Zahidul Islam,et al.  FIMUS: A framework for imputing missing values using co-appearance, correlation and similarity analysis , 2014, Knowl. Based Syst..

[9]  Cem Iyigun,et al.  Comparison of missing value imputation methods in time series: the case of Turkish meteorological data , 2013, Theoretical and Applied Climatology.

[10]  Lynne E. Parker,et al.  Nearest neighbor imputation using spatial-temporal correlations in wireless sensor networks , 2014, Inf. Fusion.

[11]  C. Y. Peng,et al.  Principled missing data methods for researchers , 2013, SpringerPlus.

[12]  Bing Yu,et al.  Missing data analyses: a hybrid multiple imputation algorithm using Gray System Theory and entropy based on clustering , 2013, Applied Intelligence.

[13]  Zhipeng Gao,et al.  A Missing Sensor Data Estimation Algorithm Based on Temporal and Spatial Correlation , 2015, Int. J. Distributed Sens. Networks.

[14]  D. Bennett How can I deal with missing data in my study? , 2001, Australian and New Zealand journal of public health.

[15]  Nan Jiang,et al.  Estimating Missing Data in Data Streams , 2007, DASFAA.