Data Clustering Technique for In-Network Data Reduction in Wireless Sensor Network

In wireless sensor networks (WSNs), plenty of sensor nodes are typically deployed in the field to provide a long-term monitoring facility. These sensor nodes are usually collect a huge amount of data over time. Transmitting the huge data from the sensor nodes to a sink introduces a big challenge to the network due to energy constraint of the sensor nodes. Therefore, many research efforts have been carried out so far to design efficient data clustering techniques for WSNs. The main purpose of these techniques is to reduce the amount of data over the network while retaining their fundamental properties. This paper aims to develop a Histogram-based Data Clustering (HDC) technique at the cluster-head (CH) for in-network data reduction. The HDC groups the homogeneous data into clusters and then performs in-network data reduction by selecting the central values (instead of all data points) of each cluster. Simulations on real-world sensor data show that the proposed HDC can effectively reduce a significant amount of redundant data and outperform existing techniques.

[1]  Michele Magno,et al.  b+WSN: Smart beehive with preliminary decision tree analysis for agriculture and honey bee health monitoring , 2016, Comput. Electron. Agric..

[2]  Amer O. Abu Salem,et al.  Enhanced LEACH protocol for increasing a lifetime of WSNs , 2019, Personal and Ubiquitous Computing.

[3]  Athanasios V. Vasilakos,et al.  Hierarchical Data Aggregation Using Compressive Sensing (HDACS) in WSNs , 2015, ACM Trans. Sens. Networks.

[4]  Naixue Xiong,et al.  Data prediction, compression, and recovery in clustered wireless sensor networks for environmental monitoring applications , 2016, Inf. Sci..

[5]  Jing Gao,et al.  DLRDG: distributed linear regression-based hierarchical data gathering framework in wireless sensor network , 2012, Neural Computing and Applications.

[6]  Xianbin Wang,et al.  Recursive Principal Component Analysis-Based Data Outlier Detection and Sensor Data Aggregation in IoT Systems , 2017, IEEE Internet of Things Journal.

[7]  Sajal K. Das,et al.  An Adaptive Bayesian System for Context-Aware Data Fusion in Smart Environments , 2017, IEEE Transactions on Mobile Computing.

[8]  Marc Moonen,et al.  Distributed adaptive estimation of covariance matrix eigenvectors in wireless sensor networks with application to distributed PCA , 2014, Signal Process..

[9]  Abdallah Makhoul,et al.  Self-Adaptive Data Collection and Fusion for Health Monitoring Based on Body Sensor Networks , 2016, IEEE Transactions on Industrial Informatics.

[10]  Robert V. Brill,et al.  Applied Statistics and Probability for Engineers , 2004, Technometrics.

[11]  Janghoon Yang,et al.  Multivariated Bayesian Compressive Sensing in Wireless Sensor Networks , 2016, IEEE Sensors Journal.

[12]  Erkki Mäkinen,et al.  Task-oriented distributed data fusion in autonomous wireless sensor networks , 2015, Soft Comput..

[13]  Ying Wang,et al.  Automatic ARIMA modeling-based data aggregation scheme in wireless sensor networks , 2013, EURASIP Journal on Wireless Communications and Networking.

[14]  Guangjie Han,et al.  Concept drift detection for data stream learning based on angle optimized global embedding and principal component analysis in sensor networks , 2017, Comput. Electr. Eng..

[15]  Elisa Bertino,et al.  Sensor Network Provenance Compression Using Dynamic Bayesian Networks , 2017, ACM Trans. Sens. Networks.

[16]  Yair Be'ery,et al.  Decentralized estimation of regression coefficients in sensor networks , 2017, Digit. Signal Process..

[17]  Huazhong Yang,et al.  Blind Drift Calibration of Sensor Networks Using Sparse Bayesian Learning , 2016, IEEE Sensors Journal.

[18]  Eduardo Morgado,et al.  Scalable Data-Coupled Clustering for Large Scale WSN , 2015, IEEE Transactions on Wireless Communications.

[19]  Gerald Keller Statistics for Management and Economics: Abbreviated , 2003 .

[20]  David Laiymani,et al.  EK-means: A new clustering approach for datasets classification in sensor networks , 2019, Ad Hoc Networks.

[21]  Eduardo Morgado,et al.  Energy Efficiency and Quality of Data Reconstruction Through Data-Coupled Clustering for Self-Organized Large-Scale WSNs , 2016, IEEE Sensors Journal.

[22]  Hwee Pink Tan,et al.  Rate-Distortion Balanced Data Compression for Wireless Sensor Networks , 2016, IEEE Sensors Journal.

[23]  Lynne E. Parker,et al.  Nearest neighbor imputation using spatial-temporal correlations in wireless sensor networks , 2014, Inf. Fusion.

[24]  Julio Cesar Stacchini de Souza,et al.  Data Compression in Smart Distribution Systems via Singular Value Decomposition , 2017, IEEE Transactions on Smart Grid.

[25]  Xiao Lin,et al.  Online Bayesian Data Fusion in Environment Monitoring Sensor Networks , 2014 .

[26]  Raphaël Couturier,et al.  Tree-Based Data Aggregation Approach in Periodic Sensor Networks Using Correlation Matrix and Polynomial Regression , 2016, 2016 IEEE Intl Conference on Computational Science and Engineering (CSE) and IEEE Intl Conference on Embedded and Ubiquitous Computing (EUC) and 15th Intl Symposium on Distributed Computing and Applications for Business Engineering (DCABES).

[27]  José López Vicario,et al.  Data Aggregation and Principal Component Analysis in WSNs , 2016, IEEE Transactions on Wireless Communications.

[28]  Stathes Hadjiefthymiades,et al.  Advanced Principal Component-Based Compression Schemes for Wireless Sensor Networks , 2014, TOSN.