An efficient data aggregation and outlier detection scheme based on radial basis function neural network for WSN

Wireless sensor network (WSN) is used for data collection and transmission in IoT environment. Since it consists of a large number of sensor nodes, a significant amount of redundant data and outliers are generated which substantially deteriorate the network performance. Data aggregation is needed to reduce energy consumption and prolong the lifetime of WSN. In this paper a novel data aggregation scheme is proposed which is based on modified radial basis function neural network to classify the collected data at cluster head and eliminate the redundant data and outliers. Additionally, cosine similarity is used to cluster the nodes having the most similar data. The radial basis function (RBF) is adapted by Mahalanobis distance to support the outlier’s detection and analysis in the multivariate data. The data collected from the sensor node at the cluster head are processed by mahalanbis distance-based radial basis function neural network (MDRBF-NN) before transferred to the based station. Extensive computer simulation with real datasets shows that the proposed scheme consistently outperforms the existing representative data aggregation schemes in terms of data classification, outlier detection, and energy efficiency.

[1]  Simon Haykin,et al.  Neural Networks and Learning Machines , 2010 .

[2]  Xiaoqing Cui,et al.  Dynamic heartbeat detection algorithm based on RBFNN , 2019 .

[3]  Hee Yong Youn,et al.  Statistical Multipath Queue-Wise Preemption Routing for ZigBee-Based WSN , 2018, Wirel. Pers. Commun..

[4]  Rama Shankar Yadav,et al.  Redundancy Elimination During Data Aggregation in Wireless Sensor Networks for IoT Systems , 2019 .

[5]  Kay Römer,et al.  An Adaptive Strategy for Quality-Based Data Reduction in Wireless Sensor Networks , 2006 .

[6]  T. Fearn,et al.  A note on Mahalanobis and related distance measures in WinISI and The Unscrambler , 2019, Journal of Near Infrared Spectroscopy.

[7]  Leszek Rutkowski,et al.  Basic Concepts of Data Stream Mining , 2019, Studies in Big Data.

[8]  Jian Pei,et al.  An Energy-Efficient Data Collection Framework for Wireless Sensor Networks by Exploiting Spatiotemporal Correlation , 2007, IEEE Transactions on Parallel and Distributed Systems.

[9]  Sajal K. Das,et al.  A Middleware Framework for Ambiguous Context Mediation in Smart Healthcare Application , 2007 .

[10]  Jiming Liu,et al.  Estimating the sample mean and standard deviation from the sample size, median, range and/or interquartile range , 2014, BMC Medical Research Methodology.

[11]  Xianbin Wang,et al.  Temporal and spatial correlation based distributed fault detection in wireless sensor networks , 2015, 2015 IEEE 28th Canadian Conference on Electrical and Computer Engineering (CCECE).

[12]  Hee Yong Youn,et al.  A novel data aggregation scheme based on self-organized map for WSN , 2018, The Journal of Supercomputing.

[13]  Yang Xiao,et al.  Polynomial Regression Based Secure Data Aggregation for Wireless Sensor Networks , 2011, 2011 IEEE Global Telecommunications Conference - GLOBECOM 2011.

[14]  Quanzhong Li,et al.  An Efficient Data Compression Model Based on Spatial Clustering and Principal Component Analysis in Wireless Sensor Networks , 2015, Sensors.

[15]  Arun Kumar Sangaiah,et al.  Selection of Machine Learning Techniques for Network Lifetime Parameters and Synchronization Issues in Wireless Networks , 2019, J. Inf. Process. Syst..

[16]  Y. Zhang,et al.  Active and dynamic information fusion for multisensor systems with dynamic bayesian networks , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[17]  Daniel Zwillinger,et al.  CRC Standard Probability and Statistics Tables and Formulae, Student Edition , 1999 .

[18]  Wei Bo,et al.  A Pseudo LEACH Algorithm for Wireless Sensor Networks. , 2007 .

[19]  Cheng Li,et al.  Distributed Data Aggregation Using Slepian–Wolf Coding in Cluster-Based Wireless Sensor Networks , 2010, IEEE Transactions on Vehicular Technology.

[20]  B. Kröse,et al.  Bayesian Activity Recognition in Residence for Elders , 2007 .

[21]  Manjeet Singh,et al.  Fuzzy based novel clustering technique by exploiting spatial correlation in wireless sensor network , 2018, J. Ambient Intell. Humaniz. Comput..

[22]  Ying Wang,et al.  Automatic ARIMA modeling-based data aggregation scheme in wireless sensor networks , 2013, EURASIP Journal on Wireless Communications and Networking.

[23]  Kavi Khedo,et al.  READA: Redundancy Elimination for Accurate Data Aggregation in Wireless Sensor Networks , 2010, Wirel. Sens. Netw..

[24]  Hee Yong Youn,et al.  Efficient data aggregation with node clustering and extreme learning machine for WSN , 2020, The Journal of Supercomputing.

[25]  Charu C. Aggarwal,et al.  Outlier Analysis , 2013, Springer New York.

[26]  Fei Yuan,et al.  Data Density Correlation Degree Clustering Method for Data Aggregation in WSN , 2014, IEEE Sensors Journal.

[27]  Philip S. Yu,et al.  ASAP: An Adaptive Sampling Approach to Data Collection in Sensor Networks , 2007, IEEE Transactions on Parallel and Distributed Systems.

[28]  Shigeng Zhang,et al.  Outlier Detection Techniques for Localization in Wireless Sensor Networks: A Survey , 2015 .

[29]  S. Jayashri,et al.  An optimal mobile data gathering in small scale WSN by power saving adaptive clustering techniques , 2020, Journal of Ambient Intelligence and Humanized Computing.

[30]  Hong Chen,et al.  The optimized clustering technique based on spatial-correlation in wireless sensor networks , 2009, 2009 IEEE Youth Conference on Information, Computing and Telecommunication.

[31]  Wei Fang,et al.  CSDA: a novel cluster-based secure data aggregation scheme for WSNs , 2019, Cluster Computing.

[32]  M.N.S. Swamy,et al.  Neural Networks and Statistical Learning , 2013 .

[33]  Punyasha Chatterjee,et al.  Energy-Efficient Connected Target Coverage in Multi-hop Wireless Sensor Networks , 2018 .

[34]  Ramón García-Martínez,et al.  Outlier detection in audit logs for application systems , 2014, Inf. Syst..

[35]  Tahir Mahmood,et al.  The cosine similarity measures of spherical fuzzy sets and their applications in decision making , 2019, J. Intell. Fuzzy Syst..

[36]  Kohei Adachi,et al.  Matrix-Based Introduction to Multivariate Data Analysis , 2016 .

[37]  Stuart J. Russell,et al.  Dynamic bayesian networks: representation, inference and learning , 2002 .

[38]  Giuseppe Lo Re,et al.  Adaptive Distributed Outlier Detection for WSNs , 2015, IEEE Transactions on Cybernetics.

[39]  Azzedine Boukerche,et al.  An energy-aware spatio-temporal correlation mechanism to perform efficient data collection in wireless sensor networks , 2013, Comput. Commun..

[40]  Di Bai,et al.  Maximum Data Collection Rate Routing Protocol Based on Topology Control for Rechargeable Wireless Sensor Networks , 2016, Sensors.

[41]  Marco Morana,et al.  User detection through multi-sensor fusion in an AmI scenario , 2012, 2012 15th International Conference on Information Fusion.

[42]  D. N. Kashid,et al.  Subset selection in multiple linear regression in the presence of outlier and multicollinearity , 2014 .

[43]  Ivan Nunes da Silva,et al.  Radial Basis Function Networks , 2017 .

[44]  Wen-Tsai Sung Employed BPN to Multi-sensors Data Fusion for Environment Monitoring Services , 2009, ATC.

[45]  K. K. Pattanaik,et al.  Contextual outlier detection for wireless sensor networks , 2020, J. Ambient Intell. Humaniz. Comput..

[46]  Faiza Nawaz,et al.  Efficient data delivery in dense reader environment of passive sensor network , 2020, J. Ambient Intell. Humaniz. Comput..