Consensus-Based Data Statistics in Distributed Network Systems

Data has become increasingly important in network systems because a lot of data is needed in new technologies such as machine learning. To obtain statistics (e.g. maximum, average, and distribution) in a fully distributed way with low complexity is challenging. Existing research on consensus algorithms can successfully obtain the max/min, average and median in a distributed network, but little work has been done on how to compute other statistics, especially probability density function (PDF). In this paper, consensus-based algorithms are proposed to obtain PDF in a fully distributed way with low complexity. The key idea of our algorithms is to divide the range of nodes' values into several sections and calculate the proportions of values in each section in a fully distributed way. If nodes have their unique identifications (IDs), repeatedly run max/min consensus algorithm in the network to reach the partially max/min values and then erase them in order to reach all values exactly once. We prove that the algorithm converges in finite time. When nodes' IDs are not available, the main challenge is to solve the conflicts when two or more nodes have the same value. We propose an asymptotically converged algorithm to solve the problem in this scenario.

[1]  Wenwu Yu,et al.  An Overview of Recent Progress in the Study of Distributed Multi-Agent Coordination , 2012, IEEE Transactions on Industrial Informatics.

[2]  Richard M. Murray,et al.  Consensus problems in networks of agents with switching topology and time-delays , 2004, IEEE Transactions on Automatic Control.

[3]  Ling Shi,et al.  SATS: Secure Average-Consensus-Based Time Synchronization in Wireless Sensor Networks , 2013, IEEE Transactions on Signal Processing.

[4]  Ling Shi,et al.  Distributed Privacy-Preserving Data Aggregation Against Dishonest Nodes in Network Systems , 2019, IEEE Internet of Things Journal.

[5]  Xindong Wu,et al.  Data mining with big data , 2014, IEEE Transactions on Knowledge and Data Engineering.

[6]  Stephen P. Boyd,et al.  Fast linear iterations for distributed averaging , 2003, 42nd IEEE International Conference on Decision and Control (IEEE Cat. No.03CH37475).

[7]  Ling Shi,et al.  Time Synchronization in WSNs: A Maximum-Value-Based Consensus Approach , 2014, IEEE Transactions on Automatic Control.

[8]  Guangming Xie,et al.  Average consensus in networks of dynamic agents with switching topologies and multiple time-varying delays , 2008, Syst. Control. Lett..

[9]  Stephen P. Boyd,et al.  A scheme for robust distributed sensor fusion based on average consensus , 2005, IPSN 2005. Fourth International Symposium on Information Processing in Sensor Networks, 2005..

[10]  Stephen P. Boyd,et al.  Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers , 2011, Found. Trends Mach. Learn..

[11]  Yilin Mo,et al.  Privacy Preserving Maximum Consensus , 2015, 2015 54th IEEE Conference on Decision and Control (CDC).

[12]  Jiming Chen,et al.  Distributed Privacy-Aware Fast Selection Algorithm for Large-Scale Data , 2018, IEEE Transactions on Parallel and Distributed Systems.

[13]  John N. Tsitsiklis,et al.  Convergence Speed in Distributed Consensus and Averaging , 2009, SIAM J. Control. Optim..

[14]  Alessandro Giua,et al.  Finite-Time Consensus on the Median Value With Robustness Properties , 2017, IEEE Transactions on Automatic Control.

[15]  Yunhao Liu,et al.  Big Data: A Survey , 2014, Mob. Networks Appl..

[16]  Murti V Salapaka,et al.  Distributed protocol for determining when averaging consensus is reached , 2007 .

[17]  Anand D. Sarwate,et al.  Distributed Learning of Distributions via Social Sampling , 2013, IEEE Transactions on Automatic Control.

[18]  Philippe Ciblat,et al.  Analysis of Max-Consensus Algorithms in Wireless Channels , 2012, IEEE Transactions on Signal Processing.

[19]  Stephen P. Boyd,et al.  Distributed Average Consensus with Time-Varying Metropolis Weights ? , 2006 .