Set reconciliation in multi-node environment
Set reconciliation between nodes without prior context is an interesting problem with numerous applications. Finding the difference between two sets forms the basis of both set reconciliation and de-duplication. As the number of devices spread across diverse networks that must be kept in sync increases, the cost of communication increases proportionately. In this paper, I propose a scheme that is efficient in terms of communication cost for reconciling distributed data across all nodes, none of which has any prior context about the others.
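To make the role of the set difference concrete, the following is a minimal sketch of a naive two-node reconciliation. It is not the scheme proposed in the paper: the function names `set_difference` and `reconcile` and the sample keys are hypothetical, and the sketch assumes each node can see the other's full set, which is the communication-heavy baseline that more efficient schemes aim to avoid.

```python
def set_difference(local, remote):
    """Return the elements unique to each side: (only_local, only_remote)."""
    return local - remote, remote - local


def reconcile(a, b):
    """Naive baseline: each node adds the elements it is missing.

    Assumption: both full sets are available for comparison, so the data
    exchanged grows with the set sizes, not with the size of the difference.
    """
    only_a, only_b = set_difference(a, b)
    return a | only_b, b | only_a


# Hypothetical contents of two nodes with no prior context of each other.
node_a = {"k1", "k2", "k3"}
node_b = {"k2", "k3", "k4", "k5"}

only_a, only_b = set_difference(node_a, node_b)
synced_a, synced_b = reconcile(node_a, node_b)
```

After the call, both nodes hold the union of the two original sets. Only the elements in `only_a` and `only_b` actually needed to move, which is why the cost of finding the difference, rather than the cost of transferring it, is the part worth optimizing.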