MBF: a Real Matrix Bloom Filter Representation Method on Dynamic Set

Bloom filter is a simple space-efficient randomized data structure for representing a set in order to support membership queries, which uses an m-bit array to represent a data set. In order to support representing dynamic set, dynamic bloom filter (DBF) and split bloom filter (SBF) have been developed. Both DBF and SBF can support concisely representation and approximate membership queries of dynamic set instead of static set. SBF declares that it uses an s Xm bit matrix that consists of s bloom filters to represent a dynamic set, so DBF dose. But in fact, both the two bloom filters are not matrix representation method at all. They are just a set of s bloom filters whose length is m, and they have got a departure from the original idea of bloom filter: the constant query time cost. This paper points out the fact, and then introduce a truly matrix representation method of bloom filter to represent a dynamic set. We call it the matrix bloom filter (MBF). Then, we analyze the algorithm of MBF and study the average time complexity and the false positive probability.

[1]  Xiao Ming Split Bloom Filter , 2004 .

[2]  Li Fan,et al.  Summary cache: a scalable wide-area web cache sharing protocol , 2000, TNET.

[3]  Abhishek Kumar,et al.  Space-code bloom filter for efficient per-flow traffic measurement , 2004, IEEE INFOCOM 2004.

[4]  Jie Wu,et al.  Theory and Network Applications of Dynamic Bloom Filters , 2006, Proceedings IEEE INFOCOM 2006. 25TH IEEE International Conference on Computer Communications.

[5]  Mingzhong Xiao,et al.  i-DBF: an Improved Bloom Filter Representation Method on Dynamic Set , 2006, 2006 Fifth International Conference on Grid and Cooperative Computing Workshops.

[6]  Yossi Matias,et al.  Spectral bloom filters , 2003, SIGMOD '03.

[7]  Michael Mitzenmacher,et al.  Compressed bloom filters , 2001, PODC '01.

[8]  A. Kumar,et al.  Space-code bloom filter for efficient per-flow traffic measurement , 2004, IEEE INFOCOM 2004.

[9]  Burton H. Bloom,et al.  Space/time trade-offs in hash coding with allowable errors , 1970, CACM.