On (ε,k)-min-wise independent permutations

A family of permutations of $\cal{F}$ [n] = l1,2,…,nr is (e,k)-min-wise independent if for every nonempty subset X of at most k elements of [n], and for any x ∈ X, the probability that in a random element π of $\cal{F}$, π(x) is the minimum element of π(X), deviates from 1/mXm by at most e/mXm. This notion can be defined for the uniform case, when the elements of $\cal{F}$ are picked according to a uniform distribution, or for the more general, biased case, in which the elements of $\cal{F}$ are chosen according to a given distribution D. It is known that this notion is a useful tool for indexing replicated documents on the web. We show that even in the more general, biased case, for all admissible k and e and all large n, the size of $\cal{F}$ must satisfy $$|{\cal{F}}| \ge \Omega \left({k \over \varepsilon^2\log(1/\varepsilon)} \log n\right),$$ as well as $$|{\cal{F}}| \ge \Omega \left({k^2 \over \varepsilon\log(1/\varepsilon)} \log n\right).$$ This improves the best known previous estimates even for the uniform case. © 2007 Wiley Periodicals, Inc. Random Struct. Alg., 2007

[1]  Andrei Z. Broder,et al.  On the resemblance and containment of documents , 1997, Proceedings. Compression and Complexity of SEQUENCES 1997 (Cat. No.97TB100171).

[2]  Piotr Indyk,et al.  A small approximately min-wise independent family of hash functions , 1999, SODA '99.

[3]  Alan M. Frieze,et al.  Min-Wise Independent Permutations , 2000, J. Comput. Syst. Sci..

[4]  Yoshinori Takei,et al.  On permutations with limited independence , 2000, SODA '00.

[5]  Aravind Srinivasan,et al.  Low Discrepancy Sets Yield Approximate Min-Wise Independent Permutation Families , 1999, RANDOM-APPROX.

[6]  S. A. Norin A Polynomial Lower Bound on the Size of a k-Min-Wise Independent Set of Permutations , 2003 .

[7]  Noga Alon,et al.  Problems and results in extremal combinatorics--I , 2003, Discret. Math..

[8]  Jirí Matousek,et al.  On restricted min‐wise independence of permutations , 2003, Random Struct. Algorithms.

[9]  Yoshinori Takei,et al.  A Nearly Linear Size 4-Min-Wise Independent Permutation Family by Finite Geometries , 2003, RANDOM-APPROX.

[10]  Yoshinori Takei,et al.  On the sample size of k-restricted min-wise independent permutations and other k-wise distributions , 2003, STOC '03.

[11]  N. Alon,et al.  Equilateral Sets in lpn , 2003 .

[12]  Ketan Mulmuley Randomized geometric algorithms and pseudorandom generators , 2005, Algorithmica.

[13]  Noga Alon,et al.  Perturbed Identity Matrices Have High Rank: Proof and Applications , 2009, Combinatorics, Probability and Computing.