Testing Non-uniform k-Wise Independent Distributions over Product Spaces

A distribution D over Σ1 × ... × Σn is called (non-uniform) k-wise independent if for any set of k indices {i1,..., ik} and for any z1...zk ∈ Σi1×...×Σik, PrX-D[Xi1...Xik = z1 ... zk] = PrX-D[Xi1 = z1] ... PrX-D[Xik = zk]. We study the problem of testing (non-uniform) k-wise independent distributions over product spaces. For the uniform case we show an upper bound on the distance between a distribution D from the set of k-wise independent distributions in terms of the sum of Fourier coefficients of D at vectors of weight at most k. Such a bound was previously known only for the binary field. For the non-uniform case, we give a new characterization of distributions being k-wise independent and further show that such a characterization is robust. These greatly generalize the results of Alon et al. [1] on uniform k-wise independence over the binary field to non-uniform k-wise independence over product spaces. Our results yield natural testing algorithms for k-wise independence with time and sample complexity sublinear in terms of the support size when k is a constant. The main technical tools employed include discrete Fourier transforms and the theory of linear systems of congruences.

[1]  E. Fischer THE ART OF UNINFORMED DECISIONS: A PRIMER TO PROPERTY TESTING , 2004 .

[2]  Dana Ron,et al.  Property testing and its connection to learning and approximation , 1998, JACM.

[3]  Manuel Blum,et al.  Self-testing/correcting with applications to numerical problems , 1990, STOC '90.

[4]  Noga Alon,et al.  Almost k-wise independence versus k-wise independence , 2003, Information Processing Letters.

[5]  Hanno Lefmann,et al.  MOD p -tests, almost independence and small probability spaces , 2000 .

[6]  Eric Blais Testing juntas nearly optimally , 2009, STOC '09.

[7]  Ronitt Rubinfeld,et al.  Algorithms column: sublinear time algorithms , 2003, SIGA.

[8]  Noga Alon,et al.  Simple construction of almost k-wise independent random variables , 1990, Proceedings [1990] 31st Annual Symposium on Foundations of Computer Science.

[9]  H. Smith I. On systems of linear indeterminate equations and congruences , 1862, Proceedings of the Royal Society of London.

[10]  A. Terras Fourier Analysis on Finite Groups and Applications: Index , 1999 .

[11]  Ronitt Rubinfeld,et al.  Sublinear algorithms for testing monotone and unimodal distributions , 2004, STOC '04.

[12]  A. Joffe On a Set of Almost Deterministic $k$-Independent Random Variables , 1974 .

[13]  Dana Ron,et al.  On Testing Expansion in Bounded-Degree Graphs , 2000, Studies in Complexity and Cryptography.

[14]  Vince Grolmusz,et al.  Superpolynomial Size Set-systems with Restricted Intersections mod 6 and Explicit Ramsey Graphs , 2000, Comb..

[15]  Richard M. Karp,et al.  A fast parallel algorithm for the maximal independent set problem , 1985, JACM.

[16]  V. B. Uvarov,et al.  Classical Orthogonal Polynomials of a Discrete Variable , 1991 .

[17]  Yossi Azar,et al.  Approximating Probability Distributions Using Small Sample Spaces , 1998, Comb..

[18]  Daniel Stefankovic Fourier Transforms in Computer Science , 2000 .

[19]  Nimrod Megiddo,et al.  Constructing small sample spaces satisfying given constraints , 1993, SIAM J. Discret. Math..

[20]  Elchanan Mossel,et al.  Gaussian Bounds for Noise Correlation of Functions and Tight Analysis of Long Codes , 2008, 2008 49th Annual IEEE Symposium on Foundations of Computer Science.

[21]  Y. Mansour,et al.  On construction of k-wise independent random variables , 1994, STOC '94.

[22]  SahaiAmit,et al.  A complete problem for statistical zero knowledge , 2003 .

[23]  Artur Czumaj,et al.  Sublinear-Time Algorithms , 2006, Bull. EATCS.

[24]  Michael Luby,et al.  A simple parallel algorithm for the maximal independent set problem , 1985, STOC '85.

[25]  Michael Luby Removing Randomness in Parallel Computation without a Processor Penalty , 1993, J. Comput. Syst. Sci..

[26]  Dana Ron,et al.  Strong Lower Bounds for Approximating Distribution Support Size and the Distinct Elements Problem , 2007, 48th Annual IEEE Symposium on Foundations of Computer Science (FOCS'07).

[27]  O. Svensson,et al.  Inapproximability Results for Sparsest Cut, Optimal Linear Arrangement, and Precedence Constrained Scheduling , 2007, FOCS 2007.

[28]  Ronitt Rubinfeld,et al.  Robust Characterizations of Polynomials with Applications to Program Testing , 1996, SIAM J. Comput..

[29]  J. Silvester Determinants of block matrices , 2000, The Mathematical Gazette.

[30]  Oded Goldreich,et al.  Efficient approximation of product distributions , 1998 .

[31]  Sergey Yekhanin,et al.  Towards 3-query locally decodable codes of subexponential length , 2008, JACM.

[32]  Noga Alon,et al.  Simple Construction of Almost k-wise Independent Random Variables , 1992, Random Struct. Algorithms.

[33]  Noga Alon,et al.  A Fast and Simple Randomized Parallel Algorithm for the Maximal Independent Set Problem , 1985, J. Algorithms.

[34]  Noga Alon,et al.  Testing k-wise and almost k-wise independence , 2007, STOC '07.

[35]  Sanguthevar Rajasekaran Handbook of randomized computing , 2001 .

[36]  G. Hardy,et al.  An Introduction to the Theory of Numbers , 1938 .

[37]  Hanno Lefmann,et al.  MODp-tests, almost independence and small probability spaces , 2000, Random Struct. Algorithms.

[38]  Ronitt Rubinfeld,et al.  Testing that distributions are close , 2000, Proceedings 41st Annual Symposium on Foundations of Computer Science.

[39]  Ronitt Rubinfeld,et al.  Testing random variables for independence and identity , 2001, Proceedings 2001 IEEE International Conference on Cluster Computing.

[40]  Per Austrin Conditional Inapproximability and Limited Independence , 2008 .

[41]  C. Neuman,et al.  Discrete (Legendre) orthogonal polynomials—a survey , 1974 .

[42]  Oded Goldreich,et al.  On the power of two-point based sampling , 1989, J. Complex..

[43]  Klim Efremenko,et al.  3-Query Locally Decodable Codes of Subexponential Length , 2008 .

[44]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[45]  Moni Naor,et al.  Small-Bias Probability Spaces: Efficient Constructions and Applications , 1993, SIAM J. Comput..

[46]  Rocco A. Servedio,et al.  Testing for Concise Representations , 2007, 48th Annual IEEE Symposium on Foundations of Computer Science (FOCS'07).