Testing monotone high-dimensional distributions

A <i>monotone distribution</i> <i>P</i> over a (partially) ordered domain has <i>P</i>(<i>y</i>) ≥ <i>P</i>(<i>x</i>) if <i>y</i> ≥ <i>x</i> in the order. We study several natural problems of testing properties of monotone distributions over the <i>n</i>-dimensional Boolean cube, given access to random draws from the distribution being tested. We give a poly(<i>n</i>)-time algorithm for testing whether a monotone distribution is equivalent to or ε-far (in the <i>L</i><inf>1</inf> norm) from the uniform distribution. A key ingredient of the algorithm is a generalization of a known isoperimetric inequality for the Boolean cube. We also introduce a method for proving lower bounds on testing monotone distributions over the <i>n</i>-dimensional Boolean cube, based on a new decomposition technique for monotone distributions. We use this method to show that our uniformity testing algorithm is optimal up to polylog(<i>n</i>) factors, and also to give exponential lower bounds on the complexity of several other problems (testing whether a monotone distribution is identical to or ε-far from a fixed known monotone product distribution and approximating the entropy of an unknown monotone distribution).

[1]  Ronitt Rubinfeld,et al.  Testing that distributions are close , 2000, Proceedings 41st Annual Symposium on Foundations of Computer Science.

[2]  Sean W. Smith,et al.  Improved learning of AC0 functions , 1991, COLT '91.

[3]  W. Cary Huffman,et al.  Fundamentals of Error-Correcting Codes , 1975 .

[4]  Umesh V. Vazirani,et al.  An Introduction to Computational Learning Theory , 1994 .

[5]  Rocco A. Servedio,et al.  On learning monotone DNF under product distributions , 2001, Inf. Comput..

[6]  Ronitt Rubinfeld,et al.  The complexity of approximating entropy , 2002, STOC '02.

[7]  Andrew Chi-Chih Yao,et al.  Probabilistic computations: Toward a unified measure of complexity , 1977, 18th Annual Symposium on Foundations of Computer Science (sfcs 1977).

[8]  Ronitt Rubinfeld,et al.  Testing random variables for independence and identity , 2001, Proceedings 2001 IEEE International Conference on Cluster Computing.

[9]  Noam Nisan,et al.  Constant depth circuits, Fourier transform, and learnability , 1989, 30th Annual Symposium on Foundations of Computer Science.

[10]  D. Aldous On the Markov Chain Simulation Method for Uniform Combinatorial Distributions and Simulated Annealing , 1987, Probability in the Engineering and Informational Sciences.

[11]  Ronitt Rubinfeld,et al.  Sublinear algorithms for testing monotone and unimodal distributions , 2004, STOC '04.

[12]  Seshadhri Comandur,et al.  Testing Expansion in Bounded Degree Graphs , 2007, Electron. Colloquium Comput. Complex..

[13]  Rajeev Motwani,et al.  Randomized algorithms , 1996, CSUR.