On Active and Passive Testing

Given a property of Boolean functions, what is the minimum number of queries required to determine with high probability if an input function satisfies this property? This is a fundamental question in Property Testing, where traditionally the testing algorithm is allowed to pick its queries among the entire set of inputs. Balcan et al. have recently suggested to restrict the tester to take its queries from a smaller, typically random, subset of the inputs. This model is called active testing, in resemblance of active learning. Active testing gets more dicult as the size of the set we can query from decreases, and the extreme case is when it is exactly the number of queries we perform (so the algorithm actually has no choice). This is known as passive testing, or testing from random examples. In their paper, Balcan et al. have shown that active and passive testing of dictator functions is as hard as learning them, and requires (log n) queries (unlike the classic model, in which it can be done in a constant number of queries). We extend this result to k-linear functions, proving that passive and active testing of them requires ( k logn) queries, assuming k is not too large. Other classes of functions we consider are juntas, partially symmetric functions, linear functions, and low degree polynomials. For linear functions we provide tight bounds on the query complexity in both active and passive models (which asymptotically dier). The analysis for low degree polynomials is less complete and the exact query complexity is given only for passive testing. In both these cases, the query complexity for passive testing is essentially equivalent to that of learning. For juntas and partially symmetric functions, that is, functions that depend on a small number of variables and potentially also on the Hamming weight of the input, we provide some lower and upper bounds for the dierent models. When the functions depend on a constant number of variables, our analysis for both families is asymptotically tight. Moreover, the family of partially symmetric functions is the first example for which the query complexities in all these models are asymptotically dierent. Our methods combine algebraic, combinatorial, and probabilistic techniques, including the Talagrand concentration inequality and the Erdfis‐Rado results on -systems.

[1]  Guy Kindler,et al.  Testing juntas , 2002, J. Comput. Syst. Sci..

[2]  Allan Sly,et al.  Cutoff phenomena for random walks on random regular graphs , 2008, 0812.0060.

[3]  Yuichi Yoshida,et al.  Partially Symmetric Functions Are Efficiently Isomorphism-Testable , 2012, 2012 IEEE 53rd Annual Symposium on Foundations of Computer Science.

[4]  Dana Ron,et al.  Property testing and its connection to learning and approximation , 1996, Proceedings of 37th Conference on Foundations of Computer Science.

[5]  Madhu Sudan,et al.  Optimal Testing of Reed-Muller Codes , 2009, 2010 IEEE 51st Annual Symposium on Foundations of Computer Science.

[6]  N. Alon,et al.  The Probabilistic Method: Alon/Probabilistic , 2008 .

[7]  Shachar Lovett,et al.  Random low-degree polynomials are hard to approximate , 2009, computational complexity.

[8]  Eric Blais Improved Bounds for Testing Juntas , 2008, APPROX-RANDOM.

[9]  Dana Ron,et al.  Testing Problems with Sublearning Sample Complexity , 2000, J. Comput. Syst. Sci..

[10]  Daniel M. Kane,et al.  Tight Bounds for Testing k-Linearity , 2012, APPROX-RANDOM.

[11]  Claude E. Shannon,et al.  The synthesis of two-terminal switching circuits , 1949, Bell Syst. Tech. J..

[12]  B. Reed Graph Colouring and the Probabilistic Method , 2001 .

[13]  Noga Alon,et al.  Testing Low-Degree Polynomials over GF(2( , 2003, RANDOM-APPROX.

[14]  Maria-Florina Balcan,et al.  Active Property Testing , 2011, 2012 IEEE 53rd Annual Symposium on Foundations of Computer Science.

[15]  Dana Ron,et al.  Property Testing: A Learning Theory Perspective , 2007, COLT.

[16]  Eric Blais Testing juntas nearly optimally , 2009, STOC '09.

[17]  Leslie G. Valiant,et al.  A theory of the learnable , 1984, STOC '84.

[18]  Hana Chockler,et al.  A lower bound for testing juntas , 2004, Inf. Process. Lett..

[19]  Noga Alon,et al.  The Probabilistic Method , 2015, Fundamentals of Ramsey Theory.

[20]  Ilias Diakonikolas,et al.  Testing for Concise Representations , 2007, FOCS 2007.

[21]  Joshua Brody,et al.  Property Testing Lower Bounds via Communication Complexity , 2011, 2011 IEEE 26th Annual Conference on Computational Complexity.

[22]  Mihir Bellare,et al.  Linearity testing in characteristic two , 1996, IEEE Trans. Inf. Theory.

[23]  Manuel Blum,et al.  Self-testing/correcting with applications to numerical problems , 1990, STOC '90.

[24]  David A. Cohn,et al.  Improving generalization with active learning , 1994, Machine Learning.

[25]  H. L. Abbott,et al.  Intersection Theorems for Systems of Sets , 1972, J. Comb. Theory, Ser. A.

[26]  O. Svensson,et al.  Inapproximability Results for Sparsest Cut, Optimal Linear Arrangement, and Precedence Constrained Scheduling , 2007, FOCS 2007.

[27]  Ronitt Rubinfeld,et al.  Robust Characterizations of Polynomials with Applications to Program Testing , 1996, SIAM J. Comput..

[28]  Carsten Lund,et al.  Nondeterministic exponential time has two-prover interactive protocols , 1990, Proceedings [1990] 31st Annual Symposium on Foundations of Computer Science.

[29]  M. Talagrand Concentration of measure and isoperimetric inequalities in product spaces , 1994, math/9406212.

[30]  Aaron D. Wyner,et al.  The Synthesis of TwoTerminal Switching Circuits , 1993 .

[31]  Eldar Fischer,et al.  Junto-Symmetric Functions, Hypergraph Isomorphism and Crunching , 2012, 2012 IEEE 27th Conference on Computational Complexity.