Optimal bounds on approximation of submodular and XOS functions by juntas

We investigate the approximability of several classes of real-valued functions by functions of a small number of variables (juntas). Our main results are tight bounds on the number of variables required to approximate a function f : {0, 1}n → [0, 1] within l2-error e over the uniform distribution: If f is submodular, then it is e-close to a function of O(1/e2) variables. This is an exponential improvement over previously known results [1]. We note that Ω(1/e2) variables are necessary even for linear functions. If f is fractionally subadditive (XOS) it is e-close to a function of 2O(1/e2) variables. This result holds for all functions with low total l1-influence and is a real-valued analogue of Friedgut's theorem for boolean functions. We show that 2Ω(1/e) variables are necessary even for XOS functions. As applications of these results, we provide learning algorithms over the uniform distribution. For XOS functions, we give a PAC learning algorithm that runs in time 21/poly(∈)poly(n). For submodular functions we give an algorithm in the more demanding PMAC learning model [2] which requires a multiplicative (1 + γ) factor approximation with probability at least 1 - ∈ over the target distribution. Our uniform distribution algorithm runs in time 21/poly(γ∈)poly(n). This is the first algorithm in the PMAC model that can achieve a constant approximation factor arbitrarily close to 1 for all submodular functions (even over the uniform distribution). It relies crucially on our approximation by junta result. As follows from the lower bounds in [1] both of these algorithms are close to optimal. We also give applications for proper learning, testing and agnostic learning with value queries of these classes.

[1]  J. Bourgain On the distribution of the fourier spectrum of Boolean functions , 2002 .

[2]  Y. Rabani,et al.  Improved lower bounds for embeddings into L 1 , 2006, SODA 2006.

[3]  Ryan O'Donnell,et al.  Learning Monotone Decision Trees in Polynomial Time , 2007, SIAM J. Comput..

[4]  Jan Vondrák,et al.  Is Submodularity Testable? , 2010, Algorithmica.

[5]  F. Dunstan MATROIDS AND SUBMODULAR FUNCTIONS , 1976 .

[6]  Maria-Florina Balcan,et al.  Learning Valuation Functions , 2011, COLT.

[7]  Omer Reingold,et al.  DNF sparsification and a faster deterministic counting algorithm , 2012, 2012 IEEE 27th Conference on Computational Complexity.

[8]  Sofya Raskhodnikova,et al.  Learning pseudo-Boolean k-DNF and submodular functions , 2013, SODA.

[9]  Pravesh Kothari,et al.  Submodular functions are noise stable , 2012, SODA.

[10]  Vahab S. Mirrokni,et al.  Approximating submodular functions everywhere , 2009, SODA.

[11]  Colin McDiarmid,et al.  Concentration for self-bounding functions and an inequality of Talagrand , 2006 .

[12]  Jan Vondrák,et al.  A note on concentration of submodular functions , 2010, ArXiv.

[13]  Pravesh Kothari,et al.  Representation, Approximation and Learning of Submodular Functions Using Low-rank Decision Trees , 2013, COLT.

[14]  C. Guestrin,et al.  Near-optimal sensor placements: maximizing information while minimizing communication cost , 2006, 2006 5th International Conference on Information Processing in Sensor Networks.

[15]  Noam Nisan,et al.  On the degree of boolean functions as real polynomials , 1992, STOC '92.

[16]  S. Safra,et al.  On the hardness of approximating minimum vertex cover , 2005 .

[17]  Daniel Lehmann,et al.  Combinatorial auctions with decreasing marginal utilities , 2001, EC '01.

[18]  Assaf Naor,et al.  Boolean functions whose Fourier transform is concentrated on the first two levels , 2002, Adv. Appl. Math..

[19]  Jan Vondrák,et al.  Optimal Bounds on Approximation of Submodular and XOS Functions by Juntas , 2013, 2013 IEEE 54th Annual Symposium on Foundations of Computer Science.

[20]  Andreas Krause,et al.  Near-optimal sensor placements in Gaussian processes , 2005, ICML.

[21]  Jan Vondrák,et al.  Covering minimum spanning trees of random subgraphs , 2004, SODA '04.

[22]  Nathan Linial,et al.  The influence of variables on Boolean functions , 1988, [Proceedings 1988] 29th Annual Symposium on Foundations of Computer Science.

[23]  Nathan Linial,et al.  Collective coin flipping, robust voting schemes and minima of Banzhaf values , 1985, 26th Annual Symposium on Foundations of Computer Science (sfcs 1985).

[24]  Aaron Roth,et al.  Privately releasing conjunctions and the statistical query barrier , 2010, STOC '11.

[25]  Omer Reingold,et al.  DNF Sparsification and a Faster Deterministic Counting Algorithm , 2012, Computational Complexity Conference.

[26]  Rocco A. Servedio,et al.  On learning monotone DNF under product distributions , 2001, Inf. Comput..

[27]  Pravesh Kothari,et al.  Learning Coverage Functions , 2013, ArXiv.

[28]  S. Boucheron,et al.  A sharp concentration inequality with applications , 1999, Random Struct. Algorithms.

[29]  Ryan O'Donnell,et al.  On the Fourier tails of bounded functions over the discrete cube , 2006, STOC '06.

[30]  Tim Roughgarden,et al.  Sketching valuation functions , 2012, SODA.

[31]  László Lovász,et al.  Submodular functions and convexity , 1982, ISMP.

[32]  Ehud Friedgut,et al.  Boolean Functions With Low Average Sensitivity Depend On Few Coordinates , 1998, Comb..