Paper information - Optimal bounds on approximation of submodular and XOS functions by juntas

Optimal bounds on approximation of submodular and XOS functions by juntas

We investigate the approximability of several classes of real-valued functions by functions of a small number of variables (juntas). Our main results are tight bounds on the number of variables required to approximate a function $f : \{0,1\}^n \to [0,1]$ within $\ell_2$-error $\epsilon$ over the uniform distribution:

If $f$ is submodular, then it is $\epsilon$-close to a function of $O(1/\epsilon^2)$ variables. This is an exponential improvement over previously known results [1]. We note that $\Omega(1/\epsilon^2)$ variables are necessary even for linear functions.

If $f$ is fractionally subadditive (XOS), it is $\epsilon$-close to a function of $2^{O(1/\epsilon^2)}$ variables. This result holds for all functions with low total $\ell_1$-influence and is a real-valued analogue of Friedgut's theorem for Boolean functions. We show that $2^{\Omega(1/\epsilon)}$ variables are necessary even for XOS functions.

As applications of these results, we provide learning algorithms over the uniform distribution. For XOS functions, we give a PAC learning algorithm that runs in time $2^{1/\mathrm{poly}(\epsilon)} \mathrm{poly}(n)$. For submodular functions, we give an algorithm in the more demanding PMAC learning model [2], which requires a multiplicative $(1 + \gamma)$ factor approximation with probability at least $1 - \epsilon$ over the target distribution. Our uniform distribution algorithm runs in time $2^{1/\mathrm{poly}(\gamma\epsilon)} \mathrm{poly}(n)$. This is the first algorithm in the PMAC model that can achieve a constant approximation factor arbitrarily close to 1 for all submodular functions (even over the uniform distribution). It relies crucially on our approximation by junta result. As follows from the lower bounds in [1], both of these algorithms are close to optimal. We also give applications for proper learning, testing, and agnostic learning with value queries of these classes.
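To make the notion of approximation by a junta concrete, the following is a minimal brute-force sketch, not the paper's construction. It assumes a small hypothetical coverage function (coverage functions are submodular) normalized to [0, 1]. For a chosen variable set J, it uses the conditional expectation of f given the coordinates in J, which is the best approximation in l2 norm among functions depending only on J, and reports the resulting l2 error under the uniform distribution. The names `SETS`, `coverage`, `junta_projection`, and `l2_error` are illustrative and do not come from the paper.

```python
from itertools import product

# Hypothetical toy instance: variable i "covers" the elements in SETS[i].
SETS = [{0, 1, 2, 3}, {2, 3, 4, 5}, {4, 5, 6, 7}, {0}, {1}, {7}]
N = len(SETS)
UNIVERSE_SIZE = 8


def coverage(x):
    """Normalized coverage function f: {0,1}^N -> [0,1] (submodular)."""
    covered = set()
    for bit, s in zip(x, SETS):
        if bit:
            covered |= s
    return len(covered) / UNIVERSE_SIZE


def junta_projection(f, n, J):
    """Return g = E[f(y) | y_J] as a dict keyed by assignments to J.

    Under the uniform distribution this is the best l2 approximation
    of f among functions that depend only on the variables in J.
    """
    J = list(J)
    sums, counts = {}, {}
    for x in product((0, 1), repeat=n):
        key = tuple(x[i] for i in J)
        sums[key] = sums.get(key, 0.0) + f(x)
        counts[key] = counts.get(key, 0) + 1
    return {key: sums[key] / counts[key] for key in sums}


def l2_error(f, n, J):
    """l2 distance between f and its projection onto the variables in J."""
    J = list(J)
    g = junta_projection(f, n, J)
    sq = sum(
        (f(x) - g[tuple(x[i] for i in J)]) ** 2
        for x in product((0, 1), repeat=n)
    )
    return (sq / 2 ** n) ** 0.5


# Error shrinks as the junta grows and vanishes once J contains every variable.
for J in ([], [0], [0, 1, 2], range(N)):
    print(list(J), l2_error(coverage, N, J))
```

The enumeration takes time exponential in n, so this only illustrates the definition of being $\epsilon$-close to a junta on tiny instances. It says nothing about how the paper selects the variables or proves its bounds.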

Vitaly Feldman | Jan Vondrák

[1] J. Bourgain. On the distribution of the Fourier spectrum of Boolean functions, 2002.

[2] Y. Rabani, et al. Improved lower bounds for embeddings into L1, 2006, SODA 2006.

[3] Ryan O'Donnell, et al. Learning Monotone Decision Trees in Polynomial Time, 2007, SIAM J. Comput.

[4] Jan Vondrák, et al. Is Submodularity Testable?, 2010, Algorithmica.

[5] F. Dunstan. Matroids and Submodular Functions, 1976.

[6] Maria-Florina Balcan, et al. Learning Valuation Functions, 2011, COLT.

[7] Omer Reingold, et al. DNF sparsification and a faster deterministic counting algorithm, 2012, 2012 IEEE 27th Conference on Computational Complexity.

[8] Sofya Raskhodnikova, et al. Learning pseudo-Boolean k-DNF and submodular functions, 2013, SODA.

[9] Pravesh Kothari, et al. Submodular functions are noise stable, 2012, SODA.

[10] Vahab S. Mirrokni, et al. Approximating submodular functions everywhere, 2009, SODA.

[11] Colin McDiarmid, et al. Concentration for self-bounding functions and an inequality of Talagrand, 2006.

[12] Jan Vondrák, et al. A note on concentration of submodular functions, 2010, ArXiv.

[13] Pravesh Kothari, et al. Representation, Approximation and Learning of Submodular Functions Using Low-rank Decision Trees, 2013, COLT.

[14] C. Guestrin, et al. Near-optimal sensor placements: maximizing information while minimizing communication cost, 2006, 2006 5th International Conference on Information Processing in Sensor Networks.

[15] Noam Nisan, et al. On the degree of Boolean functions as real polynomials, 1992, STOC '92.

[16] S. Safra, et al. On the hardness of approximating minimum vertex cover, 2005.

[17] Daniel Lehmann, et al. Combinatorial auctions with decreasing marginal utilities, 2001, EC '01.

[18] Assaf Naor, et al. Boolean functions whose Fourier transform is concentrated on the first two levels, 2002, Adv. Appl. Math.

[19] Jan Vondrák, et al. Optimal Bounds on Approximation of Submodular and XOS Functions by Juntas, 2013, 2013 IEEE 54th Annual Symposium on Foundations of Computer Science.

[20] Andreas Krause, et al. Near-optimal sensor placements in Gaussian processes, 2005, ICML.

[21] Jan Vondrák, et al. Covering minimum spanning trees of random subgraphs, 2004, SODA '04.

[22] Nathan Linial, et al. The influence of variables on Boolean functions, 1988, [Proceedings 1988] 29th Annual Symposium on Foundations of Computer Science.

[23] Nathan Linial, et al. Collective coin flipping, robust voting schemes and minima of Banzhaf values, 1985, 26th Annual Symposium on Foundations of Computer Science (sfcs 1985).

[24] Aaron Roth, et al. Privately releasing conjunctions and the statistical query barrier, 2010, STOC '11.

[25] Omer Reingold, et al. DNF Sparsification and a Faster Deterministic Counting Algorithm, 2012, Computational Complexity Conference.

[26] Rocco A. Servedio, et al. On learning monotone DNF under product distributions, 2001, Inf. Comput.

[27] Pravesh Kothari, et al. Learning Coverage Functions, 2013, ArXiv.

[28] S. Boucheron, et al. A sharp concentration inequality with applications, 1999, Random Struct. Algorithms.

[29] Ryan O'Donnell, et al. On the Fourier tails of bounded functions over the discrete cube, 2006, STOC '06.

[30] Tim Roughgarden, et al. Sketching valuation functions, 2012, SODA.

[31] László Lovász, et al. Submodular functions and convexity, 1982, ISMP.

[32] Ehud Friedgut, et al. Boolean Functions With Low Average Sensitivity Depend On Few Coordinates, 1998, Comb.