Isotropy and Log-Concave Polynomials: Accelerated Sampling and High-Precision Counting of Matroid Bases

We define a notion of isotropy for discrete set distributions. If <tex>$\mu$</tex> is a distribution over subsets <tex>$\mathcal{S}$</tex> of a ground set [<tex>$n$</tex>], we say that <tex>$\mu$</tex> is in isotropic position if <tex>$\mathbb{P}_{\mathcal{S}\sim\mu}[e\in \mathcal{S}]$</tex> is the same for all <tex>$e\in[n]$</tex>. We design a new approximate sampling algorithm that leverages isotropy for the class of distributions <tex>$\mu$</tex> that have a log-concave generating polynomial; this class includes determinantal point processes, strongly Rayleigh distributions, and uniform distributions over matroid bases. We show that when <tex>$\mu$</tex> is in approximately isotropic position, the running time of our algorithm depends polynomially on the size of the set <tex>$\mathcal{S}$</tex>, and only logarithmically on <tex>$n$</tex>. When <tex>$n$</tex> is much larger than the size of <tex>$\mathcal{S}$</tex>, this is significantly faster than prior algorithms, and can even be sublinear in <tex>$n$</tex>. We then show how to transform a non-isotropic <tex>$\mu$</tex> into an equivalent approximately isotropic form with a polynomial-time pre-processing step, accelerating subsequent sampling times. The main new ingredient enabling our algorithms is a class of negative dependence inequalities that may be of independent interest. As an application of our results, we show how to approximately count bases of a matroid of rank <tex>$k$</tex> over a ground set of <tex>$n$</tex> elements to within a factor of <tex>$1+\epsilon$</tex> in time <tex>$O((n+1/\epsilon^{2}) \cdot \text{poly}(k,\log n))$</tex>. This is the first algorithm that runs in nearly linear time for fixed rank <tex>$k$</tex>, and achieves an inverse polynomially low approximation error. The full version of this paper is available at: https://arxiv.org/abs/2004.09079

[1]  T. Liggett,et al.  Negative dependence and the geometry of polynomials , 2007, 0707.2340.

[2]  Yuval Peres,et al.  Concentration of Lipschitz Functionals of Determinantal and Other Strong Rayleigh Measures , 2011, Combinatorics, Probability and Computing.

[3]  Peter J. Cameron Notes on matroids and codes , 2000 .

[4]  Santosh S. Vempala,et al.  Recent Progress and Open Problems in Algorithmic Convex Geometry , 2010, FSTTCS.

[5]  Jack E. Graver,et al.  Rigidity Matroids , 1991, SIAM J. Discret. Math..

[6]  Botong Wang,et al.  Correlation bounds for fields and matroids , 2018, Journal of the European Mathematical Society.

[7]  Manfred K. Warmuth,et al.  Leveraged volume sampling for linear regression , 2018, NeurIPS.

[8]  Shayan Oveis Gharan,et al.  A Polynomial Time MCMC Method for Sampling from Continuous DPPs , 2018, ArXiv.

[9]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[10]  Irit Dinur,et al.  High Dimensional Expanders Imply Agreement Expanders , 2017, 2017 IEEE 58th Annual Symposium on Foundations of Computer Science (FOCS).

[11]  Leonid Gurvits,et al.  On multivariate Newton-like inequalities , 2008, 0812.3687.

[12]  Manfred K. Warmuth,et al.  Correcting the bias in least squares regression with volume-rescaled sampling , 2018, AISTATS.

[13]  Aaron Schild,et al.  An almost-linear time algorithm for uniform random spanning tree generation , 2017, STOC.

[14]  June Huh,et al.  Lorentzian polynomials , 2019, Annals of Mathematics.

[15]  Alain Guénoche Random Spanning Tree , 1983, J. Algorithms.

[16]  Mark Jerrum,et al.  Approximately counting bases of bicircular matroids , 2018, Combinatorics, Probability and Computing.

[17]  Santosh S. Vempala,et al.  The Kannan-Lov\'asz-Simonovits Conjecture. , 2018, 1807.03465.

[18]  Suvrit Sra,et al.  Exponentiated Strongly Rayleigh Distributions , 2018, NeurIPS.

[19]  Rémi Bardenet,et al.  On a few statistical applications of determinantal point processes , 2017 .

[20]  Leslie G. Valiant,et al.  Random Generation of Combinatorial Structures from a Uniform Distribution , 1986, Theor. Comput. Sci..

[21]  Tali Kaufman,et al.  High Dimensional Random Walks and Colorful Expansion , 2016, ITCS.

[22]  Nima Anari,et al.  Log-Concave Polynomials IV: Exchange Properties, Tight Mixing Times, and Faster Sampling of Spanning Trees , 2020, ArXiv.

[23]  Nima Anari,et al.  Spectral Independence in High-Dimensional Expanders and Applications to the Hardcore Model , 2020, 2020 IEEE 61st Annual Symposium on Foundations of Computer Science (FOCS).

[24]  Nisheeth K. Vishnoi,et al.  Faster Polytope Rounding, Sampling, and Volume Computation via a Sub-Linear Ball Walk , 2019, 2019 IEEE 60th Annual Symposium on Foundations of Computer Science (FOCS).

[25]  Ashish Goel,et al.  Connectivity in Random Forests and Credit Networks , 2015, SODA.

[26]  Michal Derezinski,et al.  Fast determinantal point processes via distortion-free intermediate sampling , 2018, COLT.

[27]  Santosh S. Vempala,et al.  The Kannan-Lovász-Simonovits Conjecture , 2018, ArXiv.

[28]  Nima Anari,et al.  Log-Concave Polynomials, Entropy, and a Deterministic Approximation Algorithm for Counting Bases of Matroids , 2018, 2018 IEEE 59th Annual Symposium on Foundations of Computer Science (FOCS).

[29]  Heng Guo,et al.  Modified log-Sobolev Inequalities for Strongly Log-Concave Distributions , 2019, 2019 IEEE 60th Annual Symposium on Foundations of Computer Science (FOCS).

[30]  Mark Jerrum,et al.  Spectral gap and log-Sobolev constant for balanced matroids , 2002, The 43rd Annual IEEE Symposium on Foundations of Computer Science, 2002. Proceedings..

[31]  Tomás Feder,et al.  Balanced matroids , 1992, STOC '92.

[32]  Justin Salez,et al.  Modified log-Sobolev inequalities for strong-Rayleigh measures , 2019, The Annals of Applied Probability.

[33]  V. Climenhaga Markov chains and mixing times , 2013 .

[34]  Ben Taskar,et al.  Approximate Inference in Continuous Determinantal Processes , 2013, NIPS.

[35]  Santosh S. Vempala,et al.  Simulated annealing in convex bodies and an O*(n4) volume algorithm , 2006, J. Comput. Syst. Sci..

[36]  Ben Taskar,et al.  Determinantal Point Processes for Machine Learning , 2012, Found. Trends Mach. Learn..

[37]  Charles J. Colbourn,et al.  Reliability Polynomials: A Survey , 1998 .

[38]  S. Vempala Geometric Random Walks: a Survey , 2007 .

[39]  Tali Kaufman,et al.  High Order Random Walks: Beyond Spectral Gap , 2017, APPROX-RANDOM.

[40]  Nima Anari,et al.  Log-concave polynomials II: high-dimensional walks and an FPRAS for counting bases of a matroid , 2018, STOC.

[41]  Daniele Calandriello,et al.  Exact sampling of determinantal point processes with sublinear time preprocessing , 2019, NeurIPS.