Spectral Approximate Inference

Given a graphical model (GM), computing its partition function is the most essential inference task, but it is computationally intractable in general. To address this, iterative approximation algorithms that exploit certain local structure or consistency of the GM are popular choices in practice. However, due to their local, iterative nature, they often output poor approximations or fail to converge, e.g., in low-temperature regimes (hard instances with large parameters). To overcome this limitation, we propose a novel approach that uses the global spectral features of the GM. Our contribution is two-fold: (a) we first propose a fully polynomial-time approximation scheme (FPTAS) for approximating the partition function of a GM associated with a low-rank coupling matrix; (b) for general high-rank GMs, we design a spectral mean-field scheme that uses (a) as a subroutine and approximates a high-rank GM by a product of rank-1 GMs, enabling an efficient approximation of the partition function. The proposed algorithm is more robust in its running time and accuracy than prior methods: it neither suffers from convergence issues nor depends on hard local structures, as demonstrated in our experiments.
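
To illustrate why a low-rank coupling matrix can make the partition function tractable, the following is a minimal toy sketch, not the FPTAS proposed in the paper. It assumes binary variables x in {-1, +1}^n, a rank-1 coupling A = lam * v v^T with integer-valued v, and the energy form theta.x + x^T A x = theta.x + lam * (v.x)^2; these modeling choices and the names `brute_force_Z` and `rank1_Z` are hypothetical and introduced only for illustration. Under these assumptions the sum over 2^n configurations collapses to a dynamic program over the possible values of v.x.

```python
import itertools
import math


def brute_force_Z(theta, v, lam):
    """Sum exp(theta.x + lam * (v.x)^2) over all x in {-1, +1}^n."""
    total = 0.0
    for x in itertools.product((-1, 1), repeat=len(theta)):
        field = sum(t * xi for t, xi in zip(theta, x))
        proj = sum(vi * xi for vi, xi in zip(v, x))
        total += math.exp(field + lam * proj * proj)
    return total


def rank1_Z(theta, v, lam):
    """Same quantity via dynamic programming over s = v.x.

    Assumes v is integer-valued, so s takes at most 2 * sum(|v_i|) + 1
    distinct values (an illustrative assumption, not from the paper).
    """
    # weights[s] = sum of exp(theta.x) over partial assignments with v.x == s
    weights = {0: 1.0}
    for t, vi in zip(theta, v):
        nxt = {}
        for s, w in weights.items():
            for xi in (-1, 1):
                key = s + vi * xi
                nxt[key] = nxt.get(key, 0.0) + w * math.exp(t * xi)
        weights = nxt
    # The rank-1 coupling depends on x only through s, so apply it at the end.
    return sum(w * math.exp(lam * s * s) for s, w in weights.items())


if __name__ == "__main__":
    theta = [0.3, -0.2, 0.1, 0.5]
    v = [1, 2, -1, 3]
    lam = 0.05
    print(brute_force_Z(theta, v, lam), rank1_Z(theta, v, lam))
```

The dynamic program visits a number of states that grows with sum(|v_i|) rather than with 2^n. For real-valued v one would need some form of quantization, which introduces an approximation error that this sketch does not analyze.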
