Variational Inference in Mixed Probabilistic Submodular Models

We consider the problem of variational inference in probabilistic models with both log-submodular and log-supermodular higher-order potentials. These models can represent arbitrary distributions over binary variables, and thus generalize the commonly used pairwise Markov random fields and models with log-supermodular potentials only, for which efficient approximate inference algorithms are known. While inference in the considered models is #P-hard in general, we present efficient approximate algorithms exploiting recent advances in the field of discrete optimization. We demonstrate the effectiveness of our approach in a large set of experiments, where our model allows reasoning about preferences over sets of items with complements and substitutes.
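As a point of reference, the abstract does not spell out the model form; a standard way to write such a mixed probabilistic submodular model, with notation chosen here for illustration rather than taken from the paper, is as a Gibbs distribution over subsets S of a ground set V whose energy is a difference of two submodular set functions F and G, so that exp(F) contributes the log-submodular part and exp(-G) the log-supermodular part:

\[
p(S) \;=\; \frac{1}{Z}\,\exp\bigl(F(S) - G(S)\bigr), \qquad S \subseteq V,
\qquad
Z \;=\; \sum_{S \subseteq V} \exp\bigl(F(S) - G(S)\bigr),
\]

where F and G are both submodular. Evaluating the partition function Z requires summing over all 2^{|V|} subsets, which is why exact inference in these models is #P-hard in general and variational approximations are of interest.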
