论文信息 - Optimal DR-Submodular Maximization and Applications to Provable Mean Field Inference

Optimal DR-Submodular Maximization and Applications to Provable Mean Field Inference

Mean field inference in probabilistic models is generally a highly nonconvex problem. Existing optimization methods, e.g., coordinate ascent algorithms, can only generate local optima. In this work we propose provable mean filed methods for probabilistic log-submodular models and its posterior agreement (PA) with strong approximation guarantees. The main algorithmic technique is a new Double Greedy scheme, termed DR-DoubleGreedy, for continuous DR-submodular maximization with box-constraints. It is a one-pass algorithm with linear time complexity, reaching the optimal 1/2 approximation ratio, which may be of independent interest. We validate the superior performance of our algorithms against baseline algorithms on both synthetic and real-world datasets.

[1] Joseph Naor,et al. A Tight Linear Time (1/2)-Approximation for Unconstrained Submodular Maximization , 2012, 2012 IEEE 53rd Annual Symposium on Foundations of Computer Science.

[2] Ben Taskar,et al. Expectation-Maximization for Learning Determinantal Point Processes , 2014, NIPS.

[3] Vibhav Vineet,et al. Conditional Random Fields as Recurrent Neural Networks , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[4] Yuichi Yoshida,et al. Non-Monotone DR-Submodular Function Maximization , 2016, AAAI.

[5] Andreas Krause,et al. Guarantees for Greedy Maximization of Non-submodular Functions with Applications , 2017, ICML.

[6] Michael I. Jordan,et al. Graphical Models, Exponential Families, and Variational Inference , 2008, Found. Trends Mach. Learn..

[7] Andreas Krause,et al. Submodular Function Maximization , 2014, Tractability.

[8] Andreas Krause,et al. From MAP to Marginals: Variational Inference in Bayesian Submodular Models , 2014, NIPS.

[9] Joachim M. Buhmann,et al. Information-theoretic analysis of MaxCut algorithms , 2016, 2016 Information Theory and Applications Workshop (ITA).

[10] Andreas Krause,et al. Near-optimal Nonmyopic Value of Information in Graphical Models , 2005, UAI.

[11] Amin Karbasi,et al. Online Continuous Submodular Maximization , 2018, AISTATS.

[12] Ben Taskar,et al. Near-Optimal MAP Inference for Determinantal Point Processes , 2012, NIPS.

[13] Jan Vondrák,et al. On Multiplicative Weight Updates for Concave and Submodular Function Maximization , 2015, ITCS.

[14] Stefan Bauer,et al. Model Selection for Gaussian Process Regression , 2017, GCPR.

[15] Britta Peis,et al. Submodular Function Maximization on the Bounded Integer Lattice , 2015, WAOA.

[16] Huy L. Nguyen,et al. A Reduction for Optimizing Lattice Submodular Functions with Diminishing Returns , 2016, ArXiv.

[17] Vahab S. Mirrokni,et al. Maximizing Non-Monotone Submodular Functions , 2011, 48th Annual IEEE Symposium on Foundations of Computer Science (FOCS'07).

[18] Joachim M. Buhmann,et al. How informative are Minimum Spanning Tree algorithms? , 2014, 2014 IEEE International Symposium on Information Theory.

[19] Stefanie Jegelka,et al. Robust Budget Allocation Via Continuous Submodular Functions , 2017, Applied Mathematics & Optimization.

[20] Hui Lin,et al. Optimal Selection of Limited Vocabulary Speech Corpora , 2011, INTERSPEECH.

[21] Jan Vondrák,et al. Maximizing a Submodular Set Function Subject to a Matroid Constraint (Extended Abstract) , 2007, IPCO.

[22] Hui Lin,et al. A Class of Submodular Functions for Document Summarization , 2011, ACL.

[23] Bryan Wilder,et al. Risk-Sensitive Submodular Optimization , 2018, AAAI.

[24] Tim Roughgarden,et al. Optimal Algorithms for Continuous Non-monotone Submodular and DR-Submodular Maximization , 2018, NeurIPS.

[25] Andreas Krause,et al. Variational Inference in Mixed Probabilistic Submodular Models , 2016, NIPS.

[26] Andreas Krause,et al. Continuous DR-submodular Maximization: Structure and Algorithms , 2017, NIPS 2017.

[27] W. Hoeffding. Probability Inequalities for sums of Bounded Random Variables , 1963 .

[28] Andreas Krause,et al. Submodular Dictionary Selection for Sparse Representation , 2010, ICML.

[29] Joachim M. Buhmann. Information theoretic model validation for clustering , 2010, 2010 IEEE International Symposium on Information Theory.

[30] László Lovász,et al. Submodular functions and convexity , 1982, ISMP.

[31] Rishabh K. Iyer,et al. Submodular Point Processes with Applications to Machine learning , 2015, AISTATS.

[32] Amin Karbasi,et al. Decentralized Submodular Maximization: Bridging Discrete and Continuous Settings , 2018, ICML.

[33] J. Vondrák,et al. Submodular Function Maximization via the Multilinear Relaxation and Contention Resolution Schemes , 2014 .

[34] Andreas Krause,et al. Learning Probabilistic Submodular Diversity Models Via Noise Contrastive Estimation , 2016, AISTATS.

[35] Joachim M. Buhmann,et al. Posterior agreement for large parameter-rich optimization problems , 2018, Theor. Comput. Sci..

[36] Ben Taskar,et al. Determinantal Point Processes for Machine Learning , 2012, Found. Trends Mach. Learn..

[37] Jan Vondrák,et al. Optimal approximation for the submodular welfare problem in the value oracle model , 2008, STOC.

[38] Andreas Krause,et al. Guaranteed Non-convex Optimization: Submodular Maximization over Continuous Domains , 2016, AISTATS.

[39] Andreas Krause,et al. Scalable Variational Inference in Log-supermodular Models , 2015, ICML.

[40] Francis Bach,et al. Submodular functions: from discrete to continuous domains , 2015, Mathematical Programming.