Submodularity Cuts and Applications

Several key problems in machine learning, such as feature selection and active learning, can be formulated as submodular set function maximization. We present herein a novel algorithm for maximizing a submodular set function under a cardinality constraint — the algorithm is based on a cutting-plane method and is implemented as an iterative small-scale binary-integer linear programming procedure. It is well known that this problem is NP-hard, and the approximation factor achieved by the greedy algorithm is the theoretical limit for polynomial time. As for (non-polynomial time) exact algorithms that perform reasonably in practice, there has been very little in the literature although the problem is quite important for many applications. Our algorithm is guaranteed to find the exact solution finitely many iterations, and it converges fast in practice due to the efficiency of the cutting-plane mechanism. Moreover, we also provide a method that produces successively decreasing upper-bounds of the optimal solution, while our algorithm provides successively increasing lower-bounds. Thus, the accuracy of the current solution can be estimated at any point, and the algorithm can be stopped early once a desired degree of tolerance is met. We evaluate our algorithm on sensor placement and feature selection applications showing good performance.

[1]  Marcus Porembski Finitely convergent cutting planes for concave minimization , 2001, J. Glob. Optim..

[2]  Philip S. Yu,et al.  Near-optimal Supervised Feature Selection among Frequent Subgraphs , 2009, SDM.

[3]  G. Nemhauser,et al.  Maximizing Submodular Set Functions: Formulations and Analysis of Algorithms* , 1981 .

[4]  Abhimanyu Das,et al.  Algorithms for subset selection in linear regression , 2008, STOC.

[5]  KrauseAndreas,et al.  Near-Optimal Sensor Placements in Gaussian Processes: Theory, Efficient Algorithms and Empirical Studies , 2008 .

[6]  H. B. McMahan,et al.  Robust Submodular Observation Selection , 2008 .

[7]  Kazuo Murota,et al.  Discrete convex analysis , 1998, Math. Program..

[8]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[9]  G. Nemhauser,et al.  Maximizing a submodular function by integer programming: Polyhedral results for the quadratic case☆ , 1996 .

[10]  George Forman,et al.  An Extensive Empirical Study of Feature Selection Metrics for Text Classification , 2003, J. Mach. Learn. Res..

[11]  Maxim Sviridenko,et al.  A note on maximizing a submodular set function subject to a knapsack constraint , 2004, Oper. Res. Lett..

[12]  Laurence A. Wolsey,et al.  Integer and Combinatorial Optimization , 1988 .

[13]  Fred W. Glover,et al.  Convexity Cuts and Cut Search , 1973, Oper. Res..

[14]  M. L. Fisher,et al.  An analysis of approximations for maximizing submodular set functions—I , 1978, Math. Program..

[15]  Andreas Krause,et al.  Near-Optimal Sensor Placements in Gaussian Processes: Theory, Efficient Algorithms and Empirical Studies , 2008, J. Mach. Learn. Res..

[16]  László Lovász,et al.  Submodular functions and convexity , 1982, ISMP.

[17]  Boris Goldengorin Maximization of submodular functions: Theory and enumeration algorithms , 2009, Eur. J. Oper. Res..

[18]  James Theiler,et al.  Grafting: Fast, Incremental Feature Selection by Gradient Descent in Function Space , 2003, J. Mach. Learn. Res..

[19]  Fred W. Glover Polyhedral convexity cuts and negative edge extensions , 1974, Z. Oper. Research.

[20]  Satoru Fujishige,et al.  Submodular functions and optimization , 1991 .

[21]  Jack Edmonds,et al.  Submodular Functions, Matroids, and Certain Polyhedra , 2001, Combinatorial Optimization.

[22]  R. Horst,et al.  Global Optimization: Deterministic Approaches , 1992 .

[23]  Rong Jin,et al.  Batch mode active learning and its application to medical image classification , 2006, ICML.

[24]  Andreas Krause,et al.  Cost-effective outbreak detection in networks , 2007, KDD '07.

[25]  M. Stealey,et al.  High Resolution River Hydraulic and Water Quality Characterization Using Rapidly Deployable Networked Infomechanical Systems (NIMS RD) , 2007 .

[26]  Aimo A. Törn,et al.  Global Optimization , 1999, Science.