Convergence and Energy Landscape for Cheeger Cut Clustering

This paper provides both theoretical and algorithmic results for the l1-relaxation of the Cheeger cut problem. The l2 -relaxation, known as spectral clustering, only loosely relates to the Cheeger cut; however, it is convex and leads to a simple optimization problem. The l1-relaxation, in contrast, is non-convex but is provably equivalent to the original problem. The l1-relaxation therefore trades convexity for exactness, yielding improved clustering results at the cost of a more challenging optimization. The first challenge is understanding convergence of algorithms. This paper provides the first complete proof of convergence for algorithms that minimize the l1-relaxation. The second challenge entails comprehending the l1-energy landscape, i.e. the set of possible points to which an algorithm might converge. We show that l1-algorithms can get trapped in local minima that are not globally optimal and we provide a classification theorem to interpret these local minima. This classification gives meaning to these suboptimal solutions and helps to explain, in terms of graph structure, when the l1-relaxation provides the solution of the original Cheeger cut problem.

[1]  Robert R. Meyer,et al.  Sufficient Conditions for the Convergence of Monotonic Mathematical Programming Algorithms , 1976, J. Comput. Syst. Sci..

[2]  Fan Chung,et al.  Spectral Graph Theory , 1996 .

[3]  Matthias Hein,et al.  An Inverse Power Method for Nonlinear Eigenproblems with Applications in 1-Spectral Clustering and Sparse PCA , 2010, NIPS.

[4]  A. Ostrowski Solution of equations in Euclidean and Banach spaces , 1973 .

[5]  Gilbert Strang,et al.  Maximal flow through a domain , 1983, Math. Program..

[6]  Xavier Bresson,et al.  Multi-class Transductive Learning Based on ℓ1 Relaxations of Cheeger Cut and Mumford-Shah-Potts Model , 2013, Journal of Mathematical Imaging and Vision.

[7]  Antonin Chambolle,et al.  A First-Order Primal-Dual Algorithm for Convex Problems with Applications to Imaging , 2011, Journal of Mathematical Imaging and Vision.

[8]  Arthur D. Szlam,et al.  Total variation and cheeger cuts , 2010, ICML 2010.

[9]  J. Cheeger A lower bound for the smallest eigenvalue of the Laplacian , 1969 .

[10]  Pietro Perona,et al.  Self-Tuning Spectral Clustering , 2004, NIPS.

[11]  Matthias Hein,et al.  Beyond Spectral Clustering - Tight Relaxations of Balanced Graph Cuts , 2011, NIPS.

[12]  Matthias Hein,et al.  Constrained 1-Spectral Clustering , 2012, AISTATS.

[13]  Matthias Hein,et al.  Spectral clustering based on the graph p-Laplacian , 2009, ICML '09.

[14]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.