Abstract Algebraic Subspace Clustering

Subspace clustering is the problem of clustering data that lie close to a union of linear subspaces. In this paper, we consider an abstract version of this problem, where one is given a set of points lying in general position inside the algebraic variety of a union of subspaces, and the objective is to decompose the underlying variety into its constituent subspaces. Prior algebraic-geometric approaches to this problem require the subspaces to be of equal dimension, or the number of subspaces to be known. While an algorithm addressing the general case of an unknown number of subspaces of possibly different dimensions had been proposed, a proof for its correctness had not been given. In this paper, we propose a provably correct algorithm for addressing the general case. Our algorithm uses the gradient of a vanishing polynomial at a point in the variety to find a hyperplane containing the subspace passing through that point. By intersecting the variety with this hyperplane and recursively applying the procedure until no non-zero vanishing polynomial exists, our algorithm identifies the subspace containing that point. By repeating this procedure for other points, our algorithm eventually identifies all the subspaces by returning a basis for their orthogonal complement. 1. Introduction. Given a set of points drawn from a union of linear subspaces, subspace clustering refers to the problem of identifying the number of subspaces, their dimensions, a basis for each subspace, and the clustering of the data points according to their membership to the subspaces. This is an important problem with widespread applications in computer vision [32], systems theory [20] and genomics [15].

[1]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[2]  Ieee Xplore,et al.  IEEE Transactions on Pattern Analysis and Machine Intelligence Information for Authors , 2022, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Harm Derksen Hilbert series of subspace arrangements , 2005 .

[4]  Emmanuel J. Candès,et al.  A Geometric Analysis of Subspace Clustering with Outliers , 2011, ArXiv.

[5]  D. Eisenbud Commutative Algebra: with a View Toward Algebraic Geometry , 1995 .

[6]  Marc Pollefeys,et al.  A General Framework for Motion Segmentation: Independent, Articulated, Rigid, Non-rigid, Degenerate and Non-degenerate , 2006, ECCV.

[7]  Huan Xu,et al.  Noisy Sparse Subspace Clustering , 2013, J. Mach. Learn. Res..

[8]  René Vidal,et al.  Abstract algebraic-geometric subspace clustering , 2014, 2014 48th Asilomar Conference on Signals, Systems and Computers.

[9]  Yong Yu,et al.  Robust Subspace Segmentation by Low-Rank Representation , 2010, ICML.

[10]  M. Anthony,et al.  Advanced linear algebra , 2006 .

[11]  D. Kirby,et al.  COMMUTATIVE RING THEORY (Cambridge Studies in Advanced Mathematics 8) , 1988 .

[12]  J. Sidman,et al.  A sharp bound for the Castelnuovo–Mumford regularity of subspace arrangements , 2001, math/0109035.

[13]  Aidong Zhang,et al.  Cluster analysis for gene expression data: a survey , 2004, IEEE Transactions on Knowledge and Data Engineering.

[14]  Shuicheng Yan,et al.  Robust and Efficient Subspace Segmentation via Least Squares Regression , 2012, ECCV.

[15]  René Vidal,et al.  Identification of Deterministic Switched ARX Systems via Identification of Algebraic Varieties , 2005, HSCC.

[16]  Allen Y. Yang,et al.  Estimation of Subspace Arrangements with Applications in Modeling and Segmenting Mixed Data , 2008, SIAM Rev..

[17]  René Vidal,et al.  Clustering disjoint subspaces via sparse representation , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[18]  Hans-Peter Kriegel,et al.  Subspace clustering , 2012, WIREs Data Mining Knowl. Discov..

[19]  René Vidal,et al.  A new GPCA algorithm for clustering subspaces by fitting, differentiating and dividing polynomials , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[20]  René Vidal,et al.  A closed form solution to robust subspace estimation and clustering , 2011, CVPR 2011.

[21]  René Vidal,et al.  Sparse Subspace Clustering: Algorithm, Theory, and Applications , 2012, IEEE transactions on pattern analysis and machine intelligence.

[22]  S. Shankar Sastry,et al.  Generalized principal component analysis (GPCA) , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Kun Huang,et al.  Minimum effective dimension for mixtures of subspaces: a robust GPCA algorithm and its applications , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[24]  Christopher M. Bishop,et al.  Mixtures of Probabilistic Principal Component Analyzers , 1999, Neural Computation.

[25]  Guangliang Chen,et al.  Spectral Curvature Clustering (SCC) , 2009, International Journal of Computer Vision.

[26]  P. Tseng Nearest q-Flat to m Points , 2000 .

[27]  Yong Yu,et al.  Robust Recovery of Subspace Structures by Low-Rank Representation , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  Emmanuel J. Candès,et al.  Robust Subspace Clustering , 2013, ArXiv.

[29]  Ehsan Elhamifar,et al.  Sparse subspace clustering , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[30]  Michael Francis Atiyah,et al.  Introduction to commutative algebra , 1969 .

[31]  Aldo Conca,et al.  Castelnuovo-Mumford regularity of products of ideals , 2002 .

[32]  Yair Weiss,et al.  Multibody factorization with uncertainty and missing data using the EM algorithm , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..