Incremental multi-view spectral clustering

Abstract Multi-view learning has attracted increasing attention in recent years, and the existing multi-view learning methods learn a consensus result by collecting all views. These methods have two obvious limitations. First, it is not scalable; with limited computational resources it would be difficult, if not impossible, to collect and process a large collection of views together. Second, in many applications views of data are available over time; it is infeasible to apply the existing multi-view learning methods to such streaming views. To address the two limitations, in this paper we propose a novel incremental multi-view spectral clustering (IMSC) method. In IMSC, instead of ensembling the collection of all views simultaneously, we integrate them one by one in an incremental way. We first learn an initial model from a small number of views; next when a new view is available, we need only use it to update the model and apply the updated model to learn a consensus result. This method is scalable and applicable to streaming views. To further reduce the time and space complexity, we apply low rank approximation by means of the well-known random Fourier features to construct the base kernels and do low rank SVD decompositions accordingly. The theoretical analysis and experimental results on benchmark data sets show that our incremental multi-view spectral clustering method is significantly faster in efficiency than the existing state-of-the-art non-incremental ones and is comparable or even better in clustering quality.

[1]  Jitendra Malik,et al.  Normalized Cuts and Image Segmentation , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Ishfaq Ahmad Multi-View Video: Get Ready for Next-Generation Television , 2007, IEEE Distributed Systems Online.

[3]  M. Brand,et al.  Fast low-rank modifications of the thin singular value decomposition , 2006 .

[4]  Stephen P. Boyd,et al.  Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers , 2011, Found. Trends Mach. Learn..

[5]  Xindong Wu,et al.  Subkilometer crater discovery with boosting and transfer learning , 2011, TIST.

[6]  S T Roweis,et al.  Nonlinear dimensionality reduction by locally linear embedding. , 2000, Science.

[7]  Inderjit S. Dhillon,et al.  Learning low-rank kernel matrices , 2006, ICML.

[8]  Shiliang Sun,et al.  Multi-view Laplacian Support Vector Machines , 2011, ADMA.

[9]  Yuan Xie,et al.  On Unifying Multi-view Self-Representations for Clustering by Tensor Multi-rank Minimization , 2016, International Journal of Computer Vision.

[10]  Feiping Nie,et al.  Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence Multi-View K-Means Clustering on Big Data , 2022 .

[11]  Jia Xu,et al.  Spectral Clustering with a Convex Regularizer on Millions of Images , 2014, ECCV.

[12]  Johan A. K. Suykens,et al.  Multiway Spectral Clustering with Out-of-Sample Extensions through Weighted Kernel PCA , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Xuelong Li,et al.  Auto-Weighted Multi-View Learning for Image Clustering and Semi-Supervised Classification , 2018, IEEE Transactions on Image Processing.

[14]  Xuelong Li,et al.  Multiview Clustering via Adaptively Weighted Procrustes , 2018, KDD.

[15]  Feiping Nie,et al.  Multiview Consensus Graph Clustering , 2019, IEEE Transactions on Image Processing.

[16]  Jianbo Shi,et al.  Multiclass spectral clustering , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[17]  Feiping Nie,et al.  Detecting Coherent Groups in Crowd Scenes by Multiview Clustering , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Qinghua Hu,et al.  Generalized Latent Multi-View Subspace Clustering , 2020, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[20]  Christoph H. Lampert,et al.  Learning to detect unseen object classes by between-class attribute transfer , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[21]  Zhenyue Zhang,et al.  Uniform Projection for Multi-View Learning , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Johan A. K. Suykens,et al.  Incremental kernel spectral clustering for online learning of non-stationary data , 2014, Neurocomputing.

[23]  Johan A. K. Suykens,et al.  Kernel spectral clustering for dynamic data using multiple kernel learning , 2013, The 2013 International Joint Conference on Neural Networks (IJCNN).

[24]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[25]  Nikolai F. Rulkov,et al.  On the performance of gas sensor arrays in open sampling systems using Inhibitory Support Vector Machines , 2013 .

[26]  Wotao Yin,et al.  A feasible method for optimization with orthogonality constraints , 2013, Math. Program..

[27]  James C. French,et al.  Integrating Multiple Multi-Channel CBIR Systems , 2003, Multimedia Information Systems.

[28]  Lei Wang,et al.  Multiple kernel extreme learning machine , 2015, Neurocomputing.

[29]  Robert P. W. Duin,et al.  Handwritten digit recognition by combined classifiers , 1998, Kybernetika.

[30]  Hal Daumé,et al.  Co-regularized Multi-view Spectral Clustering , 2011, NIPS.

[31]  Krista A. Ehinger,et al.  SUN database: Large-scale scene recognition from abbey to zoo , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[32]  Hamido Fujita,et al.  A study of graph-based system for multi-view clustering , 2019, Knowl. Based Syst..

[33]  Wotao Yin,et al.  A Curvilinear Search Method for p-Harmonic Flows on Spheres , 2009, SIAM J. Imaging Sci..

[34]  Hao Wang,et al.  Classification with Streaming Features: An Emerging-Pattern Mining Approach , 2015, TKDD.

[35]  Zhang Yi,et al.  A multitask multiview clustering algorithm in heterogeneous situations based on LLE and LE , 2019, Knowl. Based Syst..

[36]  Kaare Brandt Petersen,et al.  The Matrix Cookbook , 2006 .

[37]  Benjamin Recht,et al.  Random Features for Large-Scale Kernel Machines , 2007, NIPS.

[38]  Lei Shi,et al.  Recovery of Corrupted Multiple Kernels for Clustering , 2015, IJCAI.

[39]  Shiliang Sun,et al.  Hierarchical Multi-view Fisher Discriminant Analysis , 2009, ICONIP.