Robust Convex Clustering with Spectral Analysis-based Feature Selection

Clustering is a fundamental problem in many scientific applications. Traditional methods, such as k-means, Gaussian mixture models, and hierarchical clustering, however, are beset by local minima, which are sometimes drastically suboptimal. Furthermore recently introduced convex relaxations of methods average the weight of cluster assignment, which may lead performance deterioration. To address these issues, this paper presents a novel approach for robust convex clustering. In contrast to previously considered algorithms, the formulation utilize spectral analysis-based feature selection for alternating between minimization algorithm and multipliers. Rather than focusing on local features and their consistencies, our method aims at extracting sufficient information about the structure of the target concept. The experimental results demonstrate the effectiveness of our method in a variety of contexts on a real-world dataset.

[1]  Tie Liu,et al.  Convex clustering with metric learning , 2018, Pattern Recognit..

[2]  Eric C. Chi,et al.  Splitting Methods for Convex Clustering , 2013, Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America.

[3]  Nicu Sebe,et al.  Web Image Annotation Via Subspace-Sparsity Collaborated Feature Selection , 2012, IEEE Transactions on Multimedia.

[4]  Zhongmin Cai,et al.  Learning edge weights in file co-occurrence graphs for malware detection , 2018, Data Mining and Knowledge Discovery.

[5]  Yong Luo,et al.  Vector-Valued Multi-View Semi-Supervsed Learning for Multi-Label Image Classification , 2013, AAAI.

[7]  J. Suykens,et al.  Convex Clustering Shrinkage , 2005 .

[8]  L. Ljung,et al.  Clustering using sum-of-norms regularization: With application to particle filter output computation , 2011, 2011 IEEE Statistical Signal Processing Workshop (SSP).

[9]  Zenglin Xu,et al.  Discriminative Semi-Supervised Feature Selection Via Manifold Regularization , 2009, IEEE Transactions on Neural Networks.

[10]  LiuWei,et al.  Semi-supervised distance metric learning for collaborative image retrieval and clustering , 2010 .

[11]  Jiayu Zhou,et al.  Robust Convex Clustering Analysis , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[12]  Nicu Sebe,et al.  Discriminating Joint Feature Analysis for Multimedia Data Understanding , 2012, IEEE Transactions on Multimedia.

[13]  Jitendra Malik,et al.  Normalized cuts and image segmentation , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[14]  Francis R. Bach,et al.  Clusterpath: an Algorithm for Clustering using Convex Fusion Penalties , 2011, ICML.

[15]  Fionn Murtagh,et al.  Pattern Classification, by Richard O. Duda, Peter E. Hart, and David G. Stork , 2001, J. Classif..

[16]  Stephen P. Boyd,et al.  Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[17]  Huan Liu,et al.  Semi-supervised Feature Selection via Spectral Analysis , 2007, SDM.